
In progress
Published
Paid on delivery
The development of an automated web scraping system is required to extract event information from multiple websites. The system must process data from a list of 34 venues and bars, whose information is contained in a provided Excel file. The goal is to automatically generate a weekly event sheet in Google Sheets.

The developer will be responsible for:
- Conducting a thorough technical analysis of each of the 34 sources to determine their scraping feasibility and the most appropriate method. It is anticipated that not all sources will have an optimal structure for automated scraping.
- Implementing scraping for all technically feasible sources.
- Managing the heterogeneity of the websites, including proprietary websites, WordPress-based sites, and ticketing platforms.
- Developing solutions for scraping dynamic content (generated with JavaScript) from sources that require it.

It is important to note that Instagram scraping is not included in the scope of this phase of the project.

The data to be extracted for each event, when available, includes:
- Event Name
- Date
- Time
- Venue/Bar
- Ticket Price
- Description
- Performing Musicians (first and last name separated by commas in a single cell)
- Link to the event
- Source URL

Technical and Functional Requirements:
- Standardized formats for dates (yyyy-mm-dd) and times (hh:mm).
- Implementation of a basic mechanism for duplicate removal, based on the combination of event name, date, and venue.
- Automatic generation of a weekly event sheet in Google Sheets, covering the period from Monday to Sunday.
- Use of the Google Sheets API for interaction with the spreadsheets.
- The system must be implemented and run on a Google Cloud (GCP) virtual machine, configured to allow manual on/off operation by the client.
- The source code must be documented, modular, and designed to be scalable. Clear and concise documentation on the use and maintenance of the system must be provided.
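The format and duplicate-removal requirements above could be sketched roughly as follows. This is only an illustration in Python using the standard library: the input formats in `DATE_FORMATS` and `TIME_FORMATS`, the day-first reading of numeric dates, and the dictionary keys `name`, `date`, and `venue` are assumptions, since the actual formats depend on the 34 sources.

```python
from datetime import datetime

# Assumed input formats (hypothetical); numeric dates are read day-first.
DATE_FORMATS = ("%Y-%m-%d", "%d/%m/%Y", "%d-%m-%Y")
TIME_FORMATS = ("%H:%M", "%H.%M", "%I:%M %p", "%I %p", "%Hh%M")


def _parse(raw, formats):
    """Try each format in order; return a datetime or None."""
    text = " ".join(raw.split())  # collapse stray whitespace
    for fmt in formats:
        try:
            return datetime.strptime(text, fmt)
        except ValueError:
            continue
    return None


def normalize_date(raw):
    """Return the date as yyyy-mm-dd, or "" when it cannot be parsed."""
    parsed = _parse(raw, DATE_FORMATS)
    return parsed.strftime("%Y-%m-%d") if parsed else ""


def normalize_time(raw):
    """Return the time as hh:mm (24-hour), or "" when it cannot be parsed."""
    parsed = _parse(raw, TIME_FORMATS)
    return parsed.strftime("%H:%M") if parsed else ""


def dedupe(events):
    """Keep the first event seen for each (name, date, venue) combination."""
    seen = set()
    unique = []
    for event in events:
        key = (
            event["name"].strip().casefold(),
            event["date"],
            event["venue"].strip().casefold(),
        )
        if key not in seen:
            seen.add(key)
            unique.append(event)
    return unique
```

Returning an empty string for unparseable values keeps the row in the sheet so it can be checked by hand; comparing names and venues case-insensitively is one possible reading of "basic" duplicate removal.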
The proposal should include information on the technologies to be used, the proposed methodology for evaluating the feasibility of scraping, prior experience with complex scraping projects, and an estimate of maintenance time and costs.
Project ID: 40274212
40 proposals
Remote project
Active 6 days ago

Hi, I can build a scalable scraping system that pulls event data from all feasible 34 venues, handles static and dynamic sites (including JS-heavy ones), removes duplicates, standardizes date/time formats, and automatically generates a Monday–Sunday weekly sheet in Google Sheets via the API. It will run on a GCP VM, be modular and documented, and easy for you to turn on/off. Clean, reliable, and built for long-term use. Let’s start with the source review.
$18 USD in 1 day
0.6
40 freelancers are bidding an average of $57 USD for this job

I will build a modular Python-based scraping system (Scrapy + Playwright) to process 34 venues, extract standardized event data, deduplicate records, and auto-generate a weekly (Mon–Sun) Google Sheet via Google Sheets API. Deployed on GCP VM with logging, error handling, and documentation. Includes feasibility audit, dynamic-site handling, and scalable architecture. Maintenance: ~3–5 hrs/month.
$250 USD in 5 days
7.3

With over 15 years of experience, I'm a highly skilled and detail-oriented professional with demonstrated expertise in web scraping and automation. I'm confident that I have the skills and the technical knowledge to handle your Automated Web Scraping System for Event Info project. Throughout my career, I have successfully developed similar systems, providing clean, organized, and reliable data extraction in desired formats (CSV, Excel, JSON or database-ready). I'm well-versed in Python (Selenium, BeautifulSoup, Scrapy, Requests), which will be essential in dealing with the heterogeneity of website structures, varied platforms like proprietary websites, WordPress-based sites, and ticketing platforms. Furthermore, this experience has made me comfortable tackling dynamic content generated with JavaScript as well. Additionally, I have extensive skills in Google Sheets API that will ensure efficient data transfer from my system to your Weekly Event Sheets. With my profound experience and meticulous approach to work, I'm confident that not only will you receive the accurate and complete event information you seek but also a highly maintainable and scalable system designed to facilitate your future needs.
$30 USD in 7 days
6.9

Hello, I will create a PHP script to automate your task. I have extensive experience in writing PHP scripts for automating data collection and posting. Please see my reviews for reference.
$500 USD in 2 days
6.5

I understand the nuances and complexities involved in a project like an automated web scraping system for event information, and I have the expertise needed to successfully deliver on this project. With over 8 years of experience as a Python Developer, I have become intimately familiar with the tools, frameworks, and strategies that make automated data scraping systems successful. So if you're ready to transform your data collection process with an efficient automated system, or to examine reliable patterns in events that can inform business decisions, let's get in touch. Your satisfaction is my prime goal!
$20 USD in 1 day
5.8

Hi, there, I have extensive experience in web scraping and data extraction, making me well-equipped to handle your project. ✅ Leveraging Python and advanced scraping techniques, I will conduct a detailed analysis of the 34 sources to assess scraping feasibility and implement automated scraping. ✅ I will address the heterogeneity of websites, including proprietary sites and ticketing platforms, ensuring optimal data extraction. ✅ Implementing solutions for scraping dynamic content from JavaScript-based sources is a key focus. ✅ Using standardized formats and an automated duplicate removal mechanism, I will generate a weekly event sheet in Google Sheets, interfacing with the Google Sheets API for seamless interaction. ✅ The system will be deployed on a Google Cloud virtual machine, with modular, scalable, and well-documented code for easy maintenance. I look forward to working with you. Best Regards, Brayan
$30 USD in 1 day
5.4

Hi there, I have extensive experience in data extraction and I can certainly build the scraping system you need. I noticed some of the sites are from Argentina; if you speak Spanish, I am a native speaker as well. To get straight to the point: I can develop this using the "king" of scraping, Python. I would use the Selenium library depending on the feasibility of each site, though I always opt for the simplest, most lightweight method whenever possible. Regarding the methodology, I will personally analyze each site to identify the most efficient entry point for data extraction. As for experience, I have over 15 years developing data extraction solutions, including projects significantly more complex than this one.
Timeline and Maintenance:
- Timeline: It would take me between 7 and 15 days to complete.
- Maintenance costs: It is best to calculate this once the solution is deployed, as it will depend on the specific extraction method used for each individual site.
Feel free to message me so we can get started right away.
$30 USD in 7 days
4.4

Hi there! I understand that scraping event data from 34 different venues with mixed website structures can be technically challenging. You need a reliable, scalable system that handles dynamic content and generates a clean weekly Google Sheet automatically. I have experience building Python-based web scraping systems using tools like BeautifulSoup, Selenium, and Playwright for dynamic JavaScript-heavy sites. I have worked with heterogeneous sources including custom websites, WordPress, and ticketing platforms, implementing structured data extraction, duplicate filtering logic, and standardized date/time formatting. I have also integrated Google Sheets API and deployed automation systems on Google Cloud VM with proper documentation and modular architecture. My approach will start with a feasibility audit of all 34 sources, selecting the most stable scraping method per site. I will implement structured extraction with standardized formats, duplicate removal logic, and automated Monday–Sunday sheet generation. The system will be modular, well-documented, scalable, and easy for you to manually control on GCP. I will also provide maintenance guidance and clear cost estimates for future updates if site structures change. Check our work https://www.freelancer.com/u/ayesha86664 Could you share whether any of the 34 venues already use structured event schema (like JSON-LD) that we can leverage? Let me know if you’re interested & we can discuss it. Best Regards, Ayesha
$30 USD in 1 day
3.9
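The question raised in the bid above about structured event schema (JSON-LD) can be illustrated with a short sketch. It assumes that a venue page embeds schema.org event data in `<script type="application/ld+json">` tags, which may not hold for any given source; the names `JsonLdParser` and `extract_events` are hypothetical, and only the Python standard library is used.

```python
import json
from html.parser import HTMLParser


class JsonLdParser(HTMLParser):
    """Collects the raw text of every <script type="application/ld+json"> tag."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._buffer = []
        self.blocks = []

    def handle_starttag(self, tag, attrs):
        script_type = (dict(attrs).get("type") or "").lower()
        if tag == "script" and script_type == "application/ld+json":
            self._in_jsonld = True
            self._buffer = []

    def handle_data(self, data):
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self.blocks.append("".join(self._buffer))
            self._in_jsonld = False


def _is_event(item):
    """True for schema.org types ending in "Event" (Event, MusicEvent, ...)."""
    types = item.get("@type") or []
    if isinstance(types, str):
        types = [types]
    if not isinstance(types, list):
        return False
    return any(isinstance(t, str) and t.endswith("Event") for t in types)


def extract_events(html):
    """Return the JSON-LD objects in the page that describe events."""
    parser = JsonLdParser()
    parser.feed(html)
    events = []
    for block in parser.blocks:
        try:
            data = json.loads(block)
        except json.JSONDecodeError:
            continue  # malformed block: skip it rather than fail the page
        if isinstance(data, dict):
            items = data.get("@graph", [data])
        elif isinstance(data, list):
            items = data
        else:
            continue
        if not isinstance(items, list):
            items = [items]
        events.extend(i for i in items if isinstance(i, dict) and _is_event(i))
    return events
```

Where a source does expose such data, fields like `name` and `startDate` can be read directly from the returned dictionaries, which tends to be more stable than parsing the page layout. Sources without it would still need site-specific extraction.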

HELLO, HOPE YOU ARE DOING WELL! I understand you need an automated web scraping system to extract event information from 34 venues and bars detailed in the provided Excel file. My expertise in data extraction and automation aligns perfectly with your project requirements. My plan involves a thorough technical review of each data source to determine scraping feasibility, implementing solutions for dynamic content, and ensuring a smooth process for generating weekly event sheets in Google Sheets. I will prioritize scalability and provide comprehensive documentation to support the system's maintenance. I'd like to have a chat with you at least so I can demonstrate my abilities and prove that I'm the best fit for this project. Warm regards, Natan.
$15 USD in 1 day
3.4

Automated Web Scraping System for Event Info
I'm excited after reviewing your project details! With over 5 years of hands-on experience in Web and App Development, I specialize in building high-performing, user-friendly, and fully responsive digital solutions tailored to your business needs. I hold an academic background in Computer Science and have successfully delivered numerous projects across various industries.
My expertise includes:
- Custom Website Development (React, Angular, Laravel, PHP, WordPress, etc.)
- Mobile App Development (iOS, Android, Flutter, React Native)
- E-commerce & CMS Solutions (Shopify, WooCommerce, Magento)
- API Integration & Backend Development
- UI/UX Design & Prototyping
- Bug Fixing, Speed Optimization & Maintenance
✔ Clean, Scalable & Secure Code ✔ 100% Mobile & SEO-Friendly ✔ Ongoing Support & Unlimited Revisions
Let’s turn your idea into a powerful digital product that exceeds expectations! Check my profile: https://www.freelancer.com/u/QuickMentor Looking forward to working with you!
$20 USD in 1 day
3.0

As a highly experienced and technically adept programmer, I'm confident in my ability to develop and execute an automated web scraping system that meets all of your project needs. My proficiency in Python, along with other relevant programming languages, makes me well-equipped to conduct the thorough technical analysis your project requires. I have a proven track record of successfully navigating heterogeneous websites, including proprietary sites, WordPress-based sources, and ticketing platforms, which will be crucial for your event information extraction process. Additionally, I've excelled at scraping dynamic, JavaScript-generated content across various sources. Moreover, my ability to create scalable, modular source code will provide you not only with an exceptional solution to your current demands but also with future usability as your project evolves. Lastly, and significantly, rest assured that I'll always deliver precisely documented code along with clear system use and maintenance instructions to empower you in managing the system. With my background in complex scraping projects and proven ability to meet deadlines and exceed expectations, let’s ensure timely action while maintaining optimum data quality.
$30 USD in 1 day
2.4

From your project description, you need an automated web scraping system that extracts event data from 34 diverse venues and bars, handling dynamic JavaScript content and varying site structures, then compiles a weekly event sheet in Google Sheets using the Google Sheets API. You also require a feasibility analysis for each source and a solution hosted on a Google Cloud VM with manual control. I have over 15 years of experience completing 200+ projects, specializing in Python, API integration, web scraping, data processing, and automation. My background includes designing scalable, modular scraping systems that manage heterogeneous sources and dynamic content effectively, which aligns well with your needs. For your project, I will start by analyzing each venue’s website to determine scraping feasibility, focusing on dynamic content extraction via Python tools like Selenium or Playwright. The system will standardize date/time formats, remove duplicates, and automate weekly Google Sheets updates through the API. I’ll deploy the solution on a GCP VM configured for your manual on/off control, with clean documentation for easy future maintenance. The entire process can be delivered within two weeks given the scope. Feel free to reach out so we can discuss your project in more detail.
$11 USD in 7 days
2.0

Hello, thanks for posting this project. I will design and implement an automated web scraping system to extract event data from your 34 sources and publish a weekly Google Sheets sheet. I will perform a feasibility assessment for each site, tackle dynamic content, and standardize outputs (yyyy-mm-dd, hh:mm). The solution will deduplicate by event name/date/venue, support multiple source formats (proprietary, WordPress, and ticketing platforms), and generate a Monday-to-Sunday sheet via Google Sheets API on a Google Cloud VM with a simple on/off control. The code will be modular, well-documented, and scalable, with clear maintenance guidance. Technologies include Python, requests/selenium for JS-driven sites, pandas for processing, Google Sheets API with OAuth, and a lightweight scheduler. I have delivered complex scraping pipelines for multi-site calendars and can adapt to the 34-source variance and the provided JM_clubes_scrappinglistxlsxclubs.csv. What is your preferred hosting region for the GCP VM and do you have an existing Google service account or credentials plan for Sheets access? Looking forward to hearing from you. Best regards,
$22 USD in 14 days
1.1
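The Monday-to-Sunday sheet mentioned in the bid above comes down to selecting one week of events and laying them out as rows. A minimal sketch, assuming events are dictionaries with dates already in yyyy-mm-dd form; the key names in `COLUMNS` and the functions `week_bounds` and `weekly_rows` are hypothetical, and the header labels follow the field list in the project description.

```python
from datetime import date, timedelta

# (hypothetical dictionary key, column header from the project description)
COLUMNS = [
    ("name", "Event Name"),
    ("date", "Date"),
    ("time", "Time"),
    ("venue", "Venue/Bar"),
    ("price", "Ticket Price"),
    ("description", "Description"),
    ("musicians", "Performing Musicians"),
    ("event_url", "Link to the event"),
    ("source_url", "Source URL"),
]


def week_bounds(day):
    """Return the Monday and Sunday of the week containing `day`."""
    monday = day - timedelta(days=day.weekday())  # weekday(): Monday is 0
    return monday, monday + timedelta(days=6)


def weekly_rows(events, day):
    """Build a header row plus one row per event in the week of `day`."""
    monday, sunday = week_bounds(day)
    selected = []
    for event in events:
        try:
            event_day = date.fromisoformat(event.get("date") or "")
        except (TypeError, ValueError):
            continue  # missing or non-standard date: left out of the sheet
        if monday <= event_day <= sunday:
            selected.append(event)
    selected.sort(key=lambda e: (e["date"], e.get("time") or ""))

    rows = [[header for _, header in COLUMNS]]
    for event in selected:
        row = []
        for key, _ in COLUMNS:
            value = event.get(key, "")
            if isinstance(value, (list, tuple)):
                value = ", ".join(value)  # musicians go in a single cell
            row.append(value)
        rows.append(row)
    return rows
```

The result is a list of lists, which is the shape the Google Sheets API expects for cell values, so it could be sent as the `values` field of a write request. Authentication and the write call itself are left out here because they depend on the credentials setup chosen for the project.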

I specialize in data extraction and web scraping to collect and structure information from any public website. ✅ Python & Scrapy: Build robust, scalable scrapers for large-scale data collection. ✅ Playwright & Selenium: Handle JavaScript-heavy sites with headless browser automation. ✅ Clean Output: Deliver structured Excel, CSV, or JSON files with de-duplicated data. ✅ Anti-Block Measures: Rotating proxies, user-agents, and respectful delays. Let's turn web data into actionable insights. Best, Usman Kokab Data Extraction & Web Scraping Specialist
$30 USD in 7 days
0.6

I can build an automated scraping system for your 34 venues using .NET Core (Scrapy/Playwright) on GCP, handling dynamic/JS content, extracting event data (name, date, time, price, musicians, etc.), removing duplicates, and generating a weekly Google Sheet via API. Feasibility analysis, modular code, and full documentation included. Experienced in complex scraping. Ready to start immediately.
$30 USD in 5 days
0.6

Leveraging my extensive experience in web development and design, I am confident that I can develop an exceptional automated web scraping system for your event information. Over the years, I have not only managed complex ERP applications but have also designed and developed multiple web applications integrated with runtime servers, ensuring fluid processes as required in this project. I have a deep understanding of numerous databases, programming languages, and frameworks, including MySQL, Python, and Django respectively, which would be vital in carrying out a thorough technical analysis of each source and choosing the most feasible scraping method. One aspect I am particularly familiar with is managing heterogeneous websites, such as WordPress-based sites and ticketing platforms, just as specified. Understanding the significance of flawless communication between data from different sources, I would ensure a smooth integration of even proprietary websites, yielding a unified weekly event sheet in Google Sheets within the ambitious deadline. Moreover, my familiarity with runtime environments like Docker and Jenkins would complement the use of the Google Sheets API for interaction with spreadsheets, enabling efficient data manipulation from Monday to Sunday. My close attention to comprehensive documentation and modularity when designing systems makes me your ideal candidate. Documentation that summarizes usage guidance and maintenance assistance will be at your disposal with the final product. Choosing me guarantees great technical dexterity throughout your current project phase, especially if you need seamless manual operation too!
$20 USD in 2 days
0.6

Hi there, I understand you're looking for an automated web scraping system to extract event information from 34 venues. I have extensive experience in building scalable web scraping systems using Python and various frameworks. I will conduct a thorough technical evaluation of each source, implementing scraping techniques tailored for their structures and ensuring data integrity through standardized formats and duplicate removal mechanisms. The next step would involve confirming the list of venues and discussing any specific concerns you might have, after which I can provide a detailed timeline for implementation. What specific formats or structures do you prefer for the weekly event sheet design? Thanks, Muskan
$50 USD in 2 days
0.0

Hello, With over 15 years of experience in web design and development, I possess the skills necessary to tackle your automated web scraping project. I am adept at Python automation and have a knack for executing complex web scraping tasks. I utilize Selenium, Beautiful Soup, and other robust web scraping tools to efficiently collect data from different websites. This proficiency enables me to conduct detailed technical analyses of all 34 sources specified in your Excel file and choose the most optimal method for each. My familiarity with heterogeneous platforms, including proprietary websites, WordPress-based sites, and ticketing platforms is unparalleled. This places me in an advantageous position to handle the diverse sources you have stated. Additionally, I am skilled at handling dynamic content and gaining access to JavaScript-generated data when required. I understand the need for a robust and scalable solution, hence my extensive experience with implementing modular and scalable systems as demanded by your project. So trust me; I will deliver a well-documented code that is scalable for future maintenance. In conclusion, my proven record with complex scraping projects coupled with my technological insights makes me the perfect fit for this task. Thanks!
$10 USD in 3 days
0.0

Hi there, I am excited about the opportunity to develop an automated web scraping system for extracting event information from various venues and bars. My expertise in Python web scraping and data processing makes me confident in crafting a robust solution that meets your needs. With extensive experience in managing disparate scraping sources, including dynamic content extraction, I can effectively analyze the 34 venues to determine the best techniques for automation. I will ensure the data is captured in a standardized format and will implement a system for managing duplicates to keep your weekly event sheet organized. My approach includes leveraging the Google Sheets API for seamless integration and providing clear documentation to facilitate maintenance. I believe a collaborative approach will ensure the final product aligns perfectly with your requirements. Could we schedule a brief call to discuss specific functionalities?
$25 USD in 1 day
0.0

Hello, I see you need an automated web scraping system to extract event information from 34 venues and bars. With over five years of experience in web scraping using Python, I have successfully developed similar systems that ensured seamless extraction from diverse site structures, including dynamic content. My approach will begin with a comprehensive feasibility analysis of each source to determine scraping methods. I’ll develop the modular scraping solution to handle elements like JavaScript-generated content, and ensure data is standardized for Google Sheets integration. As an enhancement, I suggest optimizing data cleaning processes to minimize duplicates effectively. Recently, I led a project that consolidated event data efficiently from various sources, achieving a similar goal. I estimate completing this project within 7 days. Regards, Khurshid Ahmed
$25 USD in 1 day
0.0

Hi, we went through your project description and it seems like our team is a great fit for this job. We are an expert team with many years of experience in Python, Data Processing, Web Scraping, Software Architecture, Data Mining, Data Extraction, Automation, and Data Management. Let's connect in chat so that we can discuss further. Regards
$10 USD in 7 days
0.0

United States
Payment method verified
Member since March 3, 2026