
In Progress
Paid on delivery
We are looking for an experienced web scraping and data extraction specialist to help us collect and enrich German B2B business data from large online business directories. This is an ongoing project with long-term work potential. Initial engagement will start with two websites, with additional directories to follow based on performance.

Data Sources
Examples:
* [login to view URL]
* [login to view URL]
* Additional German business directories (30+ websites available)
Each website may contain between 2–3 million business listings.

Scope of Work

Phase 1 – Directory Scraping
Extract all available B2B business records from the target directory. Required fields must be delivered in our predefined Excel format. The output format must be followed exactly. If a field is unavailable, leave it blank. Do not remove columns or alter the structure.

Phase 2 – Data Enrichment
Records will be categorized into three groups:

Category A
Businesses that already have an email address available within the directory listing.
Example:
* Company Name
* Website
* Email

Category B
Businesses that have a website but no email address available.
Requirements:
* Visit the company’s website
* Locate contact details
* Extract and populate email addresses when available

Category C
Businesses that have neither a website nor an email address.
Requirements:
* Research the company online
* Identify official website where possible
* Locate contact information
* Find relevant matching company information

Deliverables
The final deliverable must:
* Follow the supplied Excel template exactly
* Preserve all columns
* Preserve column order
* Leave unavailable fields blank
* Contain clean and structured data
* Be delivered in Excel or CSV format

Technical Requirements
Applicants should have experience with:
* Large-scale web scraping
* Data extraction
* Proxy management
* Anti-bot handling
* Data cleaning
* Data enrichment
* Email extraction
* High-volume data processing (millions of records)

Important
Please include the following in your proposal:
1. Your experience with large-scale scraping projects
2. Estimated processing capacity per day/week
3. Technologies and tools used
4. Expected turnaround time
5. Pricing model
* Per million records
* Per website
* Or any alternative pricing structure you recommend
6. Examples of similar work completed

Initial Volume
* 2 German business directories
* Approximately 4–6 million total records
This may expand significantly if the collaboration is successful. We are looking for reliable long-term partners capable of handling large-scale data extraction and enrichment operations.
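As a rough illustration of the Phase 2 grouping and the template rules described in this posting, the following Python sketch assigns a record to Category A, B, or C, pulls email-like strings out of a page's HTML, and writes rows with a fixed column order and blank values for missing fields. The column list (`TEMPLATE_COLUMNS`) is only an assumption taken from the Category A example, since the real Excel template is not shown; the function names and the simple email pattern are likewise hypothetical and would need tuning against real data.

```python
import csv
import io
import re

# Hypothetical template columns: the actual Excel template is not part of the posting.
TEMPLATE_COLUMNS = ["Company Name", "Website", "Email"]

# Deliberately simple email pattern (an assumption, not a full RFC 5322 matcher).
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def categorize(record):
    """Return 'A', 'B', or 'C' following the Phase 2 groups in the posting."""
    has_email = bool((record.get("Email") or "").strip())
    has_website = bool((record.get("Website") or "").strip())
    if has_email:
        return "A"  # email already present in the directory listing
    if has_website:
        return "B"  # website known, email must be looked up on the site
    return "C"      # neither website nor email: needs online research


def extract_emails(html):
    """Return unique, lowercased email-like strings in order of appearance."""
    found = []
    for match in EMAIL_RE.findall(html):
        email = match.lower()
        if email not in found:
            found.append(email)
    return found


def to_template_csv(records):
    """Write records as CSV, keeping every template column in its fixed order."""
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=TEMPLATE_COLUMNS)
    writer.writeheader()
    for record in records:
        # Unknown keys are dropped; missing or None values become blank cells.
        writer.writerow({col: (record.get(col) or "") for col in TEMPLATE_COLUMNS})
    return out.getvalue()
```

Building each row from the template column list, rather than from whatever keys a scraped record happens to contain, is one way to guarantee that columns are never dropped or reordered.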
Project ID: 40490465
17 proposals
Remote project
Active 7 days ago
17 freelancers are bidding on average ₹25,941 INR for this job

Hello, I can help collect and organise large-scale business data from online directories with accuracy and attention to detail. My focus is on delivering clean, structured, and reliable data while following your required format and project guidelines. I have 5+ years of experience in data processing, web research, and database management, and I am confident in handling high-volume data extraction projects. Regards, Jignesh Shah
₹25,000 INR in 7 days
4.3

I have extensive experience in large-scale web scraping and data enrichment and can handle this project end to end across both Phase 1 directory extraction and Phase 2 email enrichment. I use Python with Scrapy and Playwright for dynamic site handling, residential proxy pools for anti-bot bypass, and structured pipelines for data cleaning and deduplication. I can process between 500,000 and 1 million records per day depending on site complexity and rate limits. For Phase 2 Category B I will visit company websites to extract contact emails, and for Category C I will research companies online to identify official websites and contact information. All output will follow your Excel template exactly with no structural changes. Similar work I have completed includes large-scale scraping of German and Austrian business directories with email enrichment across several hundred thousand records, and B2B contact extraction from Italian and French commercial databases. I can begin with the two initial directories covering approximately 4 to 6 million records and work on a per-million pricing model. My rate is negotiable. Please share your Excel template and I will confirm capacity and turnaround.
₹25,000 INR in 7 days
3.7

Hello dear, I can handle the large-scale scraping and enrichment of German B2B business directories, including extracting millions of records and organizing them exactly according to your Excel template. I understand the need to scrape, categorize, enrich missing contact details, and maintain clean, structured data. I will deliver complete business records with accurate data extraction, email discovery, website research, and proper formatting while preserving all columns and their order. I can discuss processing capacity, turnaround time, tools, and a pricing model that best fits your long-term project. Best Regards, Muhammad Saad.
₹12,500 INR in 7 days
3.4

Hi, having successfully delivered over 80 projects on this platform with a consistent 5-star rating, I can efficiently process your initial 4–6 million records using automated headless browsers and custom parsers to handle both the bulk directory extraction and the multi-tiered Category B and C deep-web email enrichment. I am ready to deliver perfectly cleaned, structured data that maps exactly to your predefined Excel template, and I will gladly share my processing capacity, tools, and custom per-million pricing structure in our chat.
₹17,000 INR in 15 days
2.3

Hi, Collecting clean, structured B2B data from German sources is the kind of project where the real difficulty isn't the scraping — it's handling inconsistent site structures, DSGVO-compliant contact fields, and enrichment that actually holds up under validation. For this, I'd build a Python pipeline using Playwright for JS-heavy pages and lxml for static ones, with Scrapy as the orchestration layer. Enrichment would run through a matching step against Handelsregister data and optionally Clearbit/Hunter for contact validation — each record flagged with a confidence score so you know exactly what's clean versus what needs review. Output goes to PostgreSQL with a deduplication pass before final export. First step: send me the target sources or a sample output schema and I'll return a scoped delivery plan within 24 hours — record count estimate, enrichment depth, and a realistic timeline. Or if the scope is still forming: what's the primary use case — outbound sales, market research, or something else? That determines which enrichment fields actually matter. Best regards, Val
₹12,500 INR in 7 days
1.8

With vast knowledge and experience in data collection, internet research, Excel, and Google Sheets, I am your reliable partner for this German B2B data scraping and enrichment project. I understand the essence of accuracy, consistency, and professionalism in these colossal tasks. Having completed similar projects before, I have honed my skills in collecting accurate business information, verified emails, and company data efficiently. Using my meticulously developed web scraping techniques, I can handle the vast scope of millions of records from various German business directories effectively. To ensure quality work and timely delivery, I use innovative tools and technologies like proxy management and anti-bot handling as necessary. My aim is to provide clean and structured datasets that exactly match your predefined format. In terms of pricing and turnaround time, I propose a fair model based on the nature of your project. This could be per million records or per website, depending on which works best for you. My estimated processing capacity is x records per day/week ensuring steady progress for the project. Given my skills and expertise, I assure you that choosing me as your long-term partner will be a decision you won't regret.
₹12,500 INR in 7 days
0.0

Hello, I can handle the German B2B directory scraping and enrichment with a clean, repeatable workflow. I will keep your Excel structure exactly as specified, extract available listing fields, then enrich records by checking company websites for contact details while leaving unavailable fields blank. For large directories, I will deliver in validated batches with duplicate checks, source URL tracking, and clear notes on fields found versus missing so your team can review quality early. I can start with the first two directories, confirm the output format on a pilot batch, then scale the collection safely for the remaining sources.
₹18,000 INR in 10 days
0.0

I am Deepak, a seasoned freelancer with over 15 years of experience in Website Design and Development, Research Writing, Cloud Computing, Data Scraping and more. As an expert in proxy management and anti-bot handling, I am ready to assist you in not only scraping data from the initial 2 German business directories but also processing the estimated 4-6 million records within a reasonable timeframe. My capacity for high-volume data processing, coupled with my familiarity with data enrichment techniques, would undoubtedly be of great benefit to your project. With regard to the pricing structure, we can discuss a per-million-records approach or an alternative that aligns with both your requirements and budget. I am available for large projects involving web scraping of international directories and the various technologies they use, including the anti-bot mechanisms deployed by sites. Experience with integrated tools like MySQL for database synchronization is also a distinct advantage. Thanks, Deepak
₹25,000 INR in 7 days
0.0

Gurgaon, India
Payment method verified
Member since Jul 4, 2025