
Fermé
Publié
PDF to Excel Data Scraper Needed Job Title: Data Scraper Needed: Convert 24 PDF Factsheets to Clean Excel (Mutual Fund Portfolios) Project Overview: I need a freelancer to extract detailed stock portfolio data from ~24 Mutual Fund Monthly Factsheets (PDFs). I will provide the URLs/Files. Your job is to extract the full stock holdings table for specific funds and deliver a consolidated, clean Excel/CSV file. The Goal: I need the complete list of stocks (100% of the portfolio), NOT just the Top 10. The data is used for financial backtesting, so accuracy is critical. Even top 85-90% data works. Scope of Work: Input: ~24 PDF Files (Monthly Factsheets). Target Funds: For each month, extract data for the Top 10 Equity Funds (e.g., Bluechip, Midcap, Smallcap, Value Discovery, etc. - list will be provided). Total Volume: Approx. 240 tables (24 months × 10 funds). Output Format: A single Consolidated Excel/CSV file with these columns: Date (Month-Year) Fund Name Stock Name (Full name as in PDF) Percentage (%) (Holding weight) Sector (If available) Requirements: Complete Data: If the sum of holdings es les than 85%, I will reject the work. Attention to Detail: The PDFs have complex multi-column layouts. Rows must not be merged incorrectly and weightage for each stock should be specified correctly . Clean Formatting: Numbers must be formatted as numeric values (not text) for analysis. Tools: AI / OCR / Scripts Allowed: You can use ChatGPT, Gemini, Python (pdfplumber), or any tool. I only care about the final accuracy. Budget & Timeline: Type: Fixed Price. Budget: 1000 Deadline: 6 Days ( Its very easy task, which can be completed within 3-4 hours) To Apply: Please confirm you understand that "Top 10 holdings" is NOT enough. Mention "100% Portfolio" in your proposal so I know you read this.
N° de projet : 40217243
87 propositions
Projet à distance
Actif à il y a 22 jours
Fixez votre budget et vos délais
Soyez payé pour votre travail
Surlignez votre proposition
Il est gratuit de s'inscrire et de faire des offres sur des travaux
87 freelances proposent en moyenne $19 USD/heure pour ce travail

I am a proficient data scraper with extensive experience in converting complex PDF data into structured Excel spreadsheets, suitable for financial analysis. My expertise lies in extracting detailed financial and stock data, ensuring complete and accurate results. With proficiency in tools such as Python's pdfplumber and advanced OCR technologies, I am confident in my ability to handle your requirements efficiently and deliver a consolidated file with all necessary details. Having successfully completed similar projects, I understand the importance of capturing 100% of portfolio data, especially for financial backtesting purposes. My approach ensures thorough attention to detail, maintaining accurate weightage information without errors in multi-column layouts. I utilize automated and manual validation techniques to guarantee data precision and clean formatting, crucial for downstream analysis. I am keen to discuss your project requirements further. Could you specify any additional formatting preferences for the Excel output? Please let me know if you want more information on my previous projects. Best regards.
$20 USD en 40 jours
7,9
7,9

As a seasoned Web-Scraping Specialist and a full-time freelancer, I am keen on employing my extensive experience in extracting data from websites, including those with advanced anti-bot protection for your project. Over the years, I have scraped thousands of complex and protected sites to deliver reliable, structured, and clean data—something that will be critically essential in extracting precise Mutual Fund Portfolio data for your financial backtesting purposes. My proficiency in using a wide range of tools like Selenium, BeautifulSoup, Scrapy, Requests, and others combined with my Python expertise can make this task an absolute breeze as well as can guarantee you 100% accurate data. I've also worked a great deal with PDFs using tools such as pdfplumber to convert them into various formats. Additionally, being detail-oriented and dedicated to delivering quality outputs has always been my top priority. Ensuring clean formatting is quite familiar to me—it includes properly not merging rows and correctly identifying the weightage of each stock—which are quite important instructions you've laid out in your project description. With such wealth of experience and adaptability to various tasks,I can confidently promise not only accurately extracted data information that meet the 85% minimum portfolio requirement, but that you will be thoroughly satisfied in every step of our collaboration. Let's talk more about how we can transform your data vision into a graspable reality!
$20 USD en 40 jours
7,6
7,6

⭐⭐⭐⭐⭐ Convert 24 PDF Factsheets to Clean Excel for Mutual Fund Data ❇️ Hi My Friend, I hope you are doing well. I just checked all of your project requirements and I can see you are looking for a data scraper to convert PDF factsheets to Excel. You have no need to look any further as Zohaib is here to help you! My team is already doing 50+ similar projects for data extraction. I will extract the full stock holdings for each fund accurately and deliver a clean Excel file that meets your needs. ➡️ Why Me? I can easily do your data scraping project as I have 5 years of experience in data extraction, PDF processing, and Excel data management. My expertise includes using tools like Python, pdfplumber, and OCR for accurate data retrieval. I also have a strong grip on data cleaning and formatting to ensure everything is easy to analyze. ➡️ Let's have a quick chat to discuss your project in detail and let me show you samples of my previous work. Looking forward to discussing with you in chat. ➡️ Skills & Experience: ✅ Data Extraction ✅ PDF Processing ✅ Excel Data Management ✅ Data Cleaning ✅ Python Programming ✅ OCR Tools ✅ Data Formatting ✅ Database Handling ✅ Attention to Detail ✅ Financial Data Analysis ✅ Scripting Automation ✅ Problem Solving Waiting for your response! Best Regards, Zohaib
$17 USD en 40 jours
7,6
7,6

Hi there, I’m excited about the opportunity to help you extract a complete list of stock holdings from your mutual fund factsheets. As a top freelancer based in California, I have a proven track record of delivering high-quality data extraction projects, earning 5-star reviews along the way. I understand the importance of accuracy in your work, especially since you need 100% of the portfolio data for your financial backtesting. I'll utilize advanced tools such as Python and OCR capabilities to ensure we capture every detail required, specifically focusing on the layout and formatting you mentioned, so you receive a clean and consolidated Excel/CSV file. Let's aim to get you this data within your timeline of 6 days to keep your project on track. Please feel free to message me right away with any additional instructions or specifics about the PDFs. Can you provide the URLs or files for the PDF factsheets as soon as possible?
$30 USD en 34 jours
4,8
4,8

I can extract 100% Portfolio data from all 24 Mutual Fund PDFs, capturing every stock holding with correct percentages and sectors, and deliver a single clean Excel/CSV file with Date, Fund Name, Stock Name, Percentage, and Sector, fully formatted for analysis within your 6-day timeline.
$15 USD en 40 jours
4,9
4,9

AM READY TO START: 100% PORTFOLIO EXTRACTION FROM 24 MUTUAL FUND FACTSHEETS TO EXCEL Hello, I am John K., a full-time data extraction specialist with over 15 years of experience. I specialize in accurately converting complex financial PDFs, like mutual fund factsheets with multi-column layouts, into clean, analysis-ready datasets using advanced Python libraries (pdfplumber, Camelot) and meticulous verification to ensure 100% data integrity. My understanding is: You need the 100% Portfolio holdings data (not just Top 10) from approximately 24 monthly factsheets for 10 target funds each, extracted into a single Excel file with Date, Fund Name, Stock Name, Percentage, and Sector, ensuring the sum of holdings exceeds 85% for acceptance. I will deliver: • ✅ A single, consolidated Excel/CSV file with all ~240 tables accurately extracted. • ✅ Complete 100% portfolio data for each fund, with numeric formatting and verified sums. • ✅ The final dataset ready for financial backtesting, delivered within the 6-day deadline. I confirm I will extract the 100% Portfolio holdings, not just the Top 10. I am ready for this precise task. Let's connect via chat to transfer the PDFs and fund list—I can begin the extraction immediately. Respectfully, John K.
$15 USD en 40 jours
4,4
4,4

With comprehensive programming skills that range from Python (pdfplumber) to Laravel, I am more than competent to tackle this data scraping and migration project for you. Throughout my extensive 12-year career, I've amassed a wealth of experience in tasks just like these. The sophistication of the PDF layouts in question poses no problem for me thanks to my attention to detail and meticulous work ethic- expect no disproportionate merges and perfectly specified stock weights. I understand that you require full stock holdings data rather than a mere top ten list, as accurate data is vital for financial backtesting applications. My prior encounters with similar undertakings combined with my analytical mindset make me well-acquainted with the rigors of your task. Rest assured, I will deliver exactly what you need: clean Excel/CSV files with columns marked for Date, Fund Name, Stock Name, Percentage, and Sector as available - all properly formatted for easy numeric analysis.
$15 USD en 40 jours
4,4
4,4

Hello, Having worked extensively with data processing and entry for over eight years, I bring a level of expertise to the table that is perfect for your PDF-to-Excel project. My proficiency in Python (including pdfplumber), Excel and other relevant AI tools enables me to accurately extract, consolidate, and format your mutual fund portfolio data in a clean and organized manner - reflecting the 100% holdings you require. Attention to detail is paramount in this project considering the complex layouts of the PDFs and precision demanded for accurate stock weightage information. My track record of delivering production-grade web platforms, including L2 rollups, that prioritize accuracy speaks to my commitment to detail-oriented work. Furthermore, I can assure you complete data retrieval (>85%) and numerical formatting too. Given your tight deadline, my extensive experience in managing time-bound projects aligns well with your needs. My top priority will be to deliver your consolidated Excel/CSV files well within the timeframe provided without any compromises on quality. Let's leverage my skills in data entry and data processing alongside my eye for detail to guarantee accurate financial backtesting data that will prove invaluable for your evaluations. Thanks!
$50 USD en 1 jour
3,9
3,9

Hi, I understand that “Top 10 holdings” is NOT enough—you need “100% Portfolio” coverage (≥85% aggregate weight per fund/month or the work is rejected). Please send the samples so I confirm I can extract all stock holdings from ~24 monthly PDF factsheets and deliver a single, clean Excel/CSV ready for analysis. Approach Parse PDFs with a robust pipeline (pdfplumber/tabula + checks) to handle multi‑column tables, wrapped text, and page breaks. Normalize columns: Date (MMM‑YYYY), Fund Name, Stock Name (as in PDF), Percentage (numeric), Sector (if available). Validate totals per fund/month (flag if <85%), enforce numeric types, de‑dupe, and preserve exact naming. Deliverables One consolidated .xlsx/.csv. A QA log listing each fund/month with total % sum and any notes (e.g., “sector missing in source”). Optional small data dictionary. To get started (and ensure accuracy), please share: 2–3 sample PDFs (different months + different funds). The exact fund list per month. Any quirks (e.g., annex pages with full holdings, sector on separate table). Preference: Excel or CSV (I can provide both). Ready to begin as soon as you send the samples.
$15 USD en 20 jours
4,0
4,0

Hello, I understand the requirement clearly and confirm that Top 10 holdings are NOT enough, I will extract the 100% Portfolio (or minimum 85%+ as specified) for each fund. I have experience extracting complex multi-column tables from PDFs into clean, analysis-ready Excel/CSV files with correct numeric formatting. I will ensure: Accurate stock names and weightages No merged or misplaced rows Proper consolidation across all months and funds I can complete this well within the deadline with high accuracy. Looking forward to working with you. Best regards, Sidra
$20 USD en 40 jours
3,6
3,6

Hi, We went through your project description and it seems like our team is a great fit for this job. We are an expert team which have many years of experience on Data Processing, Data Entry, Excel, Web Scraping, PDF, Data Scraping, Data Extraction, Data Analysis Lets connect in chat so that We discuss further. Regards
$19 USD en 40 jours
3,4
3,4

I understand this project requires extracting the FULL holdings data, not just the Top 10, from ~24 monthly mutual fund factsheets. I will deliver the 100% Portfolio (minimum 85%+ coverage per fund), accurately structured for financial backtesting. I have experience extracting complex multi-column tables from PDFs and handling financial datasets where accuracy is critical. I will ensure correct stock names, exact holding percentages, clean numeric formatting, and a fully consolidated Excel/CSV file. I can use Python (pdfplumber), OCR, and manual validation where needed to guarantee precision. If required, I’m happy to provide a small sample from one PDF before final delivery. Keyword confirmation: 100% Portfolio
$16 USD en 20 jours
3,5
3,5

★★★ TOP 1% IN FREE LANCER WORLD ★★★ ★★★ 20+ Year Experience in IBD being CMD★★★ ★★★ 200+ Country Satisfied Clientele ★★★ ★Linkedin★ ★Data Entry★ ★Business Plans★★★ ★★★ Operational Strategic planner Customer Support 24*7★★★ ★★★Excel/Word Operation★★★ ★★★Chat Support★★★ ★★★Calling Support★★★ ★★★Business Plans / Marketing Strategy ★★★ * Digital Marketing★★★ ★★★Social Media Marketing ★★★ ★★★Internet Marketing ★★★ ★★★Any type of Data Projects★★★ ★★★★★★ Regards, ★★★CMD★★★ ★★★PVSYS GROUP (INDIA)★★★ ★★★IF YOU THINK THEN I CAN★★★
$15 USD en 40 jours
3,4
3,4

Hello, I'm excited about the opportunity to help you convert the 24 Mutual Fund Factsheets into a clean Excel file. I understand that you need a comprehensive extraction of the entire stock portfolio, ensuring accuracy by capturing 100% of the holdings. With over 10 years of experience in data extraction and analysis, I'm well-equipped to handle complex PDF layouts and deliver exact results. I have a solid background in using tools like Python and OCR for effective data scraping. My attention to detail ensures that each stock's weightage and sector information will be accurately represented in your final output. I’m ready to start immediately and commit to delivering the consolidated Excel file within your deadline. Could you please clarify if there are any specific formatting styles you prefer for the Excel sheet?
$25 USD en 1 jour
3,1
3,1

Thank you for considering my proposal for the PDF Data Scrape to Excel project. I understand your need to extract detailed stock portfolio data from 24 Mutual Fund Monthly Factsheets accurately for financial backtesting purposes. With my expertise in data processing, extraction, Excel, and data analysis, I ensure precise results. I will deliver a consolidated Excel/CSV file with 100% complete stock portfolio listings, meeting your accuracy requirement. You can count on my attention to detail, and I am familiar with using tools like Python (pdfplumber) to ensure efficient extraction. Entrust your project to me for thorough and top-notch results. Looking forward to the opportunity to work together. Regards, Jason McLachlan
$20 USD en 3 jours
3,2
3,2

As an experienced data scientist and engineer with a deep-rooted commitment to precision, I am uniquely positioned to complete this scraping endeavor with utmost accuracy and efficiency. My proficiency in Python, including expertise with relevant libraries like pdfplumber, is an asset that I believe will elevate the quality of your project. My track record demonstrates my ability to handle complex data sets (such as multi-column layouts) without compromising on clarity; this aligns perfectly with your specific needs for clean formatting. Moreover, my knowledge of OCR tools such as Gemini ensures that AI or scripts are utilized maximally, guaranteeing the best possible outcome for your project. Capitalizing off my meticulous nature, I guarantee that the numbers in your final Excel/CSV files will be formatted as numeric values rather than text; thus they'll be readily usable for any analysis you may require. With me by your side, you'll meet both your budget and deadline expectations without any compromise on quality; you can expect substantial progress within just a few hours of kickstarting this project. Let's bring precision to your financial backtesting process—I'm here to help!
$20 USD en 40 jours
2,9
2,9

Hello, I’ve reviewed your project on converting 24 PDF mutual fund factsheets into a single, clean Excel/CSV file with 100% portfolio data. I’m confident I can deliver accurate, source-verified holdings for all 10 funds per month, with precise weights and full stock names, formatted for reliable backtesting. What I bring: a data-focused approach using Python (pdfplumber or OCR) to navigate complex multi-column PDFs, robust checks to ensure 100% holdings (not just the top 10), and clean numeric formatting suitable for analysis. I will consolidate 24 months × 10 funds into one dataset, with fields: Date, Fund Name, Stock Name, Percentage, Sector (if available). What you’ll get: an end-to-end workflow that preserves data integrity, rigorous sanity checks (sum of holdings across funds), and a single consolidated file that’s ready for analysis. I can start immediately and complete within six days as agreed. I understand you require 100% Portfolio data (not just the Top 10). This aligns with my approach and quality controls. What is the preferred file naming convention and the exact column mapping you want beyond the listed fields (e.g., handling of missing sectors, exact formatting for percentages)? Best regards, RICHARD
$50 USD en 37 jours
2,6
2,6

Hi , I don’t use auto-bidding because I value genuine, personalized communication. I’d love the chance to connect and discuss your project, "PDF Data Scrape to Excel - Mutual Funds - 08/02/2026 02:52 EST," in more detail. Based on your description, I’m confident my experience and creative approach align well with your goals. I’m Altaf Rattani, a U.S.-based Technology and Creative Consultant helping clients bring their ideas to life through thoughtful design and smart digital solutions. Whether it’s branding, UI/UX, web design, or development, I focus on creating results-driven, visually engaging, and user-friendly outcomes tailored to each project’s unique vision. Here’s what you can expect when working with me: Custom concepts and clear communication throughout Revisions until you’re completely satisfied Final deliverables in all required professional formats 100% original work with full ownership rights My portfolio: https://www.freelancer.com/u/altafr99 Thanks for considering my proposal. I’d love to connect and discuss your project in more detail! Regards, Altaf Rattani
$15 USD en 17 jours
2,0
2,0

Hi there. I can extract the 100% Portfolio holdings from all 24 monthly factsheets into one clean, analysis-ready Excel/CSV. Please skim the full bid below because it explains how I’ll keep accuracy high and avoid the usual PDF table issues. I understand this is NOT “Top 10 holdings” فقط. You need the full holdings table per fund per month (about 240 tables total), and you’ll reject anything under ~85% coverage. How I’ll do it Use Python (pdfplumber/tabula/camelot as needed) plus light OCR only when a page is scanned Parse each fund’s holdings table into a standard schema: Date, Fund Name, Stock Name, % Weight, Sector (if shown) Normalize numbers to true numeric cells (no text percent issues), keep names exactly as shown in PDF Handle multi-column layouts carefully (no merged rows, no shifted weights) Quality checks Per fund-month: verify % totals and flag any missing rows Quick spot-check against the PDF for each file, especially where layouts change Deliver a consolidated Excel + CSV, plus separate “raw extracts” if you want auditability If you share the PDFs/URLs + the exact list of the Top 10 equity funds to pull each month, I can start right away. — Dax Manning
$25 USD en 40 jours
2,0
2,0

Hello, I’ve reviewed your requirements carefully and fully understand that Top 10 holdings are NOT enough. This project requires extracting the complete / 100% Portfolio (minimum 85%+) from each mutual fund factsheet, and accuracy is critical. ▶What I’ll Deliver ●Full stock holdings tables from ~24 PDF factsheets ●Clean, consolidated Excel/CSV with: ●Date (Month-Year) ●Fund Name ●Stock Name ●Holding Percentage (%) ●Sector (if available) ●Numeric formatting ready for financial backtesting ●Verified totals to ensure portfolio coverage ≥ 85% ▶Why Choose Me ●Strong experience handling complex, multi-column financial PDFs ●No merged rows, no missing holdings, no formatting errors ●Comfortable using powe query, OCR, and manual validation to ensure accuracy I confirm again that this task requires 100% Portfolio data, not just top holdings. I can complete this efficiently and within your deadline. Best regards, Muzammal Pasha
$15 USD en 40 jours
2,0
2,0

Sri Jayawardenepura Kotte, Sri Lanka
Membre depuis juil. 26, 2025
$15-25 USD / heure
₹1500-12500 INR
₹1500-12500 INR
₹12500-37500 INR
€30-250 EUR
₹600-1500 INR
$1500-3000 USD
$3000-5000 USD
$15-25 USD / heure
₹1250-2500 INR / heure
$250-750 USD
$2-8 USD / heure
$8-15 USD / heure
$30-250 USD
$30-250 USD
$750-1500 USD
₹1500-12500 INR
₹1500-12500 INR
₹750-1250 INR / heure
₹37500-75000 INR
₹1500-12500 INR