
Closed
Paid on delivery
I hope this message finds you well. We have an urgent requirement to source real enterprise-grade legacy codebases for internal evaluation and benchmarking purposes. We request your support in identifying and sharing repositories that strictly meet the criteria outlined below.

1. Minimum Eligibility (Mandatory)
Repositories must meet all of the following:
- Minimum 100,000+ Lines of Code (LOC)
- At least 100+ Pull Requests (PRs) with meaningful discussions
- Minimum 50+ Issues, including several with detailed problem descriptions
- 200+ commits distributed over time (no bulk or single-day commits)
- Real, human-written production code (no AI-generated or synthetic projects)
- Originating from a real, verifiable company
- Must have legal rights available to share or transfer

2. Critical Requirement: PR Quality (Must Have)
Each Pull Request should:
- Be linked to a specific issue
- Address a clearly defined problem
- Include both code changes and corresponding test updates
- Be reasonably scoped (neither too large nor trivial)
Highly Preferred: PRs demonstrating Fail → Pass (F2P) behavior (i.e., tests fail before the fix and pass after implementation)
⚠️ Note: Repositories where PRs contain only code changes without test coverage will not be considered.

3. Preferred Technology Stack
- C#
- Java
- Python
- PHP
- .NET Framework
- COBOL
- Other legacy enterprise technologies

4. Preferred Industry Domains (High Priority)
- Banking / Financial Services
- Accounting
- Insurance
- Healthcare
- Legal Technology
- Government Systems
- Enterprise SaaS (complex workflow-driven platforms)
Note: Ecommerce, retail, content platforms, and frontend-heavy applications are not within scope.

5. Technical Readiness (Very Important)
Repositories should:
- Build and run successfully
- Include a Dockerfile (preferred) or clear setup instructions
- Have proper dependency management
- Follow a clean and structured project layout
- Contain test suites (preferably 50+ test files)
- Maintain clear PR-to-issue linkage
- Ensure each PR ideally resolves one issue

6. Required Metadata (Exact Figures)
For each repository submitted, please provide:
- Company name, industry, and country
- Primary programming language(s)
- Exact Lines of Code (LOC)
- Number of files
- Number of commits
- Number of Pull Requests
- Number of Issues
- Number of contributors
- Repository age (years active)

7. Additional Notes
- Strong preference will be given to repositories with robust PR and test linkage
- Well-structured development history is critical
- Low-quality, bulk-imported, or poorly maintained repositories will be rejected

We are specifically looking for high-quality engineering datasets, not just large codebases. Your careful validation before submission will be highly appreciated. Please treat this request as high priority and share suitable options at the earliest. If you have any questions or need clarification, feel free to reach out. Thank you for your support.

Warm regards,
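
For anyone pre-screening candidates, the numeric thresholds in section 1 and the Fail → Pass preference in section 2 can be checked mechanically before any manual review. The sketch below is only an illustration: the `RepoStats` record, its field names, the helper functions, and the 25% single-day cutoff used to approximate "no bulk or single-day commits" are all assumptions, not part of the brief.

```python
from collections import Counter
from dataclasses import dataclass
from datetime import date


@dataclass
class RepoStats:
    """Hypothetical metadata record; field names are illustrative only."""
    loc: int
    pull_requests: int
    issues: int
    commit_dates: list[date]  # one entry per commit


def eligibility_failures(stats: RepoStats, max_single_day_share: float = 0.25) -> list[str]:
    """Return the mandatory numeric criteria the repository fails (empty list = passes)."""
    failures = []
    if stats.loc < 100_000:
        failures.append("fewer than 100,000 LOC")
    if stats.pull_requests < 100:
        failures.append("fewer than 100 PRs")
    if stats.issues < 50:
        failures.append("fewer than 50 issues")
    if len(stats.commit_dates) < 200:
        failures.append("fewer than 200 commits")
    else:
        # Assumed heuristic: flag the repo if one calendar day holds too
        # large a share of all commits (a sign of a bulk import).
        busiest_day_count = Counter(stats.commit_dates).most_common(1)[0][1]
        if busiest_day_count / len(stats.commit_dates) > max_single_day_share:
            failures.append("commits concentrated on a single day")
    return failures


def is_fail_to_pass(before: dict[str, bool], after: dict[str, bool]) -> bool:
    """True if at least one test that failed before the fix passes after it.

    Both arguments map a test name to whether that test passed.
    """
    return any(not passed and after.get(name, False) for name, passed in before.items())
```

The numeric check only covers what can be counted; company origin, legal rights, and discussion quality still need human verification.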
Project ID: 40406747
46 proposals
Remote project
Active 13 days ago
46 freelancers are bidding on average ₹443,798 INR for this job

Hi, sourcing repositories like this isn’t just about size; most large codebases fail on PR quality, test linkage, or clean history when you actually audit them. I can help identify and validate enterprise-grade repositories that meet your strict criteria (LOC, PR→issue linkage, test coverage, commit quality, and legal usability), rather than just sending bulk GitHub links.
I’ll focus on:
- Verified company-backed repos (banking, SaaS, gov, etc.)
- Strong PR discussions + test-backed changes (incl. F2P where possible)
- Clean build setup (Docker / reproducible environments)
- Complete metadata with exact metrics as requested
For similar research/technical work: https://www.freelancer.com/u/Microlent
Happy to share a first curated batch quickly for your review.
~ Rajesh
₹375,000 INR in 7 days
9.4

Hello! As per your project post, you’re looking to source Enterprise Grade Legacy Codebases that meet strict benchmarks for size, contribution quality, and real world usage, specifically for internal evaluation and benchmarking. The goal is to identify high quality, verifiable repositories with strong PR discipline, meaningful issue tracking, and production ready code that reflects real development practices. My focus will be on delivering a curated set of qualified repositories, featuring: projects exceeding 100K+ lines of code with structured commit history, repositories with 100+ meaningful pull requests linked to well defined issues, inclusion of test coverage and PR validation workflows, identification of F2P style contributions where applicable, verification of real company origin, and confirmation of legal shareability or licensing compliance. I specialize in software architecture review and repository analysis, with experience in evaluating large scale codebases across different tech stacks. My focus will be on identifying repositories that not only meet your numeric criteria but also demonstrate clean engineering workflows, maintainability, and real world complexity. Let’s connect to review your preferred tech stacks, domains, and usage goals so we can finalize a targeted shortlist and evaluation framework. Best regards, Nikita Gupta.
₹250,000 INR in 45 days
6.7

Hello, I will be able to help you. Please message me so that we can have a detailed technical discussion. I have 9+ years of combined experience in mobile application development, website development, desktop application development, third-party artificial intelligence APIs, AR/VR, chatbots, blockchain and cryptocurrency, CRM & ERP, game development, and other software development. I have expertise in native Android (Java, Kotlin) and iOS (Swift), in hybrid cross-platform development with Flutter (Dart) and React Native, and in web and backend development with React.js, Node.js, and Python Django. Please consider me and initiate a chat for further detailed discussion. Regards, Anju Logical Soft Tech Pvt Ltd, Indore (M.P.)
₹250,000 INR in 45 days
6.4

I have worked with clients who needed to benchmark legacy enterprise applications, and I understand how critical quality and clarity in repo history are for this kind of evaluation. To meet your criteria, I can help source repositories from verified companies in Banking and Financial Services that have clean issue-to-PR linkage, detailed commit histories spread over time, and include tests reflecting fail-then-pass patterns. I can also verify the build processes with Docker or setup instructions to ensure these repositories run without hassle. A couple of quick clarifications: Are you open to repos using older versions of .NET Framework or Java if they meet all other requirements? Also, would you prefer repos hosted publicly or are private repositories accessible with your legal approval acceptable? Once I have your preferences, I can prioritize quick and thorough validation of candidate repos and provide the exact metadata you requested. I’m ready to start sourcing and vetting immediately.
₹375,000 INR in 7 days
5.9

With our multidisciplinary expertise rooted in full-stack development, AI-powered systems, and deep knowledge of Java and Node.js, our team at MHTechFusion not only has the technical prowess but also the industry experience to find repositories that fulfill every single criterion you listed. We have an expansive understanding of legacy enterprise technologies like C#, Python, PHP, .NET Framework, COBOL, and Java; making the search for you a breeze. Moreover, we understand the value of quality PRs in maintaining robust systems. Our team is well-versed in producing clean code with meaningful test coverage, properly resolving distinct issues. Your mission of finding high-quality engineering datasets resonates strongly with us. Over the years, we've delivered on similar assignments demanding precision and detail; ensuring your compliance needs are met impeccably. Lastly, our strong command over backend development and DevOps tooling (AWS, Docker) ensures that not only are the repositories we find equipped with proper test suites but also follow structured project layouts with a complete setup so that they can be reliably and swiftly integrated into your evaluation processes. Looking forward to assisting you effectively and efficiently at MHTechFusion!
₹500,000 INR in 60 days
6.2

As a seasoned developer with 9+ years of experience in web and mobile development, I bring to the table a refined understanding of the technologies that are critical for your project, namely C#, Java, Python, PHP, .NET Framework, and COBOL. My expertise extends to other legacy enterprise technologies as well. Furthermore, my proficiency in clean coding practices ensures that every line of code I contribute is highly organized and thoroughly documented, leading to an easy-to-navigate repository structure. At Neha Developers, we prioritize quality above all else, which resonates perfectly with your requirements. As professionals who constantly seek out opportunities to improve our craft, your initiative to collect real enterprise-grade legacy codebases for evaluation and benchmarking deeply appeals to us. We believe sound engineering datasets are essential for meaningful insights, an outlook that aligns with yours. Drawing on my extensive expertise in building large-scale projects and my familiarity with Dockerfiles and dependency management, I assure you that any repositori
₹375,000 INR in 7 days
5.4

Good to see this project, I will source enterprise codebases meeting the full bar: 100k+ LOC, 100+ PRs with issue linkage, 50+ detailed issues, 200+ commits over time, real test suites, Dockerized builds, and complete metadata. Stallyons holds legal rights on internal enterprise SaaS work in healthcare workflow and finance back-office that fits this profile. One angle: F2P (fail-to-pass) PRs and tight PR-to-issue linkage are the hardest filters here. Most repos clear LOC and commits easily but fail on test discipline. I will pre-validate every candidate against the full schema before submission. Questions: 1) Target volume, timeline, and is payment per accepted repo or pooled in the posted range? 2) Acceptable to send one vetted sample first for sign-off before bulk? 3) For healthcare/banking, is anonymized code acceptable, or strictly originals with full ownership? Looking forward to your response. Best regards, Faizan
₹300,000 INR in 7 days
5.3

Hello! I’m ready to start immediately and can handle this project efficiently for you with full attention. You can review my portfolio here: https://www.freelancer.in/u/NareshJoshiTech I’d really appreciate the chance to discuss your project in detail and explore how we can create something great together. Looking forward to hearing from you. Warm regards, Naresh Joshi
₹375,000 INR in 7 days
3.6

Hi, I’m Karthik from Resonite Technologies with 15+ yrs in enterprise systems & legacy codebases. Your requirement needs **curated, high-quality sourcing**, not generic repos. Most public projects fail PR→Issue→Test linkage and F2P standards, so we handle strict validation.
**Approach**
• Source from verified enterprise/open-core + partner networks
• Filter: 100K+ LOC, 100+ PRs, 50+ issues, 200+ commits
• Deep audit: PR↔Issue linkage, test coverage, F2P patterns
• Validate: Build/run (Docker), clean structure, dependencies
• Focus domains: Banking, Healthcare, Insurance, Gov, SaaS
**Deliverables**
✔ Vetted repositories with full metadata (LOC, PRs, issues, commits, contributors)
✔ PR quality + test linkage report
✔ Setup/build verification notes
✔ Legal clarity on sharing rights
I can share 2–3 validated samples quickly for your review.
Best regards, Karthik Resonite Technologies
₹555,550 INR in 7 days
3.7

I understand your urgent requirement for sourcing and evaluating real enterprise-grade legacy codebases. With my experience in identifying and analyzing complex systems, I can help you find suitable repositories that meet the specified criteria. I'll leverage my knowledge of open-source platforms and networks to search for codebases that fit your description. I'll approach this project by utilizing my expertise in code analysis, searching through reputable platforms like GitHub, GitLab, and Bitbucket. I'll focus on finding repositories that have undergone significant development, with a high number of commits, pull requests, and meaningful discussions. In the past, I've worked with large-scale systems, including ERP, CRM, and SaaS platforms, which require strict adherence to coding standards and best practices. My experience in this area will enable me to identify codebases that meet the minimum eligibility criteria. Key features of the codebases I'll identify include a minimum of 100,000+ Lines of Code, at least 100+ Pull Requests with meaningful discussions, and a minimum of 50+ Issues with detailed problem descriptions. I'll also ensure that the codebases are human-written, originated from a real company, and have legal rights available for sharing or transfer. I'll also consider any additional requirements or edge cases that you may want me to focus on during my search. I'm confident that I can deliver a comprehensive list of suitable codebases that meet your requirements. Is there a specific format or structure you'd like me to use for presenting the codebases, or any particular details you'd like me to focus on during my analysis? I can deliver this in 5 days.
₹357,370 INR in 5 days
3.3

Hello! Based on your project description, you are looking to source and evaluate real enterprise-grade legacy codebases for internal benchmarking and engineering analysis. The focus is on highly structured, production-level repositories with strong development history, meaningful pull request workflows, test coverage, and verifiable real-world usage from established companies across regulated or enterprise-heavy domains. I will focus on assisting you in identifying, filtering, and structuring candidate repositories that meet strict engineering quality standards, including large-scale codebases with substantial commit history, PR-to-issue traceability, test-driven development patterns, and proper CI/CD readiness. I will also ensure any suggested datasets or repositories align with your required enterprise domains such as banking, healthcare, insurance, and government systems, while excluding non-relevant ecommerce or frontend-heavy projects. I specialize in full-stack enterprise systems and code architecture analysis, with 7+ years of experience, and I have done similar work in the past. Please open the chat window so I can share it with you. Please contact us to discuss your evaluation framework, preferred data sources (GitHub Enterprise, internal repos, or open source forks), and validation depth so we can define the most accurate sourcing strategy. Best regards, Nikita Gupta.
₹250,000 INR in 46 days
3.2

This looks straightforward at first, but sourcing true enterprise-grade legacy repositories with clean PR-to-issue-test linkage is where most attempts fail—especially when filtering out synthetic, low-quality, or poorly maintained codebases. I’ve handled similar research tasks where strict validation criteria like yours required careful screening, not just scraping large repos. I can help you identify and verify high-quality, legally shareable repositories that meet your exact benchmarks—focusing on real production systems with structured development history, meaningful PR discussions, and solid test coverage. Each submission will be validated against your checklist (LOC, PR quality, issue linkage, contributors, build readiness, etc.) and delivered with complete metadata in a structured format. The approach will prioritize enterprise domains (finance, healthcare, SaaS) and mature stacks like Java, .NET, and Python—ensuring the dataset is actually useful for benchmarking, not just large in size. Let's connect to start working.
₹375,000 INR in 7 days
3.0

After carefully reading your job description, I’m confident we can support this high-priority engineering dataset sourcing requirement with a structured and professional approach. With 13+ years of experience, I will personally lead this project with my expert research and engineering team. We understand you need real enterprise-grade legacy repositories with strict filters: 100k+ LOC, strong PR-to-issue linkage, meaningful commit history, production-quality code, tests, legal shareability, and verified company origin. Our expertise includes GitHub/GitLab research, codebase auditing, software due diligence, metadata extraction, legacy stacks (C#, Java, Python, PHP, .NET, COBOL). We will deliver fully validated repository options with exact metrics, quality screening, and benchmarking-ready documentation. Can we connect?
₹375,000 INR in 7 days
2.6

We understand that your requirement goes beyond simply collecting large repositories: you need carefully vetted, enterprise-grade legacy codebases that reflect real-world engineering practices and can be reliably used for benchmarking and evaluation. Our approach is to identify and validate repositories that strictly meet your criteria, including 100K+ lines of code, meaningful pull requests linked to well-defined issues, strong test coverage, and a consistent, distributed commit history. We will prioritize codebases from verifiable companies operating in high-value domains such as banking, insurance, healthcare, and enterprise SaaS, ensuring the data represents authentic production environments. From a technical perspective, we will focus on repositories built with technologies like JavaScript, Python, and COBOL, while ensuring they are buildable, well structured, and include proper dependency management and test suites. Each repository will be audited for PR quality, issue linkage, and development consistency, and delivered with complete metadata including LOC, commits, PRs, issues, contributors, and project age. Our goal is to provide a curated, high-quality dataset that meets your strict standards, reduces your validation effort, and delivers meaningful insights for your internal evaluation process.
₹375,000 INR in 7 days
2.6

Hello dear Client, I design and build focused software engineering and data systems with clear architecture, scalable pipelines and maintainable workflows. For your requirement, I’ll support identifying and structuring enterprise-grade legacy codebases for evaluation and benchmarking, ensuring they meet strict quality, traceability and engineering-history criteria. Let’s discuss in chat as I have some queries to ask regarding the project to proceed further.
₹250,000 INR in 7 days
2.0

Hi, This is a highly specific requirement focused on sourcing high-quality enterprise codebases with strong PR-to-test linkage. I can help identify and validate repositories that meet your strict criteria.
Approach:
* Filter repositories from verified companies (GitHub Enterprise/open-source orgs)
* Validate LOC (100k+), commits, PRs, and issues with proper distribution
* Check PR quality: issue linkage, scoped changes, and test coverage (including F2P patterns where available)
* Ensure build readiness (Docker/setup, dependencies, test suites)
* Exclude low-quality, bulk-imported, or synthetic projects
Target domains: Banking, healthcare, government, enterprise SaaS, and other workflow-heavy systems aligned with your scope.
Deliverables:
* Curated list of qualified repositories
* Full metadata for each (LOC, commits, PRs, issues, contributors, age, etc.)
* Verification of legal usability and source authenticity
Timeline: 3–5 days for initial shortlist (3–5 high-quality repos)
Quick question: Do you require only publicly available repositories, or are private/licensable datasets also acceptable? I can ensure strict validation and provide only high-quality, usable datasets.
₹375,000 INR in 7 days
0.8

Hi, This is a highly specific and quality-driven requirement, and I understand that you’re not just looking for large repositories, but well-structured, enterprise-grade codebases with meaningful engineering history, strong PR-to-issue linkage, and test-backed development. I can assist in sourcing and validating such repositories based on your defined criteria. My approach will focus on identifying real-world enterprise projects (from verified organizations), then carefully filtering them against your requirements including LOC, PR quality, issue depth, commit distribution, and test coverage. I will ensure that each shortlisted repository includes complete and accurate metadata such as LOC, commits, PRs, issues, contributors, and project maturity. Special attention will be given to PR quality—ensuring proper issue linkage, scoped changes, and presence of test updates, including cases with fail-to-pass behavior where available. I will also verify technical readiness, including build/run capability, dependency management, and availability of setup instructions or containerization support. Given the depth of validation required, I will prioritize quality over volume and share only well-qualified repositories that meet your expectations. Let’s connect to align on timelines and any specific domain priorities so I can begin immediately. Best regards, Siddharth Agarwal
₹250,000 INR in 30 days
0.7

When it comes to backend development, I can confidently say that I have honed my skills over the last 3+ years. My specializations are Python (mainly Django/DRF), PostgreSQL, and working with high-performance production environments. This makes me highly capable of handling and validating the engineering datasets you requested. I’ve had extensive experience building scalable API-driven SaaS platforms with a strong focus on reliability. Clean code, predictable output, and stable performance under load are my priorities. I've also got my hands dirty with Dockerized deployments, secure authentication, and Stripe integration, among others. Most importantly, I have a flair for end-to-end API architecture, from reliably processing JWT/OAuth auth flows to async processing and CI/CD, so I firmly believe I can cover the tech stacks you require (C#, Java, .NET). I also have a knack for quickly acclimating to unfamiliar codebases. To wrap up, while most developers are comfortable within their specific realms, I take pride in being able to broaden mine, whether by strengthening what I know or picking up something entirely new, such as COBOL or other legacy enterprise technologies you may require. Let's not wait any longer! Let me dive right into your project, navigate through legacy code, and identify repositories that surpass your requirements, all readily available for transfer.
₹375,000 INR in 28 days
0.0

Hi, I understand you’re looking for high-quality, enterprise-grade legacy codebases with strict criteria around PR quality, issue linkage, and test coverage: not just large repos, but clean engineering datasets. I have experience working with large-scale repositories and can systematically source, validate, and deliver curated codebases that meet your exact requirements.
My approach:
• Use GitHub API + advanced filtering to shortlist candidates (LOC, commits, PRs, issues)
• Manually validate PR → Issue linkage, test coverage, and discussion quality
• Prioritize enterprise-backed repositories (banking, healthcare, SaaS, etc.)
• Ensure repositories are buildable, well-structured, and production-grade
• Exclude low-quality, synthetic, or poorly maintained projects
What you’ll receive:
✔ Fully vetted repositories meeting your criteria
✔ Complete metadata (LOC, commits, PRs, issues, contributors, age)
✔ Notes on PR quality, test linkage, and architecture
✔ Setup/build verification (Docker or instructions)
I can deliver an initial shortlist quickly and refine based on your feedback. Ready to start immediately and treat this as high priority. Let’s connect to align on any edge criteria before I begin.
Best regards, Arun
₹375,000 INR in 20 days
0.0

Resonite Technologies can support this urgent requirement by sourcing and validating enterprise-grade legacy repositories that match your benchmarking criteria. We have a proven team experienced in legacy enterprise systems, codebase auditing, Git history analysis, PR/issue validation, test coverage review, and technical due diligence across C#, Java, Python, PHP, .NET Framework and older enterprise stacks.
Our approach:
✔ Identify only real, verifiable company-origin repositories
✔ Validate 100K+ LOC, 200+ commits, 100+ meaningful PRs, 50+ issues
✔ Review PR quality: issue linkage, scoped fixes, test updates, and F2P potential
✔ Check build readiness, Docker/setup docs, dependency structure and test suites
✔ Exclude synthetic, AI-generated, bulk-imported or low-quality repositories
✔ Prepare exact metadata: LOC, files, commits, PRs, issues, contributors, age, language, company, country and domain
✔ Prioritize BFSI, accounting, insurance, healthcare, legal, government and enterprise SaaS systems
We understand this is not just about large repositories, but high-quality engineering datasets with clean development history and strong PR-to-test traceability. We can perform careful validation before submission and share only suitable options with a concise evaluation report.
Regards, Karthik Resonite Technologies
₹4,900,000 INR in 7 days
0.0

Pune, India
Member since Dec 22, 2025
₹250000-500000 INR