
Closed
Posted
Paid on delivery
We are seeking a **pre-recorded Hindi call center conversation audio dataset** with a total volume of approximately **500 hours**, meeting the following requirements: Audio Requirements * Spanish language * Pre-recorded call center conversations * Dual-channel recordings preferred * Agent-side audio only * IVR (Interactive Voice Response) removed * Long silence segments removed * Limited background noise * Relatively continuous and compact conversations * Effective speech ratio preferably **≥85%** * Segment duration preferably **longer than 30 seconds** * Longer conversations are highly preferred * WAV format preferred * 16 kHz, 16-bit or higher Licensing & Copyright Requirements * Data must be **legally collected and fully licensed** * Provider must have the right to sell, license, or distribute the dataset * Commercial usage rights required * No copyright infringement or unauthorized recordings * No personally identifiable information (PII) or sensitive customer data * Proper documentation of ownership and licensing is preferred Requirement Summary * **Total required volume: 500 hours (HINDI)** ### Please Provide * Available dataset details * Sample audio files * Pricing * Delivery timeline * Licensing/copyright confirmation We are open to both full and partial datasets that meet these specifications.
Project ID: 40490627
5 proposals
Remote project
Active 7 days ago
Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
5 freelancers are bidding on average $2,303 USD for this job

Hi there, I understand you're looking for a pre-recorded Hindi call center conversation audio dataset with specific requirements. As a skilled freelancer with expertise in video editing, motion graphics, and 2D/3D animation, I may not be an exact fit for this project. However, my experience in creating engaging multimedia content can translate to reviewing and editing audio datasets. To approach this project, I would require more information about the dataset's availability, sample files, and licensing details. If you'd like to discuss further or have questions about how my skills might still be relevant, please feel free to chat with me directly on Freelancer.
$10 USD in 1 day
5.9
5.9

Hello, I am interested in supporting your Hindi call center audio dataset requirement and can assist in sourcing, validating, and delivering commercially licensable datasets that meet your specifications. Dataset Requirements Supported: • Hindi-language call center conversations • Approximately 500 hours total volume • Pre-recorded customer-agent interactions • Dual-channel recordings preferred • Agent-side audio options available • IVR removed • Long silence segments removed • Limited background noise • Continuous and compact conversations • Effective speech ratio ≥85% where available • Segment durations typically exceeding 30 seconds • WAV format • 16 kHz, 16-bit or higher quality Licensing & Compliance: • Legally collected data • Commercial usage rights • Licensed for distribution and AI/ASR training use • No unauthorized recordings • PII and sensitive customer information removed or anonymized • Ownership and licensing documentation available Deliverables: • Available dataset volume and specifications • Sample audio files for evaluation • Pricing based on required hours and licensing scope • Delivery timeline • Licensing and copyright confirmation I can work with both full and partial dataset requirements and help ensure the final delivery matches your quality, compliance, and formatting standards.
$505 USD in 7 days
2.4
2.4

Your biggest risk here probably isn't in finding the dataset, but rather in ensuring it meets your stringent quality and legal requirements. Most bids will focus on the technicalities, but let's talk strategy. Understanding the nuances of pre-recorded call center audio datasets is crucial. I'd approach this project by meticulously vetting each sample for speech ratio, background noise, and conversation compactness, ensuring a seamless fit for your needs. For technical direction, leveraging WAV format at 16 kHz with dual-channel recordings can maximize clarity and usability. Let's discuss your project in more detail. I'd love to share insights on how we can tailor a dataset that not only meets but exceeds your expectations.
$3,500 USD in 7 days
0.0
0.0

Hi, Primary risk is the mismatch between the job title and Audio Requirements: you request a pre-recorded Hindi call center conversation audio dataset while the Audio Requirements list Spanish language. I will confirm language scope before any delivery to avoid unusable data. I have delivered commercial call-center speech datasets and agent-only channel extracts for ASR and NLU teams, including dual-channel separation, IVR removal, long-silence trimming, and annotation-ready packaging. I have implemented automated pipelines to ensure Effective speech ratio targets and to enforce PII scrubbing and licensing traceability for buyers. I will first validate a language and legality audit of any candidate files and provide 10 representative sample WAVs within 48 hours for your acceptance. My first step is automated channel selection and silence/IVR detection using energy and model-based classifiers, followed by manual spot checks on licensing metadata. - Available dataset details I can provide partial and full inventories of pre-recorded call-center conversations in WAV, 16 kHz 16-bit or higher, dual-channel where available, agent-side audio extracted, IVR removed, long silences trimmed, low background noise, with Effective speech ratio documented per file. Exact hours and language breakdown will be confirmed after the language audit. - Sample audio files I will deliver 10 representative WAV samples within 48 hours upon confirmation of language scope. - Pricing Pricing depends on confirmed language set and whether you need the full 500 hours or partial slices. I will provide tiered quotes after the audit. - Delivery timeline Sample pack in 48 hours, delivery for partial sets in 7-14 days, full 500 hours timeline estimated after scope confirmation. - Licensing/copyright confirmation All supplied files will include provenance records and seller licensing statements confirming commercial usage rights and PII removal where applicable. - Would you like me to proceed with the language and licensing audit now? - Do you require the full 500 hours in Hindi only, or are Spanish segments acceptable as listed in Audio Requirements? Happy to connect for a short call to walk through the approach. Jen Harvey Z.
$5,000 USD in 50 days
0.0
0.0

Zhob, Pakistan
Payment method verified
Member since Apr 4, 2022
$10-30 USD
$10-5000 USD
$10-5000 USD
$30-250 USD
$30-250 USD
₹600-1500 INR
$250-300 USD
₹1500-12500 INR
$250-750 USD
€8-30 EUR
$750-1500 USD
₹100-400 INR / hour
₹12500-37500 INR
$15-25 USD / hour
$30-250 CAD
$250-750 USD
₹100-400 INR / hour
$15-25 USD / hour
$250-750 USD
$30-250 AUD
€6-12 EUR / hour
$250-750 USD
₹12500-37500 INR
₹100-400 INR / hour
$15-25 USD / hour