
Closed
Posted
I need a developer who can take this idea from concept to a working, browser-based product. The core objective is to enrol users with a unique “voice fingerprint” and then let the system verify or identify them whenever they speak again. Scope • Capture voice directly in the browser (WebRTC or similar) and accept uploaded WAV files. • Run real-time audio preprocessing—noise reduction, silence trimming, level normalisation—before any feature extraction. • Extract robust speaker features (e.g., MFCC, x-vectors, or your preferred state-of-the-art approach) and store them as a compact voiceprint tied to each account. • Provide two modes: – Verification: one-to-one match that returns a confidence score. – Identification: one-to-many search that ranks candidates with similarity scores. • Deliver a simple, clean web interface: microphone record button, enrolment dashboard, and a results panel that shows scores and pass/fail flags. • Expose the core logic through a REST or GraphQL API so it can later be integrated into other services. • Security best practices: HTTPS, encrypted storage of templates, and user consent prompts prior to recording. Acceptance criteria • Enrolment and verification must run end-to-end in under five seconds on a typical broadband connection. • Equal Error Rate (EER) ≤ 5 % on a public speaker dataset or a comparable internal test set. • Clear documentation (setup, model training pipeline, API endpoints) plus a Dockerised deployment script. Tech stack is open, but Python (TensorFlow/PyTorch), Node.js, or Rust are all fine as long as you can justify the choice and meet the performance targets. Please outline your proposed approach, libraries you favour (Kaldi, SpeechBrain, etc.), and any similar work you have shipped.
Project ID: 40197760
10 proposals
Remote project
Active 7 days ago
Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
10 freelancers are bidding on average $15 USD/hour for this job

Hello, I trust you're doing well. I am well experienced in machine learning algorithms, with nearly a decade of hands-on practice. My expertise lies in developing various artificial intelligence algorithms, including the one you require, using Matlab, Python, and similar tools. I hold a doctorate from Tohoku University and have a number of publications in the same subject. My portfolio, which showcases my past work, is available for your review. Your project piqued my interest, and I would be delighted to be part of it. Let's connect to discuss in detail. Warm regards. please check my portfolio link: https://www.freelancer.com/u/sajjadtaghvaeifr
$25 USD in 40 days
7.2
7.2

I understand your need for a browser-based voice fingerprint system that enrolls and verifies users quickly and securely. I specialize in audio processing and machine learning, with hands-on experience using Python and frameworks like SpeechBrain and PyTorch for speaker recognition tasks. My approach involves leveraging WebRTC for browser audio capture, applying real-time noise reduction and normalization, and extracting robust features such as x-vectors for accurate voiceprint creation. I will build a clean, user-friendly interface with enrolment and verification dashboards, plus a REST API for easy integration. Security and performance are priorities—I’ll ensure encrypted storage, user consent, and meet the sub-5-second response time and EER ≤5% benchmarks. Deliverables include full documentation and Docker deployment to streamline setup. I’m ready to start immediately and deliver a reliable, scalable solution tailored to your requirements.
$15 USD in 40 days
0.0
0.0

Hi, I have reviewed the job description and I'm ready to start immediately. I guarantee 100% quality based on similar tasks I have completed, and I can offer you a 10% discount for this project. Please contact me so we can discuss the job and begin working on it.
$15 USD in 40 days
0.0
0.0

Hi, I’ve reviewed your requirements for the browser based voice biometric system and can build a secure, real time speaker verification and identification solution. I will implement in browser recording via WebRTC plus WAV uploads, followed by audio preprocessing such as noise reduction, silence trimming, and level normalization. For voiceprints, I can use MFCCs with x-vector or deep speaker embedding models to create compact, reliable speaker templates linked to each account. Both modes will be supported: 1:1 verification with confidence scores and 1:N identification with ranked similarity results. I will also deliver a clean web interface with recording, enrolment, and a results dashboard showing scores and pass or fail indicators. Core functionality will be exposed through a REST API for future integration. Security will include HTTPS, encrypted template storage, and user consent before recording. I will optimize the pipeline to meet the speed requirement and evaluate accuracy on a public dataset to target EER ≤ 5%. Documentation and a Dockerized deployment setup will be included. Looking forward to working on this project. Shantanu and Team
$12 USD in 40 days
0.0
0.0

Hi, I am a full-stack developer with experience building browser-based audio and ML systems, including voice capture, preprocessing, feature extraction, and confidence-based matching. I’m comfortable taking this end to end—from recording in the browser, through a Python-based speaker recognition pipeline, to clean APIs and a simple web UI—while keeping performance, security, and reproducibility in mind. I’d be happy to talk through the approach, model choices, and trade-offs over chat and see how best to move this from concept to a working product. Best regards, Ravindra
$15 USD in 40 days
0.0
0.0

I can deliver a production-ready, browser-based voice fingerprint system with fast enrolment, accurate verification, and a clean API. Approach • WebRTC/MediaRecorder for in-browser mic capture + WAV upload • Server-side audio preprocessing (RNNoise, WebRTC-VAD, loudness normalization) • Speaker embeddings using SpeechBrain ECAPA-TDNN (state-of-the-art, low EER) • Vector search via pgvector (Postgres) or FAISS • Verification (1:1) with calibrated thresholds + confidence score • Identification (1:N) with ranked similarity results Stack Frontend: React + Web Audio API Backend: Python (FastAPI) ML: PyTorch + SpeechBrain Storage: Encrypted templates (AES-256) Deploy: Docker + Nginx + HTTPS Meets acceptance criteria • <5s end-to-end enrol/verify • EER ≤5% (VoxCeleb protocol) • Full docs + Dockerized setup I’ve built ML systems with audio pipelines, embeddings, vector search, and secure APIs. The design will be modular for future integrations (mobile, KYC, fraud detection). Happy to share a brief tech design before kickoff.
$13 USD in 30 days
0.0
0.0

Vijayawada, India
Member since Jun 30, 2020
$30-250 USD
$15-25 USD / hour
₹1500-12500 INR
₹12500-37500 INR
min $50 USD / hour
$8-15 USD / hour
$2-8 USD / hour
$250-750 USD
$10-30 USD
€80-150 EUR
$3000-5000 USD
₹750-1250 INR / hour
$275-350 USD
$10-30 USD
€30-250 EUR
$10-30 USD
$250-750 USD
₹600-800 INR
$8-15 USD / hour
₹12500-37500 INR
$30-250 USD
$30-250 USD
$250-750 USD