Find Jobs
Hire Freelancers

Python Assignment Voice Audio Mapping with Words

$30-250 USD

Annulé
Publié il y a plus de 9 ans

$30-250 USD

Payé lors de la livraison
Preferably Python 2.7 or Python 3.4 (open to VB.net and C#.net). (Free to use any libraries in addition to or instead of SpeechRecognition library) Read a clear audio file in python using SpeechRecognition library [login to view URL] And a string of words. (The audio file has a very clear voice, speaking the same sentence as provided in the string). We just have to map the parts of audio with each word. So, for example, the string is "i have a baby mitten". The script will read the string and audio, and return something like: word="i" begin=100ms end=175ms word="have" begin=215ms end=265ms word="a" begin=265ms end=285ms word="baby" begin=325ms end=400ms word="mitten" begin=400ms end=450ms begin and end mean the start and end of the word in the audio file. Whoever bids $150 or less will be hired immediately. Only if no one bids $150 or less, then I will consider increasing the budget. This project is for a trusted client with whom I have been working on a few projects in past, and I will be able to award you and pay you for the project only when the client is happy with the work and when he makes payment. Approach: As a coder myself, I can suggest one approach to implement an algorithm to map, although I am not sure if it will work and be accurate. Here is the approach: There are some pauses in between some words and no pause in between other words at all. So, for example, if the string is "i have a baby mitten", in the audio, there can be pauses as show below: "i <pause> have a <pause> baby mitten". Now, we can slice the audio in chunks around the pauses so that first chunk of audio is "i", second chunk is "have a", and third chunk is "baby mitten". We trim each chunk so that it doesn't have any leading or trailing pauses. Now we parse each chunk into the speechrecognition library. If the speechrecognition library outputs a single word for a chunk, we just match word with words in string, and map the chunk of audio with that word in the string. Otherwise if the speechrecognition produces multiple words, then we can loop through the chunk's length to locate a partition so that the speechrecognition library produces 1 word for the sub-chunk from start to the partition position of the bigger chunk. And once the partition for 1 word is found, we locate the word in the string of unmapped words and map it with it. And so on.
N° de projet : 6835276

Concernant le projet

2 propositions
Projet à distance
Actif à il y a 9 ans

Cherchez-vous à gagner de l'argent ?

Avantages de faire une offre sur Freelancer

Fixez votre budget et vos délais
Soyez payé pour votre travail
Surlignez votre proposition
Il est gratuit de s'inscrire et de faire des offres sur des travaux
2 freelances proposent en moyenne $172 USD pour ce travail
Avatar de l'utilisateur
Just give me the job and get it done! I'm a very experienced python developer. Enjoy life while I'm doing the work for you.
$144 USD en 3 jours
5,0 (1 commentaire)
0,0
0,0
Avatar de l'utilisateur
We are experienced in software development, Linux, Java, Python, advanced numerical computations, data analysis, crawler development etc. Our skkype is: solverio. Our website is: s o l ve r . i o
$155 USD en 30 jours
0,0 (0 commentaires)
0,0
0,0

À propos du client

Drapeau de INDIA
Ghaziabad, India
5,0
77
Méthode de paiement vérifiée
Membre depuis nov. 7, 2012

Vérification du client

Merci ! Nous vous avons envoyé un lien par e-mail afin de réclamer votre crédit gratuit.
Une erreur a eu lieu lors de l'envoi de votre e-mail. Veuillez réessayer.
Utilisateurs enregistrés Total des travaux publiés
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Chargement de l'aperçu
Permission donnée pour la géolocalisation.
Votre session de connexion a expiré et vous avez été déconnecté. Veuillez vous connecter à nouveau.