
Closed
Posted
Paid on delivery
I need a developer who can wire together Unreal Engine’s MetaHuman framework with several AI services to create a real-time, push-to-talk assistant that speaks back instantly in perfect lip-sync.

Two separate screens drive the experience.
• On the landscape display (1920×1080) I’ll host English and Arabic buttons, a press-and-hold mic icon, plus live text showing what the user just said and what the AI replies.
• On the portrait display (1080×1920) a MetaHuman avatar delivers the reply, streaming audio while its face tracks the phonemes.

My chosen pipeline is:
1. Button held → audio captured.
2. Speech sent to the OpenAI Whisper API for transcription (I prefer the cloud API or local model).
3. Plain text routed through n8n, which handles prompt logic and returns the response.
4. That response feeds straight into ElevenLabs for low-latency audio streaming.
5. MetaHuman receives the stream and plays it with accurate lip-sync.

Top priority is minimal latency; every millisecond matters along with lip-sync. The language switch happens only through the on-screen toggle, with no voice commands or auto-detection.

Deliverables I expect:
• Full Unreal Engine project with clean, well-commented Blueprint code and neatly organised folders
• A packaged build that runs out of the box on Windows
• Setup notes explaining API keys, n8n endpoint configuration, and any special Unreal plugins or MetaHuman Live Link steps

When you reply, highlight past work that combines MetaHuman (or similar real-time avatars) with external AI services, especially anything that shows you’ve already tuned pipelines for sub-second round-trips. Please include a realistic timeline from project kick-off to first test build and to final delivery.
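The pipeline described above could be prototyped outside Unreal before any Blueprint work, to see where the latency goes. The following is only a minimal sketch under stated assumptions: `run_turn`, `TurnResult`, and the three `fake_*` stages are hypothetical names, and the stages are local stand-ins rather than real Whisper, n8n, or ElevenLabs calls. It also runs the stages one after another, whereas the posting asks for streamed audio. The language is passed in explicitly because the posting says it comes only from the on-screen toggle.

```python
import time
from dataclasses import dataclass, field
from typing import Callable, Dict

# Languages offered by the on-screen toggle (no auto-detection).
SUPPORTED_LANGUAGES = ("en", "ar")


@dataclass
class TurnResult:
    transcript: str
    reply: str
    audio: bytes
    timings_ms: Dict[str, float] = field(default_factory=dict)


def run_turn(
    audio_in: bytes,
    language: str,
    transcribe: Callable[[bytes, str], str],
    ask_workflow: Callable[[str, str], str],
    synthesize: Callable[[str, str], bytes],
) -> TurnResult:
    """Run one push-to-talk turn and record how long each stage took."""
    if language not in SUPPORTED_LANGUAGES:
        raise ValueError(f"unsupported language: {language!r}")

    timings: Dict[str, float] = {}

    def timed(name, fn, *args):
        # Wrap a stage call and store its wall-clock duration in milliseconds.
        start = time.perf_counter()
        result = fn(*args)
        timings[name] = (time.perf_counter() - start) * 1000.0
        return result

    transcript = timed("stt", transcribe, audio_in, language)
    reply = timed("workflow", ask_workflow, transcript, language)
    audio = timed("tts", synthesize, reply, language)
    timings["total"] = sum(timings.values())
    return TurnResult(transcript, reply, audio, timings)


# Hypothetical stand-ins for the three services; real clients would replace these.
def fake_transcribe(audio: bytes, language: str) -> str:
    return "hello"


def fake_workflow(text: str, language: str) -> str:
    return f"[{language}] you said: {text}"


def fake_synthesize(text: str, language: str) -> bytes:
    return text.encode("utf-8")


if __name__ == "__main__":
    result = run_turn(b"\x00\x01", "en", fake_transcribe, fake_workflow, fake_synthesize)
    print(result.transcript, "->", result.reply)
    print({stage: round(ms, 3) for stage, ms in result.timings_ms.items()})
```

Swapping a stand-in for a real client keeps the timing logic unchanged, so the per-stage numbers show which service dominates the round-trip.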
Project ID: 40258956
87 bids
Remote project
Active 7 days ago
87 freelancers are bidding an average of $482 USD for this job

Hello, I understand you want a real-time MetaHuman voice assistant in Unreal Engine with instant lip-sync, using a push-to-talk flow across two displays. My approach is to build a clean Unreal project with a modular Blueprint/C++ setup: capture mic input on hold, stream to Whisper, feed text through a lightweight n8n workflow for prompts, push the response to ElevenLabs for ultra-low-latency audio, and render the avatar lip-sync via MetaHuman Live Link. I’ll ensure a responsive sub-second loop by optimizing audio buffers, using local/cloud Whisper options depending on latency targets, and aligning phoneme timing with the MetaHuman avatar. Deliverables will include a Windows-ready packaged build, well-commented Blueprints, organized folders, and setup notes for API keys, n8n endpoints, and any plugin steps. I have prior work blending real-time avatars with external AI services and tuned pipelines for near real-time interaction. Timeline: kickoff → first test build in ~2 weeks → final delivery in ~4 weeks. What is your preferred Whisper model (local inference vs cloud) and target latency budget per interaction?
$750 USD in 21 days
6.4
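The bid above mentions optimizing audio buffers. As a rough illustration of why buffer size matters, the delay a capture or playback buffer adds is its length in frames divided by the sample rate. The sketch below is only arithmetic; the function name `buffer_latency_ms`, the 16 kHz sample rate, and the buffer sizes are assumptions chosen for the example, not values from the posting.

```python
def buffer_latency_ms(frames_per_buffer: int, sample_rate_hz: int) -> float:
    """Time one audio buffer represents, in milliseconds."""
    if frames_per_buffer <= 0 or sample_rate_hz <= 0:
        raise ValueError("frames_per_buffer and sample_rate_hz must be positive")
    return 1000.0 * frames_per_buffer / sample_rate_hz


if __name__ == "__main__":
    # Assumed 16 kHz mono capture; compare a small and a large buffer.
    print(buffer_latency_ms(512, 16000))   # 32.0
    print(buffer_latency_ms(2048, 16000))  # 128.0
```

Under these assumed numbers, the larger buffer alone would consume more than a tenth of a one-second round-trip budget.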

✅ Lovable AI Expert | AI Development | Chatbot Development | Unreal Engine ✅ Hi, Thank you for considering this opportunity! I bring extensive experience in implementing custom solutions powered by LLMs, conversational AI, and intelligent automation. Recently I have been using Lovable AI to develop a gaming platform, complete with chat-based agent logic, expressive front-ends, and backend integrations. In another project, I implemented a fully automated AI agent system for intelligent meeting creation using ElevenLabs Conversational AI and Gemini (via a custom agent brain). The flow integrates voice interaction, natural language processing, location precision, and a frontend. Whether you're building an internal assistant, a public-facing voice agent, or an integrated AI productivity tool, I can help bring your vision to life with robust, scalable architecture and a human-like user experience. I would love to connect and explore how we can contribute to your AI initiative. (Note: Budget is flexible; we can finalize it after reviewing the complete scope.) Thanks & Regards, Kajal
$750 USD in 7 days
6.2

Hello, I HAVE HANDS-ON EXPERIENCE WITH SUCH PROJECTS. I have 15+ years of proven experience in real-time interactive systems and confidently understand your requirement. The goal is to build a low-latency Unreal Engine MetaHuman assistant that transcribes user audio, generates AI responses, and streams them with accurate lip-sync for a natural, bilingual experience.
-->> Landscape interface with push-to-talk audio capture, language toggle, and live text display
-->> Portrait MetaHuman avatar delivering AI responses with sub-second lip-sync via ElevenLabs
-->> Whisper API integration for real-time transcription (English/Arabic)
-->> n8n pipeline for prompt handling and dynamic response generation
-->> Fully packaged Unreal Engine project with clean Blueprints and structured folders
I follow a modular, low-latency architecture with efficient API routing, Blueprint optimization, and iterative testing to ensure seamless lip-sync and instant audio playback. I would approach your project by first setting up the audio capture and transcription pipeline, then integrating the AI response and audio streaming, followed by MetaHuman lip-sync tuning and final build optimization. I’m ready to implement this project from start to finish and deliver a polished, production-ready MetaHuman voice talkbot. Thanks & regards, Julian
$250 USD in 15 days
5.6

Waiting for that MetaHuman avatar to reply when every millisecond counts can break immersion and frustrate users, especially when juggling live lip-sync and instant language switching on two displays. Relying on multiple AI services makes it even tougher to keep everything seamless. You’ll get a tightly wired Unreal Engine project where your push-to-talk assistant replies instantly in both English and Arabic, with accurate lip-sync and smooth transitions between screens. First, I’ll connect your chosen pipeline for real-time audio capture, transcription, and response handling within Unreal. Next, I’ll optimize the live audio streaming into MetaHuman, focusing on sub-second latency and phoneme tracking. Finally, I’ll deliver a clean build with setup notes so it runs out of the box on Windows. Would you like to walk through how the language button and mic icon should behave during rapid switching?
$512 USD in 7 days
5.3

⭐⭐⭐⭐⭐ Dear Valuable Client, CnELIndia, led by Raman Ladhani, can seamlessly deliver your real-time MetaHuman assistant by leveraging our experience integrating Unreal Engine avatars with external AI services. We have previously developed projects where MetaHumans and similar real-time avatars were synced with AI-driven text-to-speech and chat systems, achieving sub-second round-trip latencies and precise lip-sync. Our team can wire the landscape and portrait displays, implement the Whisper transcription pipeline, route responses through n8n, and stream ElevenLabs audio to the avatar with minimal latency. We ensure clean Blueprint architecture, well-organized folders, and thorough setup documentation for API keys, endpoints, and Live Link configurations. Proposed timeline: project kick-off → 1 week for environment setup and initial integration; 2 weeks to wire AI services, build interface, and test sub-second response; 1 week for polish, optimization, and packaging; final delivery in 4 weeks with a fully functional Windows build. This approach guarantees performance, reliability, and maintainability aligned with your requirements.
$500 USD in 7 days
5.4

As an experienced and dedicated freelance developer with over 7 years in the field, I firmly believe my skills and commitment are what your innovative project needs. I have combined APIs and real-time avatars like MetaHuman before and have successfully tuned complex pipelines for sub-second round-trips, which has equipped me to tackle this project's core requirements of low latency and accurate lip-sync. I specialize in C++ programming, the main language of Unreal Engine, which makes me well versed in Unreal Engine. The OpenAI speech-to-text API integration can be tuned to meet your expectations. Moreover, my understanding of AI services and their integration lets me see possibilities for further optimization and enhancement in your chosen pipeline. I guarantee clean, well-commented Blueprint code that forms the foundation for a top-performing, push-to-talk MetaHuman assistant, and I will ensure the product runs smoothly on your desired platform. Your project will be treated like my own; it deserves nothing but the best planning and execution. With this mindset, I'm ready to start work immediately while keeping you updated throughout the process as I deliver the first test build and the final delivery, with every detail clearly explained in setup notes. Let's efficiently turn your vision into reality!
$500 USD in 7 days
4.3

Hi, I would like to take this opportunity and will work until you are 100% satisfied with our work. We are an expert team with many years of experience in Cloud Computing, C++ Programming, Unreal Engine, Voice Assistance Devices, API Integration, ElevenLabs, Speech Synthesis, AI Chatbot, AI Development, and n8n. I will share my recent work with you in private chat due to privacy concerns. Regards
$500 USD in 7 days
4.0

Unreal Engine Real-Time MetaHuman Voice Talkbot. I’m a full-stack software engineer with expertise in React, Node.js, Python, and cloud architectures, delivering scalable web and mobile applications that are secure, performant, and visually refined. I also specialize in AI integrations, chatbots, and workflow automations using OpenAI, LangChain, Pinecone, n8n, and Zapier, helping businesses build intelligent, future-ready solutions. I focus on creating clean, maintainable code that bridges backend logic with elegant frontend experiences. I’d love to help bring your project to life with a solution that works beautifully and thinks smartly. To review my samples and achievements, please visit: https://www.freelancer.com/u/GameOfWords Let’s bring your vision to life. Connect with me today, and I’ll deliver a solution that works flawlessly and exceeds expectations.
$250 USD in 4 days
4.0

Your project involving a real-time MetaHuman Talkbot aligns perfectly with my recent work in creating low-latency interactive AI avatars for enterprise training and virtual concierge services. I have successfully bridged Unreal Engine 5 with modular conversational AI pipelines, ensuring that the character's responsiveness and facial expressions feel natural rather than robotic. My primary focus is on minimizing the "delay gap" between a user finishing their sentence and the MetaHuman beginning its vocal and visual response, which is the most critical factor for maintaining user immersion in interactive voice systems. To build this, I will implement a robust C++ and Blueprint architecture that manages the asynchronous data flow from a Speech-to-Text provider like Whisper or Deepgram through a logic layer using OpenAI’s GPT-4o or a custom LangChain setup. For the output, I will integrate ElevenLabs or Azure Speech for high-fidelity Text-to-Speech, which I will then synchronize with the MetaHuman’s facial animation using Live Link Face, Nvidia Audio2Face, or the Oculus Lipsync plugin for precise phoneme mapping. I will also optimize the network stack to ensure that data packets for audio and facial transforms are prioritized, preventing any frame drops or stuttering during the real-time AI inference cycle. Do you have a preferred hosting environment for the LLM, or would you like recommendations on balancing response depth with the lowest possible latency? Additionally, are we targeting a local high-end workstation deployment or a cloud-based Pixel Streaming solution for broader accessibility? I am available to discuss these technical specifics and can provide a walkthrough of my previous MetaHuman integration workflows to ensure we are aligned on the performance benchmarks. Please let me know if you would like to schedule a brief call to finalize the architectural blueprint for this talkbot.
$616 USD in 21 days
3.7

Hello, I specialize in Unreal Engine real-time MetaHuman integrations with AI pipelines. I can create your push-to-talk assistant with minimal latency, streaming user audio through Whisper for transcription, routing prompts via n8n, generating responses, and streaming back via ElevenLabs into a lip-synced MetaHuman avatar. Both landscape and portrait displays will update in real time with transcription and AI response text. The workflow will be fully modular: button press → audio capture → Whisper transcription → n8n prompt logic → ElevenLabs TTS → MetaHuman lip-sync. Blueprint scripts will be clean, well-commented, and structured for maintainability, while packaged builds for Windows will run out of the box. API configuration, Live Link setup, and endpoint notes will be documented for seamless replication. Sub-second round-trips will be achieved through asynchronous streaming, buffer management, and optimized audio pipelines. Deliverables: fully commented Unreal Engine project, packaged Windows build, setup documentation, and a tested, low-latency MetaHuman AI voice talkbot ready for both English and Arabic toggles. Will the Whisper transcription and ElevenLabs TTS run via cloud APIs exclusively, or is local/offline fallback required? Are there any network bandwidth constraints that may affect sub-second streaming performance?
$750 USD in 11 days
3.3
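The bid above attributes sub-second round-trips to asynchronous streaming and buffer management. One common way to get that effect is a producer/consumer queue, where playback starts on the first audio chunk instead of waiting for the whole reply. The sketch below is a minimal illustration under stated assumptions: `fake_tts_stream` is a hypothetical stand-in for a streaming TTS response, and `played` stands in for an audio device. It uses only Python's standard `asyncio` and is not the bidder's implementation.

```python
import asyncio
from typing import AsyncIterator, List


async def fake_tts_stream(text: str, chunk_size: int = 4) -> AsyncIterator[bytes]:
    """Hypothetical stand-in for a streaming TTS response: yields small byte chunks."""
    data = text.encode("utf-8")
    for i in range(0, len(data), chunk_size):
        await asyncio.sleep(0)  # yield control, as a network read would
        yield data[i:i + chunk_size]


async def producer(stream: AsyncIterator[bytes], queue: asyncio.Queue) -> None:
    # Push chunks as they arrive; a bounded queue applies backpressure.
    async for chunk in stream:
        await queue.put(chunk)
    await queue.put(None)  # sentinel: no more audio


async def consumer(queue: asyncio.Queue, played: List[bytes]) -> None:
    # "Play" each chunk as soon as it is available.
    while True:
        chunk = await queue.get()
        if chunk is None:
            break
        played.append(chunk)


async def stream_reply(text: str) -> List[bytes]:
    queue: asyncio.Queue = asyncio.Queue(maxsize=8)
    played: List[bytes] = []
    await asyncio.gather(producer(fake_tts_stream(text), queue), consumer(queue, played))
    return played


def play_reply(text: str) -> List[bytes]:
    """Synchronous entry point for the sketch."""
    return asyncio.run(stream_reply(text))


if __name__ == "__main__":
    chunks = play_reply("hello world")
    print(len(chunks), b"".join(chunks))
```

The bounded queue size is an arbitrary choice here; in a real build it would be tuned against the playback buffer so the avatar neither starves nor lags behind the stream.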

Hello, I’m excited about the opportunity to contribute to your project. With strong experience integrating Unreal Engine (including MetaHuman) with external AI services, real-time audio pipelines, and low-latency STT → LLM → TTS workflows, I can implement your push-to-talk system using Whisper for transcription, n8n for prompt orchestration, ElevenLabs for streaming voice output, and precise MetaHuman lip-sync driven by phoneme-aligned audio in a dual-screen setup. I’ll tailor the work to your exact requirements, focusing on sub-second round-trip optimization, clean and well-structured Blueprints, stable Windows packaging, and clear setup documentation covering API keys, n8n endpoints, and required Unreal/MetaHuman plugins so the system runs out of the box. You can expect clear communication, fast turnaround, and a high-quality result that fits seamlessly into your existing workflow. Best regards, Juan
$500 USD in 3 days
3.5

Hey, I’ve reviewed your project and understand you’re looking to integrate Unreal Engine MetaHuman with Whisper, n8n, and ElevenLabs to create a real-time push-to-talk assistant with ultra-low latency and precise lip-sync across dual displays. The priority is sub-second round-trips and accurate phoneme-driven facial animation. I can build a clean Unreal Engine project using Blueprint with optimized audio capture, async API handling, and streaming playback. Whisper will handle transcription, n8n will manage prompt routing, and ElevenLabs streaming audio will feed directly into MetaHuman with proper lip-sync using Live Link or audio-driven facial animation. The landscape UI will manage the language toggle and live captions, while the portrait display renders the avatar smoothly at 1080×1920. You’ll receive a packaged Windows build, structured project files, and clear setup notes for API keys and endpoints. Let’s connect so I can outline a timeline from kickoff to first test build and final delivery. Best regards, Muhammad Adil Portfolio: https://www.freelancer.com/u/webmasters486
$600 USD in 8 days
3.0

Hi there, good evening. I am Talha. My skills match your project: AI Development, API Integration, C++ Programming, Voice Assistance Devices, Cloud Computing, ElevenLabs, AI Chatbot, Unreal Engine, n8n, and Speech Synthesis. I am excited to present my proposal, which centers around a personalized approach designed to elevate your project. We will start with an in-depth consultation to gain a deep understanding of your project's unique requirements, goals, and constraints. Our commitment to customization means that we will tailor our services to align perfectly with your project, and we will explain how this approach will meet your expectations. Please note that the initial bid is an estimate, and the final quote will be provided after a thorough discussion of the project requirements or upon reviewing any detailed documentation you can share. Could you please share any available detailed documentation? I'm also open to further discussions to explore specific aspects of the project. Thanks and regards, Talha Ramzan
$250 USD in 14 days
2.8

HELLO, HOPE YOU ARE DOING WELL! You’re looking to wire Unreal Engine’s MetaHuman with AI services to create a low-latency, seamless, push-to-talk real-time assistant with perfect lip-sync across dual screens and both English and Arabic support. My expertise aligns with your needs—I've integrated MetaHuman avatars with AI-driven pipelines, optimized for low-latency, real-time user interactions and live lip-sync, ensuring neat Unreal Blueprint organization and smooth deployment. My plan is to build a modular Unreal project that ties together your full AI pipeline—from mic capture to live Whisper transcription, prompt routing in n8n, streaming ElevenLabs audio, and MetaHuman live response—maintaining priority on responsiveness and accurate bilingual switching, all within a user-friendly, well-documented system. I'd like to have a chat with you at least so I can demonstrate my abilities and prove that I'm the best fit for this project. Warm regards, Natan.
$500 USD in 2 days
2.6

Hello! Thank you for the detailed project description. I have extensive experience integrating Unreal Engine's MetaHuman framework with AI services, specifically in creating real-time avatar interactions that prioritize low latency and accurate lip-sync.

To tackle this project, I will:
- Design a seamless user interface for both landscape and portrait displays, ensuring an intuitive experience with the English and Arabic buttons and mic icon.
- Implement a robust pipeline that captures audio, utilizing the OpenAI Whisper API for transcription and routing text through n8n to manage prompt logic efficiently.
- Integrate ElevenLabs for audio streaming, ensuring that the MetaHuman avatar delivers responses in perfect synchronization with the audio output.

I expect to deliver a fully functional Unreal Engine project with well-organized Blueprint code, a packaged build for Windows, and comprehensive setup documentation detailing API configurations and necessary plugins. I am eager to start this project and confident in my ability to meet your expectations for quality and timely delivery. I propose a timeline of approximately 4-6 weeks from kickoff to the first test build, with adjustments as needed based on your feedback. We can negotiate the budget and timeframe in more detail. I look forward to discussing this further!
$250 USD in 7 days
2.3

Hello, I can seamlessly integrate Unreal Engine’s MetaHuman framework with your AI services to create a real-time, push-to-talk assistant that delivers instant, lip-synced responses. In past projects, I've successfully connected MetaHumans with AI frameworks for interactive experiences, ensuring minimal latency and flawless audio-visual synchronization. For instance, I developed a similar application that achieved sub-second response times by optimizing the API communication pathways and leveraging efficient data handling.

To tackle your project, I’ll focus on optimizing the pipeline:
1. Implementing efficient audio capture and transcription with the OpenAI Whisper API.
2. Configuring n8n for prompt logic to ensure swift responses.
3. Integrating ElevenLabs for low-latency audio streaming and precise phoneme tracking with the MetaHuman.

I have a couple of questions:
- Do you have preferred settings for the Whisper API (cloud vs. local)?
- Are there specific Unreal plugins you want to include for the MetaHuman Live Link?

I’m ready to kick off this project immediately. Expect a first test build within two weeks, leading to final delivery shortly after. Let’s make this happen.
$250 USD in 7 days
1.8

With my extensive experience in developing fast, secure, and scalable web systems with APIs, I can deliver the real-time MetaHuman voice talkbot you need with exceptional precision. I will also target sub-second round-trips so that your priority of minimal latency is met, since every millisecond saved matters for your project. As for the past-work highlights you asked for, my previous work combines real-time avatars similar to MetaHuman with external AI services, resulting in optimal performance and seamless integration. During my 8+ years as a full-stack developer and digital marketing expert, I have had the opportunity to launch high-quality digital products that actually convert. Your project, which aims at an unrivaled user experience with proper phonemic synchronization, cannot be taken lightly. My production pipeline, which is mostly focused on running APIs through n8n, is just what you need. Over time, I have developed a knack for creating clean, well-commented Blueprint code and have maintained neatly organized folders in most of my projects, ensuring superior accessibility and ease of use. As a detail-driven professional, I guarantee delivery of a project that runs out of the box on Windows. Ready to start immediately and deliver professional results. Timeline and cost after discussion. Portfolio: https://www.freelancer.com/u/moizs13 ✔ 100% satisfaction guaranteed ✔ High-quality professional work Regards, Moiz Y
$250 USD in 15 days
1.4

Hello, I can deliver a real-time MetaHuman Voice Talkbot with minimal latency, integrating Unreal Engine’s MetaHuman framework, OpenAI Whisper, n8n, and ElevenLabs as specified. My approach will focus on optimizing the pipeline for sub-second round-trips, ensuring seamless lip-sync and language toggling. With 5+ years of experience in real-time avatar systems and AI integration, I’ve successfully tuned similar pipelines for low-latency performance. Send a message to see samples of my work or discuss further. Thanks, Adegoke. M
$338 USD in 5 days
0.2

Hello. I came across your project, Real-Time MetaHuman Voice Talkbot, and it aligns well with my background. I have hands-on experience with Cloud Computing, C++ Programming, and Unreal Engine that is directly relevant here. Feel free to reach out if you have questions.
$250 USD in 7 days
0.0

Hi there! I’m excited about your project and can help integrate Unreal Engine’s MetaHuman framework with the AI services you’ve outlined to create a responsive, real-time assistant. With over 10 years of experience building production systems, I’ve successfully worked on projects involving real-time avatars and complex API integrations, ensuring minimal latency and accurate lip-sync. I understand the importance of getting this right from the start, so I’m happy to answer any technical questions you have. To kick things off, we could start with a small milestone to ensure we’re aligned. I’ll provide a full Unreal Engine project with organized code and setup notes, ready to run on Windows. Looking forward to collaborating!
$250 USD in 7 days
0.0

Diera, United Arab Emirates
Member since Feb 21, 2016