Job Description
Senior ASR / TTS Specialist - AI Agent Integration Expert
Company : EXL Service
Type : Full-time
Experience : 3+ years
Position Summary
We seek an exceptional Senior ASR / TTS Specialist to lead speech AI initiatives and integrate advanced speech technologies with AI agent frameworks. This role focuses on fine-tuning ASR / TTS models, implementing MLOps best practices, and building production-ready speech AI systems powering next-generation conversational AI agents.
Key Responsibilities
Speech AI Model Development & Integration
MLOps & Production Engineering
Research & Development
Required Qualifications
Core Technical Skills (Must-Have)
Speech AI Models (3+ years experience) :
- ASR Systems : Amazon Nova Sonic v1.0, Google Speech-to-Text, Azure Speech Services, Whisper, Wav2Vec2, Riva
- TTS Systems : Google TTS, Azure Cognitive Services TTS, ElevenLabs (REST / WebSocket), Tortoise, VITS, FastSpeech2
- Speech-to-Speech : Direct S2S without intermediate text, multimodal audio processing
- Cloud Services : AWS Bedrock Runtime, Google Cloud AI (Gemini API), Azure OpenAI Services
Programming & Frameworks :
- Languages : Expert Python, proficient C++ / Rust for optimization
- ML Frameworks : Advanced PyTorch, TensorFlow 2.x, JAX / Flax
- Audio Processing : librosa, torchaudio, soundfile, WebRTC, µ-law / PCM conversion
- Agent Frameworks : Hands-on experience with 3+ of : LangChain, CrewAI, AutoGen, LlamaIndex, OpenAI Assistants
MLOps & Infrastructure (Essential)
MLOps Tools (2+ years) :
- Experiment Management : MLflow, Weights & Biases
- Model Serving : TorchServe, TensorFlow Serving, NVIDIA Triton
Assistant Manager • Bangalore, India