Description : Location : Bengaluru
Experience : 3+ yrs.
experience in AI Engineering (preferably Speech Tech, NLP, or Conversational AI)
Reports to : CEO / Co-Founder
Job Type : Hybrid (3 Days from Office)
About the Client has retained as hiring partner by an emerging Startup in Voice AI, building next-gen AI Voice Agents through its no-code Voice AI Studio.
Their solutions power enterprises in healthcare, finance, recruitment, and non-profits, making voice technology accessible, scalable, and inclusive.
Supported by Techstars, Microsoft, UNICEF, and GSMA, and their leadership team brings deep expertise from Stanford, Microsoft, IBM, and Uber.
Role Summary :
As an AI Engineer at based in our Bengaluru office, youll design, develop, and deploy advanced voice AI models that power our Voice AI Studio and enterprise solutions.
Youll work in a fast-paced, collaborative environment, shipping real features in days, not months, and making a direct impact on real-world applications.
Key Responsibilities :
- Develop, test, and refine end-to-end voice agent models, including ASR (Automatic Speech Recognition), NLU (Natural Language Understanding), dialog management, and TTS (Text-to-Speech).
- Stress-test agents in noisy, real-world scenarios and iterate for improved robustness and low latency.
- Research, prototype, and implement cutting-edge techniques (e.g., robust speech recognition, adaptive language understanding).
- Collaborate with backend and frontend engineers to integrate AI components into live voice products.
- Monitor agent performance in production, analyze failure cases, and drive continuous improvement.
- Design and maintain scalable, efficient voice AI architectures, optimizing for real-time performance and reliability.
- Write clear developer documentation and contribute to internal tools for automating voice AI workflows.
Experience & Qualifications :
Bachelors or Masters degree in Computer Science, Engineering, or a related field.3+ years of hands-on experience in speech-centric machine learning (ASR, NLU, TTS) and transformer-based models.Strong programming skills in Python; experience with ML frameworks such as TensorFlow or PyTorch.Experience with real-time audio pipelines and deploying ML models to production.Proven track record of building and optimizing AI / ML models for reliability and low latency.Analytical mindset with the ability to deconstruct complex voice interactions and design practical solutions.Passion for building inclusive, accessible technology.Excellent communication and teamwork skills.Preferred Qualifications :
Experience with large-scale audio datasets and enterprise-grade voice AI deployments.Familiarity with cloud platforms (AWS, Azure, GCP) and MLOps tools.Exposure to telephony, voice-based solutions, or conversational AI products.Experience working in fast-paced startup environments.Preferred Companies who have worked with or similar companies like Sarvam AI, Futwork, Olive AI, Convin, Yellow.ai, Uniphore, Observe.AI, Skit.ai, Gnani.ai and RaftLabs will be preferred.
Why Join? :
Competitive pay and flexible hybrid work.Immediate impact - ship features in days and see your work in real-world products.Learning opportunities and potential for substantial ownership.Supportive, mission-driven team with global exposure.Note :
This job description outlines the general nature and scope of work for this role. It is not an exhaustive list of all duties, responsibilities, or qualifications required of employees in this position
(ref : hirist.tech)