Speech Data Scientist - Machine Learning

Albatronix Consulting Private Limited, Bangalore

5 days ago

Job description

About The Opportunity :

Join a high-velocity engineering team building robust, low-latency speech and voice solutions for large-scale deployments.

You will design and ship state-of-the-art ASR models and production pipelines, bridging classical signal-processing foundations with modern transformer-based speech models to drive measurable product impact.

Role & Responsibilities :

Lead design, training and optimisation of ASR systems, both end-to-end and hybrid, using transformer and sequence modeling (Wav2Vec 2.0, Whisper, CTC, attention-based encoders/decoders).
Develop and evaluate speech pre-processing and DSP pipelines (feature extraction, augmentation, denoising, VAD) to improve robustness across noisy, multilingual inputs.
Prototype and productionise model-serving solutions: containerised inference, latency optimisation, batching, and autoscaling for cloud and edge deployments.
Collaborate with data engineers and linguists to curate datasets, define annotation guidelines, and run rigorous evaluation (WER, CER, streaming metrics) and error-analysis cycles.
Implement reproducible training workflows, CI/CD for models, monitoring for drift and performance, and automation for retraining and A/B evaluation.
Mentor peers, author engineering-excellence patterns (testing, observability), and present technical results to product and stakeholder teams.

Skills & Qualifications :

Must-Have :

5+ years in speech recognition or related audio ML roles with proven production impact.

Strong DSP and audio analysis fundamentals (feature engineering, spectrograms, filtering, VAD).

Hands-on experience with PyTorch and/or TensorFlow for building and training ASR models.

Practical knowledge of transformer-based speech models (Wav2Vec 2.0, Whisper) and sequence losses (CTC), plus RNN/CNN architectures.

Proficient in Python; experience with C++/Java for production deployments is highly desirable.

Experience deploying models in cloud environments (AWS/GCP) and with container orchestration (Docker/Kubernetes); familiarity with MLOps tooling and CI/CD.

Preferred :

Background in multilingual ASR, low-resource languages, or on-device/edge inference optimisation.

Experience with large-scale data pipelines, annotation platforms, and semi-supervised/self-supervised learning workflows.

Familiarity with production monitoring (Prometheus/Grafana), model explainability, and privacy-preserving ML techniques.

Benefits & Culture Highlights :

High-autonomy engineering culture with strong emphasis on ownership, mentorship, and career growth.

Opportunity to influence product direction and work on state-of-the-art speech models at scale.

Competitive compensation, flexible hybrid work, and learning budget for conferences and training.

We are seeking a results-oriented Speech Scientist who thrives on technical ownership and delivering dependable voice AI in real-world settings.

Apply if you want to push ASR boundaries and build production-grade speech systems that scale.

(ref: hirist.tech)
