Talent.com
Speech Recognition Consultant

Speech Recognition Consultant

Sony Research IndiaPushkar, Republic Of India, IN
16 days ago
Job description

Sony Research India is driving cutting-edge research and development in various locations around the globe, including laboratories in Japan, the United States, Europe, and Asia. We endeavor to create new technology, products, and services while sustaining Sony Group’s diverse businesses in electronics, entertainment, and financial fields. For our research centre to blaze a trail in the latest technologies, we seek to foster the growth of a diverse pool of research and engineering talent and create a technology talent bank to drive research excellence worldwide. Sony Research India is offering outstanding career opportunities around frontline technologies such as AI and data analytics.

Sony Research India is seeking a dynamic and motivated Speech Recognition Consultant to join our innovative research team. As a Consultant, you will work on real-world problems in automatic speech recognition (ASR), focusing on improving noise robustness and minimizing code-switching errors in transcription outputs. You'll gain hands-on experience with state-of-the-art tools and datasets, and contribute to impactful projects alongside experienced researchers and engineers.

Key Responsibility :

  • Explore and develop techniques to enhance ASR robustness under noisy, low-resource, and domain-shifted conditions.
  • Investigate code-switching errors in end-to-end ASR models (e.G., Whisper, Wav2Vec2, etc.) and propose mitigation strategies.
  • Conduct experiments using large-scale speech datasets and evaluate ASR performance across varying noise levels and linguistic diversity.
  • Contribute to publications, technical reports, or open-source tools as outcomes of the research.

Work Location :

  • Remote within India,
  • Duration of the paid contractual role :

  • The annual paid direct contractual tenure is extendable.
  • Ideally this position will start from first week of November 2025.
  • The working hours are from 9 : 00 to 18 : 00 (Monday to Friday) full-time.
  • Essential Education :

  • Completed Ph.D. / Bachelor’s or Master’s (Research) degree with some industry experience in deep learning or machine learning, and hands-on expertise with Transformer models applied to audio or speech tasks.
  • Must Have Skills & Abilities :

  • Excellent coding skills, especially in Python and PyTorch.
  • Experience with speech processing libraries (e.G., Torchaudio, ESPnet, Hugging Face Transformers).
  • Prior experience with ASR models like Wav2Vec2, Whisper, or RNN-T is a plus.
  • Ability to read and implement academic papers.
  • Strong foundation in machine learning and signal processing.
  • Good to Have Skills :

  • Familiarity with prompt tuning, contrastive learning, or multi-modal architectures.
  • Experience with multilingual ASR.
  • Papers in top-tier conferences like ICASSP, Interspeech, NeurIPS, AAAI, ACL, etc.
  • Our Values :

  • Dreams & Curiosity : Pioneer the future with dreams and curiosity.
  • Diversity : Pursue the creation of the very best by harnessing diversity and varying viewpoints.
  • Integrity & Sincerity : Earn the trust for Sony brand through ethical and responsible conduct.
  • Sustainability : Fulfil our stakeholder responsibilities through disciplined business practices.
  • Sony Research India is committed to equal opportunity in all its employment practices, policies and procedures and to ensuring that no worker or potential worker will receive less favourable treatment due to any characteristic protected under applicable local laws.

    Create a job alert for this search

    Consultant • Pushkar, Republic Of India, IN

    Related jobs
    • Promoted
    Speech Recognition Consultant

    Speech Recognition Consultant

    Sony Research IndiaAjmer, Rajasthan, India
    Sony Research India is driving cutting-edge research and development in various locations around the globe, including laboratories in Japan, the United States, Europe, and Asia.We endeavor to creat...Show moreLast updated: 23 days ago
    • Promoted
    Biztalk Consultant

    Biztalk Consultant

    MSHpushkar, gujarat, in
    BizTalk Administration (installation, configuration, maintenance).BizTalk Server troubleshooting & ticket resolution.On-call support & incident management. Patching, updates, and upgrades for BizTal...Show moreLast updated: 21 days ago
    • Promoted
    • New!
    Echo Technician

    Echo Technician

    Tricog HealthAjmer, Republic Of India, IN
    Founded in 2014, Tricog is one of the world’s largest predictive healthcare analytics firms.Tricog was first started by Dr Charit Bhograj — an Interventional Cardiologist — who realized that the ca...Show moreLast updated: 15 hours ago
    • Promoted
    Speech Language Pathologist

    Speech Language Pathologist

    1SpecialPlacePushkar, IN
    Speech Language Pathologist (SLP).SpecialPlace is seeking a dedicated and skilled Speech Language Pathologist (SLP) to join our team. The role will involve online therapy services, providing compreh...Show moreLast updated: 2 days ago
    • Promoted
    Public Speaking Facilitator

    Public Speaking Facilitator

    TalentGumPushkar, IN
    We are hiring for the following time slots-.TalentGum is a leading e-learning platform launched in 2021 that aspires to transform the scope of extra-curricular education globally by encouraging the...Show moreLast updated: 30+ days ago
    • Promoted
    Baker Tournant

    Baker Tournant

    Celebrity CruisesAjmer, IN
    Ensures the smooth and efficient operation and control of the Bakery Shop and production daily according to company policies. Responsible for the production, quality, and presentation of the bread a...Show moreLast updated: 30+ days ago
    • Promoted
    Audio Transcribers

    Audio Transcribers

    Innodata Inc.pushkar, India
    We’re Hiring : Freelance Transcribers.English audio files (primarily podcast episodes).Listen to and transcribe audio files with high accuracy and attention to detail. Follow specific formatting and ...Show moreLast updated: 12 days ago
    • Promoted
    Social Listening Consultant

    Social Listening Consultant

    ListenFirstAjmer, Rajasthan, India
    About the Role We are seeking a proactive and insights-driven Consultant to join our Digital Insights Delivery team.This role is perfect for professionals with hands-on experience using social medi...Show moreLast updated: 23 days ago
    • Promoted
    Remote Baker Role - 42981

    Remote Baker Role - 42981

    Turingajmer, India
    Remote
    Turing is one of the world’s fastest-growing AI companies, accelerating the advancement and deployment of powerful AI systems. Turing helps customers in two ways : Working with the world’s leading AI...Show moreLast updated: 16 days ago
    • Promoted
    Audio Transcription Specialist

    Audio Transcription Specialist

    Innodata Inc.Pushkar, IN
    Innodata (NASDAQ : INOD) is a leading data engineering company.With more than 2,000 customers and operations in 13 cities around the world, we are an AI technology solutions provider-of-choice for 4...Show moreLast updated: 12 days ago