Sony Research India is seeking a dynamic and motivated Speech Recognition Intern to join our innovative research team. As a Speech Recognition Intern, you will have the opportunity to work on cutting-edge projects in the field of speech recognition technologies. This internship is designed for individuals passionate about advancing their skills and knowledge in speech recognition, machine learning, and artificial intelligence.
Key Responsibilities :
- Research and Development : Collaborate with our research team to design, implement state-of-the-art speech recognition and speaker diarization algorithms and models.
- Algorithm Optimization : Work on optimizing existing speech recognition algorithms for enhanced accuracy, speed, and efficiency.
- Production Ready API : Work towards real-time development of speech algorithms and models.
- Stay Current : Stay updated of the latest developments in the field of speech recognition and contribute insights to enhance the team's knowledge base.
Work Location : - Remote
Duration of the paid Internship :
This paid internship will be for a period of 6 months starting January first week of 2026.9 : 00 to 18 : 00 (Monday to Friday).Qualification :
Currently pursuing / completed Master's (Research) or Ph.D. in deep learning / machine learning with hands-on experience on Transformer and models with an applications audio / speech.Must Have Skills :
Strong programming skills in Python, shell scripting, PERLHands-on deep learning, machine learning (Pytorch, Tensorflow)Sound knowledge of speech technologies, libraries and toolkits like ESPNET, SpeechBrain, etc.Good to have skills :
Expertise in PyTorch and Shell Scripting.Prior experience in the development of Indian Languages ASR and Speaker diarization systemsPrior experience at publishing paper to top conferences like, ICML, AAAI, Interspeech, ICASSP.