Job Opportunity
We are seeking an experienced Research Engineer with expertise in Multimodal Large Language Models (m-LLMs), Text-to-Speech (TTS) technologies, and agentic workflows.
Key Responsibilities
- Develop state-of-the-art models integrating text, audio, and visual inputs.
- Design experiments to test new algorithms and architectures for TTS applications and agentic workflows.
- Stay updated with the latest advancements in m-LLMs, TTS, and AI agents.
- Participate in developing, fine-tuning, and evaluating AI models for complex NLP tasks, focusing on TTS and multimodal integration.
- Optimize models for scalability, efficiency, and deployment in real-world production systems.
- Experiment with novel methods to enhance naturalness and responsiveness of TTS systems.
- Work with cross-functional teams to translate research insights into tangible products utilizing m-LLMs and TTS technologies.
- Engage in a culture of learning and innovation, contributing to team knowledge sharing on m-LLMs, TTS, and AI agents.
- Collaborate with external research communities, contributing to conferences and publications.
Required Skills and Qualifications
- Educational Background: Advanced degree in Computer Science, Machine Learning, NLP, or a closely related field.
- Technical Expertise: Experience or coursework in large language models, deep learning, NLP, and TTS technologies. Proficiency in programming languages like Python, with experience in frameworks like TensorFlow or PyTorch.
Benefits
- Opportunity to work on cutting-edge AI projects.
- Collaborative and innovative work environment.
- Chance to contribute to the development of new technologies.