Position : Data Scientist (Generative AI Text, Audio, Video)
Experience : 4+ Years
Engagement : Permanent
Notice Period : Immediate Joiner
Location : Remote (with occasional travel covered)
Work Arrangement : Remote
Position Overview :
We are hiring Data Scientists specializing in Generative AI to build breakthrough applications in text, audio, and video generation. The role is hands-on, research-driven, and involves taking experiments from concept to deployment in a fast-paced environment.
Key Responsibilities :
- Fine-tuning and optimizing LLMs for advanced text generation
- Developing Whisper-based pipelines for multilingual speech recognition and audio generation
- Experimenting with diffusion models, LoRA, RLHF, and SFT for text-to-audio and text-to-video tasks
- Building lip-sync systems that feel natural and emotionally consistent
- Ensuring character consistency in video generation pipelines
- Designing and implementing automated evaluation systems to raise quality standards for AI-generated content
- Collaborating with cross-functional teams across product, engineering, and research to deploy real-world AI experiences at scale
Required Skills :
47 years of Data Science / Machine Learning experienceDepth in at least one generative modality : Text, Audio, or VideoStrong coding skills (Python, PyTorch, TensorFlow, CUDA)Proven experience with transformer architectures, diffusion models, or generative pipelinesFamiliarity with model fine-tuning (LoRA, PEFT), RLHF, SFT, and large-scale trainingAbility to take ownership from research and prototyping to production deploymentStrong problem-solving skillsComfort working in a high-velocity, research-oriented environment(ref : hirist.tech)