Data Scientist (Video & Image Generation) - 3+ Years, immediate joiner, location agnostic
Required / Must-Have
- Strong background in Machine Learning, Deep Learning, and Computer Vision
- Hands-on experience with Generative AI models (diffusion, transformer, or latent video models)
- Proficiency in Python and ML frameworks such as PyTorch or TensorFlow
- Experience designing and optimizing video and image generation pipelines
- Familiarity with Azure Machine Learning and cloud-based training / inference workflows
- Experience with GPU-based training, distributed computing, and model deployment
- Knowledge of multimodal data handling (text, image, video, audio)
- Experience working with models like Sora, Veo, or Gemini (or similar multimodal systems)
- Understanding of prompt engineering, fine-tuning, and LoRA / adapters for model adaptation
- Proficiency with tools like Docker, Kubernetes, Databricks, MLflow, and Git
- Strong analytical, problem-solving, and collaboration skills
Nice-to-Have
- Familiarity with Java, React, and Spring Boot for integrating AI capabilities into production tools
- Knowledge of model evaluation metrics for visual and temporal quality (e.g., FVD, SSIM, CLIPScore)
- Experience with OpenCV and FFmpeg for video preprocessing and postprocessing
- Awareness of recent research in video diffusion, temporal consistency, and multimodal generation
- Prior contributions to open-source AI projects or published research