Data Scientist (Video & Image Generation) - 3+ Years, immediate joiner, location agnostic
Required / Must-Have
Strong background in Machine Learning, Deep Learning, and Computer Vision
Hands-on experience with Generative AI models (diffusion, transformer, or latent video models)
Proficiency in Python and ML frameworks such as PyTorch or TensorFlow
Experience designing and optimizing video and image generation pipelines
Familiarity with Azure Machine Learning and cloud-based training / inference workflows
Experience with GPU-based training, distributed computing, and model deployment
Knowledge of multimodal data handling (text, image, video, audio)
Experience working with models like Sora, Veo, or Gemini (or similar multimodal systems)
Understanding of prompt engineering, fine-tuning, and LoRA / adapters for model adaptation
Proficiency with tools like Docker, Kubernetes, Databricks, MLflow, and Git
Strong analytical, problem-solving, and collaboration skills
Nice-to-Have
Familiarity with Java, React, and Spring Boot for integrating AI capabilities into production tools
Knowledge of model evaluation metrics for visual and temporal quality (e.g., FVD, SSIM, CLIPScore)
Experience with OpenCV and FFmpeg for video preprocessing and postprocessing
Awareness of recent research in video diffusion, temporal consistency, and multimodal generation
Prior contributions to open-source AI projects or published research
Data Scientist • Ajmer, Rajasthan, India