AI Engineer — Image‑to‑Video (Mid‑Level)
Location : Mumbai (on‑site / hybrid)
Contract : 6 months, extendable
Start : ASAP
What you’ll do
Build, fine‑tune, and ship image‑to‑video generation pipelines (prompt‑to‑video, storyboard‑to‑video, identity‑preserving headshots)
Integrate and iterate on SOTA components (Stable Video Diffusion, AnimateDiff, LTX‑Video / 13B variants, CogVideo‑X, ControlNet‑style conditioning).
Optimize inference for throughput and latency (TorchScript / ONNX, TensorRT, CUDA kernels, xFormers / Flash‑Attention, mixed precision).
Handle multi‑GPU training / inference (DDP, gradient checkpointing, sharded weights, efficient sampling).
Own dataset curation and augmentation for faces / motion; enforce consent, licensing, and privacy.
Build evaluation loops and dashboards (FVD, CLIP / ID‑similarity, temporal consistency, face‑ID retention).
Productionize with Docker and CI / CD; wire up tracking (W&B / ClearML) and experiment reproducibility.
Collaborate with design and product to convert creative briefs into deployable features and A / B tests.
Must‑have
3–5 years total software / ML experience with 1–2+ years in generative video or diffusion work.
Strong Python + PyTorch, Diffusers, and CV fundamentals (spatiotemporal models, sampling).
Proven experience with multi‑GPU (DDP / NCCL) and performance profiling on Linux.
Solid grasp of FFmpeg, video codecs / bitrates, and post‑processing pipelines.
Portfolio : repo(s), demo links, or a short reel showing your image‑to‑video work.
Nice‑to‑have
Experience with ComfyUI nodes / graphs, LoRA / ControlNet training, face‑ID preservation, or lip‑sync.
Triton kernels, custom schedulers / samplers, quantization (INT8 / FP8) for fast inference.
MLOps on AWS / GCP / Azure, Kubernetes, vector stores, prompt orchestration.
Tools you’ll touch
PyTorch, Diffusers, CUDA, TensorRT / ONNX, xFormers / Flash‑Attention, FFmpeg, Docker, W&B / ClearML, ComfyUI, GitHub Actions.
What we offer
Competitive contract compensation (INR, market‑aligned) with extension potential.
High‑impact ownership on production creative pipelines.
Modern GPU stack and a fast path from prototype → production.
How to apply
Email hr@mugshotstudios.com with subject “AI Engineer — Image‑to‑Video (Mumbai)” and include :
Ai Engineer • Mumbai, Maharashtra, India