Talent.com
Generative AI Model Trainer

LatentForce, Republic of India, IN
8 days ago
Job description

Machine Learning Engineer III – LLM Training (RL + PEFT)

📍 On-site, Bangalore

🏢 LatentForce

About the Role

We are building specialized LLMs that understand and reason over massive enterprise codebases. This is real model training — RL loops, PEFT, verifiable rewards, long-context modeling — not API integration. You’ll own end-to-end experimentation and work directly with founders.

Responsibilities

  • Train LLMs using RL (PPO / GRPO / RLHF / RLVR) and PEFT (LoRA, QLoRA, DoRA, IA3).
  • Build custom training loops with PyTorch, HuggingFace, TRL, Unsloth.
  • Design reward functions and verifiers for code-understanding tasks.
  • Run full-stack ML experiments: data → training → eval → iteration.
  • Develop scalable training infra (FSDP / DeepSpeed, distributed training).
  • Build evaluation suites for reasoning and code comprehension.

Minimum Qualifications

  • 3+ years of real deep learning experience (actual model training).
  • Strong fundamentals: linear algebra, probability, optimization, statistics.
  • Proven experience training transformers or large DNNs from scratch or checkpoints.
  • Proficiency in PyTorch, HuggingFace, TRL, Unsloth.
  • Experience implementing RL algorithms or custom training pipelines.
  • Research exposure (publications / preprints) or strong open-source work.
  • Ability to debug training issues (NaNs, KL drift, reward hacking, etc.).
  • Startup mindset; comfortable with fast, on-site, high-performance execution.

Nice to Have

  • DeepSpeed / FSDP, model parallelism, vLLM.
  • Program analysis / AST tooling.
  • Long-context modeling experience.

Why Join Us

  • Build specialized LLMs at a well-funded early-stage company.
  • Direct work with founders; high ownership and technical depth.
  • High-impact role shaping core training architecture.

Apply here: https://forms.gle/KUyohXyBjbU8gFC69
