Talent.com
No longer accepting applications
Senior AI Fine-Tuning Engineer (LLMs / RLHF / LoRA)

Koda Integrated Marketing Services • Bengaluru, Karnataka, India
1 day ago
Job description

We're hiring on behalf of Entermind.com

Location: Kuala Lumpur, Malaysia

Type of employment: Full-time | On-site

Apply: teyna@entermind.com

Open to candidates from Bangalore

The Role

We are seeking a Senior AI Fine-Tuning Engineer with 8+ years of experience to lead the design, development, and deployment of custom-tuned large language models for enterprise clients.

You will be the technical authority on model adaptation, alignment, and optimization, translating business requirements into fine-tuning strategies that deliver superior performance on domain-specific tasks. You'll architect training pipelines, implement RLHF workflows, optimize model inference, and ensure our AI systems meet enterprise standards.

This is not a research position; it's a senior engineering role focused on applied AI that ships to production and drives business outcomes.

What You'll Do

Model Fine-Tuning and Alignment

  • Design fine-tuning strategies using SFT, instruction tuning, and RLHF (PPO/DPO)
  • Implement parameter-efficient methods (LoRA, QLoRA, Adapters) for cost-effective adaptation

  • Apply constitutional AI and safety alignment techniques

Training Infrastructure & MLOps

  • Build end-to-end training pipelines with distributed systems (DeepSpeed, FSDP)
  • Implement experiment tracking, model versioning, and reproducibility

RAG and Hybrid Systems

  • Design retrieval-augmented generation systems with semantic search
  • Build and optimize vector databases (Pinecone, Weaviate, Qdrant, Milvus)

Enterprise Integration

  • Deploy models with serving infrastructure (vLLM, TensorRT-LLM)
  • Implement quantization (GPTQ, AWQ) and inference optimization

Technical Leadership

  • Mentor engineers on fine-tuning best practices and MLOps
  • Lead client technical discussions and solution design sessions

What You Bring

Core Requirements

  • 8–12 years in ML/AI engineering, with a minimum of 2 years on LLM fine-tuning
  • Proven track record of shipping production LLM systems with business impact
  • Deep expertise in PyTorch/TensorFlow/JAX and transformer architectures

Technical Expertise

  • Model Fine-Tuning: RLHF/RLAIF (PPO, DPO), LoRA, QLoRA, instruction tuning, alignment techniques
  • Open-Source LLMs: Llama 3/3.1/3.2, Mistral/Mixtral, Qwen 2.5, Falcon, Phi, Gemma
  • RAG Systems: Vector databases (Pinecone, Weaviate, Qdrant), orchestration (LangChain, LlamaIndex)
  • MLOps: Distributed training (DeepSpeed, FSDP), model serving (vLLM, TGI), quantization (GPTQ, AWQ)
  • Data and Evaluation: Custom benchmarks, data governance, synthetic data generation

Professional Competencies

  • Strong technical communication with stakeholders
  • Client-facing experience in solution design
  • Leadership maturity with mentoring capabilities

About Entermind (https://www.entermind.com/)

Entermind is a leading Data & AI consulting firm specializing in enterprise-grade AI transformation. We architect, fine-tune, and deploy production-ready AI systems that solve

Our solutions blend cutting-edge research with battle-tested engineering to create systems that are accurate, reliable, and production-ready.
