No longer accepting applications

Senior AI Fine-Tuning Engineer (LLMs / RLHF / LoRA)

Koda Integrated Marketing ServicesBengaluru, Karnataka, India

1 day ago

Job description

We're hiring on behalf of Entermind.com

Location : Kuala Lumpur, Malaysia

Type of employment : Full-time | On-site

Apply teyna@entermind.com

Open to candidates from Bangalore

The Role

We are seeking a Senior AI Fine-Tuning Engineer with 8+ years of experience to lead the

design, development, and deployment of custom-tuned large language models for enterprise

clients.

You will be the technical authority on model adaptation, alignment, and

optimization—translating business requirements into fine-tuning strategies that deliver

superior performance on domain-specific tasks. You'll architect training pipelines, implement

RLHF workflows, optimize model inference, and ensure our AI systems meet enterprise

standards.

This is not a research position - it's a senior engineering role focused on applied AI

that ships to production and drives business outcomes.

What You'll Do

Model Fine-Tuning and Alignment

Design fine-tuning strategies using SFT, instruction tuning, and RLHF (PPO / DPO)
Implement parameter-efficient methods (LoRA, QLoRA, Adapters) for cost-effective

adaptation

Apply constitutional AI and safety alignment techniques

Training Infrastructure & MLOps

Build end-to-end training pipelines with distributed systems (DeepSpeed, FSDP)

Implement experiment tracking, model versioning, and reproducibility

RAG and Hybrid Systems

Design retrieval-augmented generation systems with semantic search

Build and optimize vector databases (Pinecone, Weaviate, Qdrant, Milvus)

Enterprise Integration

Deploy models with serving infrastructure (vLLM, TensorRT-LLM)

Implement quantization (GPTQ, AWQ) and inference optimization

Technical Leadership

Mentor engineers on fine-tuning best practices and MLOps

Lead client technical discussions and solution design sessions

What You Bring

Core Requirements

8–12 years in ML / AI engineering with minimum 2 years on LLM fine-tuning

Proven track record shipping production LLM systems with business impact

Deep expertise in PyTorch / TensorFlow / JAX and transformer architectures

Technical Expertise

Model Fine-Tuning : RLHF / RLAIF (PPO, DPO), LoRA, QLoRA, instruction tuning, alignment

techniques

Open-Source LLMs : Llama 3 / 3.1 / 3.2, Mistral / Mixtral, Qwen 2.5, Falcon, Phi, Gemma

RAG Systems : Vector databases (Pinecone, Weaviate, Qdrant), orchestration (LangChain,

LlamaIndex)

MLOps : Distributed training (DeepSpeed, FSDP), model serving (vLLM, TGI), quantization

(GPTQ, AWQ)

Data and Evaluation : Custom benchmarks, data governance, synthetic data generation

Professional Competencies

Strong technical communication with stakeholders

Client-facing experience in solution design

Leadership maturity with mentoring capabilities

About Entermind (https : / / www.entermind.com / )

Entermind is a leading Data & AI consulting firm specializing in enterprise-grade AI

transformation. We architect, fine-tune, and deploy production-ready AI systems that solve

Our solutions blend cutting-edge research with battle-tested engineering to create systems

that are accurate, reliable, and production-ready.

Create a job alert for this search

Senior Ai Engineer • Bengaluru, Karnataka, India