Job Summary :
We are seeking a highly skilled and innovative LLM AI Engineer to join our AI / ML team. The ideal candidate will specialize in building, fine-tuning, and deploying Large Language Models (LLMs) to solve real-world problems across various domains. You will be responsible for developing state-of-the-art language models, integrating them into production systems, and collaborating with cross-functional teams to deliver impactful AI Responsibilities :
- Design, develop, and fine-tune large-scale transformer-based models (e.g., GPT, BERT, LLaMA, Mistral, etc.).
- Perform domain-specific adaptation and fine-tuning of foundation models using techniques like LoRA, PEFT, QLoRA, and prompt tuning.
- Collaborate with data engineers and domain experts to curate, preprocess, and label high-quality datasets for training and evaluation.
- Implement efficient training, evaluation, and deployment pipelines using frameworks such as PyTorch, TensorFlow, HuggingFace Transformers, and LangChain.
- Optimize LLM inference and memory usage for scalability and latency reduction in production environments.
- Develop APIs and tools to serve LLM-powered applications in real time.
- Stay current with the latest research and advancements in NLP and LLMs and propose improvements or adoption of emerging techniques.
- Ensure responsible AI usage by incorporating fairness, safety, interpretability, and compliance :
Must-Have :
Bachelors or Masters degree in Computer Science, AI, Machine Learning,2- 5+ years of experience in machine learning or natural language processing.Hands-on experience working with LLMs (GPT-2 / 3 / 4, LLaMA, Mistral, Claude, PaLM, etc.).Strong programming skills in Python and experience with ML frameworks (e.g., PyTorch, HuggingFace Transformers, TensorFlow).Familiarity with prompt engineering, few-shot learning, retrieval-augmented generation (RAG), and model fine-tuning techniques.Experience with REST APIs, containerization (Docker), and ML Ops tools (e.g., MLflow, Weights & Biases).Good understanding of model evaluation metrics (BLEU, ROUGE, perplexity, hallucination :Experience deploying LLMs on cloud platforms (AWS / GCP / Azure) or using services like SageMaker, Vertex AI, or Azure OpenAI.Exposure to privacy-preserving ML, model distillation, or federated learning.Familiarity with vector databases (Pinecone, Weaviate, FAISS) and embedding techniques.Experience with RLHF (Reinforcement Learning from Human Feedback).Contributions to open-source LLM projects or research Skills :Strong analytical thinking and problem-solving abilities.Excellent communication and documentation skills.Collaborative mindset and ability to work in fast-paced environments.Passion for AI / ML and a continuous learning attitude.(ref : hirist.tech)