We are looking for a Senior LLM Engineer to design, build, and optimize intelligent agents powered by Large Language Models (LLMs). You will work on cutting-edge AI applications, pre-train LLMs, fine-tune open-source models, integrate multi-agent systems, and deploy scalable solutions in production environments.
Key Responsibilities
- Develop and fine-tune LLM-based models and AI agents for automation, reasoning, and decision-making.
- Build multi-agent systems that coordinate tasks efficiently.
- Design prompt-engineering strategies, retrieval-augmented generation (RAG) pipelines, and memory architectures.
- Optimize inference performance and reduce hallucinations in LLMs.
- Integrate LLMs with APIs, databases, and external tools for real-world applications.
- Implement reinforcement learning from human feedback (RLHF) and continual learning strategies.
- Collaborate with research and engineering teams to enhance model capabilities.
Requirements
- 5+ years in AI/ML, with at least 2 years in LLMs or AI agents.
- Strong experience in Python, LangChain, LlamaIndex, AutoGen, Hugging Face, etc.
- Experience with open-source LLMs (LLaMA, Mistral, Falcon, etc.).
- Hands-on experience deploying LLMs for high-performance inference using robust frameworks such as vLLM.
- Experience building multi-modal RAG systems.
- Knowledge of vector databases (FAISS, Chroma) for retrieval-based systems.
- Experience with LLM fine-tuning, downscaling, prompt engineering, and model inference optimization.
- Familiarity with multi-agent systems, cognitive architectures, or autonomous AI workflows.
- Expertise in cloud platforms (AWS, GCP, Azure) and scalable AI deployments.
- Strong problem-solving and debugging skills.
Skills Required
Falcon, GCP, Azure, Python, AWS