Key Responsibilities
- Agentic System Design : Design, develop, and deploy autonomous AI agents capable of complex, goal-oriented reasoning, planning, and task execution using modern agentic frameworks.
- RAG System Development : Build and optimize robust Retrieval-Augmented Generation (RAG) pipelines to ground LLMs in proprietary data, ensuring factual accuracy and data security.
- Tool Integration & Function Calling : Equip AI agents with the ability to use external APIs, databases, and custom tools to perform actions in the real world.
- Orchestration Frameworks : Master and implement core logic using orchestration tools like LangChain, LlamaIndex, LangGraph, or CrewAI to manage conversation state and agent workflows.
- Prompt Engineering & Alignment : Develop systematic Prompt Engineering strategies to maximize agent reliability and output quality.
- Production Deployment : Develop secure, low-latency API services (using Python / FastAPI ) to serve LLM applications, ensuring high availability and scalability.
- Evaluation & Quality Assurance (Evals) : Design and implement continuous evaluation frameworks to measure performance against business metrics. This includes developing ground truth datasets , implementing LLM-as-a-Judge scoring, and monitoring for hallucinations, factual correctness, and prompt injection .
Required Skills and Qualifications
Programming : 3+ years of professional software development experience, with expert proficiency in Python .LLM Application Experience : Direct experience building and deploying production-level applications using major LLM APIs or open-source models.Core RAG Expertise : Deep practical knowledge of designing, implementing, and optimizing a RAG system and proficiency with Vector Databases (e.g., Pinecone, Chroma, Milvus).Agent Orchestration : Proven experience with at least one major agent orchestration library ( LangChain, LlamaIndex, or similar state machine / graph-based frameworks ).Evaluation Tools : Hands-on experience with LLM evaluation frameworks like RAGAS, DeepEval, or LangSmith to create automated performance benchmarks.Back-end Engineering : Strong experience developing and maintaining RESTful APIs and integrating with various data sources (SQL / NoSQL).DevOps Fundamentals : Working knowledge of Docker, CI / CD pipeline and cloud infrastructure for deployment.Skills Required
Sql, Nosql, Docker, FastAPI, Restful Apis, Python