GenAI Lead Engineer– Job Description
Role Overview :
We’re looking for a GenAI lead to design, build, and deploy intelligent LLM-powered systems—from single-agent chatbots, copilots to complex multi-agent applications—at scale. We are particularly interested in candidates who have hands-on experience in taking GenAI applications from concept to production, especially within high-volume B2C environments. This role prioritizes individuals who understand the nuances of deploying, maintaining, and optimizing GenAI solutions for real-world users, beyond the scope of Proof-of-Concept (PoC) development. You will work across the full stack, integrating LLMs, microservices, vector databases, backend APIs, and modern cloud infrastructure.
Key Responsibilities :
1. GenAI Application Development & Deployment
Develop scalable, asynchronous microservices using Python (FastAPI) for chatbots, copilots, and agentic workflows.
Design event-driven architectures to support high concurrency , rate limiting, and real-time responsiveness .
Implement secure, versioned REST / gRPC APIs
Use Pydantic , dependency injection, and modular coding practices for maintainability.
Proficient in working with databases using ORMs like SQLAlchemy
Ensure observability using logging, metrics, tracing, and health checks.
Create responsive React.js frontends integrated via REST APIs or WebSockets.
Deploy applications on Cloud Run , GKE , using Docker, Artifact registry, CI / CD pipelines
2. LLM-Powered Conversational Interfaces
Design and build LLM-powered chatbots, voicebots, copilots and other applications using LangChain or custom orchestration frameworks.
Integrate enterprise-grade LLM APIs (Gemini, OpenAI, Claude) for multi-turn, intelligent interactions.
Implement user session management and context / state tracking for personalized and continuous conversations.
Build RAG pipelines with vector databases, knowledge graphs to ground responses with external knowledge and documents.
Apply advanced prompt engineering (ReAct, Chain-of-Thought with tool calling) for precise and goal-oriented outputs.
Ensure performance in low-latency, streaming environments using WebSockets, gRPC, and SIP media gateways.
Perform fine-tuning of open-source LLMs (LLaMA variants) using techniques like SFT, LoRA, for cost-effective domain adaptation.
Optimize high-speed inference pipelines leveraging multi-GPU clusters (up to 8x H100s) to reduce latency and improve throughput.
3. Multi-Agent Systems & Orchestration
Create multi-agent systems & Implement orchestration patterns like supervisor-agent, hierarchical, and networked agents using frameworks like ADK, Pydantic AI and LangGraph.
Use LangGraph for stateful workflows with memory, conditional branching, retries, and async execution.
Enable persistent context and long-term memory
Monitor behavior, drift, and performance using observability tools.
Skilled in developing agents with ADK and A2A protocols & experienced in configuring custom and remote MCP servers.
Preferred Tech Stack :
Languages / Frameworks : Python, FastAPI, HTML, CSS, React.js, LangChain, LangGraph, Pydatic AI, ADK (Agent Development Kit)
LLMs & Agents : OpenAI (GPT-4), Claude, Gemini, Mistral, LLaMA 3.2 / 4
Databases : BigQuery, Redis, FAISS, Pinecone, SQLAlchemy, Chroma, GCP Vector search
Protocols / APIs : REST, gRPC, WebSockets, OAuth2, OpenAPI, MCP, A2A
Additional Good to have Tech Stack :
DevOps : Docker, GitHub Actions, Jenkins, GKE, Cloud Run
Infra & Tools : GCP, Azure, Pub / Sub,Artifact Registry, NGINX, Langfuse, Postman, Pytest
Years of Experience :
Overall Experience : 10+ years of experience (with 2+ years in GenAI)
Location : Bangalore.
Lead Genai • Bengaluru, India