Job Description :
We are seeking a highly skilled Generative AI Architect / Lead with 6 to 12 years of overall experience and a strong balance of business acumen and technical expertise. The ideal candidate will have proven expertise in building, deploying, and scaling enterprise-grade GenAI solutions, with hands-on experience across advanced LLMOps, Agentic AI, and fine-tuning techniques.
Key Responsibilities :
- GenAI Solution Delivery : Design, build, and release enterprise-grade GenAI systems, including RAG pipelines, Agentic AI systems, multimodal LLMs, and foundation models.
- Agentic AI : Minimum 6 to 12 months of experience in developing, deploying, and managing Agentic AI systems, with a clear understanding of tool usage, structured outputs, speculative decoding, AST-Code RAG, streaming, and sync / async processing.
- Fine-Tuning & PEFT : Hands-on expertise in PEFT methods (LoRA / QLoRA) for model fine-tuning and optimization.
- Embedding Models : Strong knowledge of embedding models, chunking strategies, and their limitations when used in RAG pipelines.
- Hands-on Coding : Write, test, and maintain clean, efficient, and scalable code in Python for building NLP and AI systems.
- Cloud & Deployment : Deep familiarity with Azure and proven experience in deploying LLMs for large-scale inference using LLMOps techniques and orchestration frameworks.
- Tech Stack Proficiency : Strong expertise in PyTorch, TensorFlow, Kubernetes, Docker, LlamaIndex, LangChain, and LangGraph.
- Innovation & Research : Stay updated on the latest advancements in AI agents, LLM architectures, and orchestration tools, experimenting with emerging techniques to enhance system performance.
- Communication : Strong interpersonal and communication skills, with the ability to design solutions and explain complex AI concepts to both technical and business stakeholders. Interacting with CxO-level executives is a key part of the role.
Mandatory Skills :
- 6 to 12 years overall experience with strong balance of business & technical acumen
- 3+ years of GenAI development and deployment experience
- Min 6 to 12 months Agentic AI development experience
- Strong Python development skills
- Proven experience in RAG pipelines, embeddings, and chunking strategies
- Expertise in LoRA / QLoRA fine-tuning
- Hands-on coding for NLP & LLMs
- Proficiency in PyTorch, TensorFlow, LangChain, LangGraph, LlamaIndex
- Deep familiarity with Azure Cloud, LLMOps, orchestration, and large-scale inference
- Knowledge of speculative decoding, AST-Code RAG, structured outputs, streaming & async processing
Preferred Qualifications :
- Strong experience in enterprise-grade AI solution delivery
- Proven track record in building multi-modal LLM systems
- Excellent ability to bridge business goals with technical design
(ref : hirist.tech)