About ConvoZen AI
At ConvoZen AI, we are at the forefront of the generative AI revolution. We empower businesses to build, deploy, and manage sophisticated AI agents that drive real-world value. We are building a scalable, reliable, and cutting-edge platform to power the next generation of AI-driven applications.
The Role
The Lead AI Engineer is a senior, hands-on technical leader responsible for architecting and building the production systems that power our generative AI platform. You will be our senior expert on engineering and deploying complex, scalable AI systems, with a special focus on agentic AI and MLOps for LLMs.
This is a "player-coach" role for a seasoned engineer who loves building robust, high-performance infrastructure. You will lead the design of our AI agent framework, own the MLOps lifecycle, and build the core components that bridge the gap between AI research and production reality. You will collaborate closely with Data Science, Product, and our Forward Deployed Engineers to create a world-class platform.
What You'll Do
- Lead the design, development, and scaling of our core AI agent architecture, including multi-agent orchestration, tool integration, and state management.
- Architect and build high-throughput, low-latency RAG (Retrieval-Augmented Generation) pipelines, optimizing for cost, speed, and accuracy.
- Design and implement robust MLOps infrastructure for the entire LLM lifecycle (e.g., model serving, evaluation, monitoring, and continuous deployment of fine-tuned models).
- Build and maintain the core infrastructure for deploying and managing LLM-powered applications in a secure, scalable, and cost-effective way.
- Implement comprehensive monitoring, logging, and observability for our AI systems to track performance, detect drift, and ensure reliability.
- Own the CI/CD pipelines for AI components, ensuring rapid and safe iteration.
- Lead the development of our core agent-building frameworks and libraries, making them more powerful, efficient, and easier for our customers to use.
- Collaborate with the Data Science team to productionize their research, turning new models and algorithms (e.g., novel RAG techniques, fine-tuned models) into scalable platform features.
- Partner closely with the Product team to translate platform requirements into concrete engineering designs and roadmaps.
- Act as a key technical partner to our Forward Deployed Engineering team, providing them with the robust tools and platform they need to succeed with customers.
- Mentor junior engineers on the team, establishing best practices for software engineering, system design, and MLOps.
Who You Are (Qualifications)
- MS in Computer Science, Software Engineering, or a related field.
- 7+ years of hands-on software engineering experience, with at least 4 years focused on building and deploying production-grade Machine Learning or AI systems.
- Demonstrable, hands-on experience in engineering production systems using the modern Gen AI stack. You must have:
  - Proven experience building and scaling RAG pipelines.
  - Deep expertise with agentic frameworks (e.g., LangChain, LlamaIndex) and a strong understanding of how to build custom, production-ready agentic systems from the ground up.
  - Experience with MLOps for LLMs, including model serving, monitoring, and evaluation.
- Expert-level proficiency in Python.
- Strong experience with cloud platforms (AWS, GCP, or Azure), containerization (Docker, Kubernetes), and CI/CD systems.
- Experience building and scaling data-intensive, real-time applications.
- Deep experience with vector databases (e.g., Pinecone, Weaviate, Milvus).
- Experience with data processing and pipeline tools (e.g., Kafka, Spark, Airflow).
- Strong understanding of REST APIs, gRPC, and microservices architecture.
- You are a systems-level thinker who can manage complexity, work with ambiguity, and build for scale.
Bonus Points
- Experience building and deploying AI models in a low-latency, real-time environment.
- Contributions to open-source AI/ML or MLOps projects.
- Experience with GPU-based infrastructure and inference optimization (e.g., vLLM, TensorRT-LLM).
- Experience leading small engineering project teams or pods.
Why Join ConvoZen AI?
- Build the Engine: You won't just use AI; you will build the high-performance engine that powers our entire platform and all our customers.
- Work on the Frontier: You'll be solving the hardest engineering problems in AI today: scaling agentic systems, optimizing LLM inference, and building for massive reliability.
- High-Impact Role: Your work will be the foundation for every feature we build and every customer we serve. You will have a direct and measurable impact on the company's success.
- World-Class Team: We are a small, focused team of passionate, brilliant, and kind people who are dedicated to building something truly transformative.