About Xenonstack
XenonStack is the fastest-growing Data and AI Foundry for Agentic Systems , enabling people and organizations to gain real-time and intelligent business insights .
We Deliver Innovation Through
- Agentic Systems for AI Agents → akira.ai
- Vision AI Platform → xenonstack.ai
- Inference AI Infrastructure for Agentic Systems → nexastack.ai
Our mission is to accelerate the world's transition to AI + Human Intelligence by making AI agents enterprise-ready, reliable, and production-grade .
THE OPPORTUNITY
We are seeking an AgentOps Engineer to deploy, monitor, and optimize agentic AI systems in production environments .
This role is at the core of AI observability and operational reliability , ensuring that multi-agent workflows perform consistently, safely, and efficiently. You'll work at the intersection of MLOps, DevOps, and Agentic AI , enabling enterprises to confidently adopt AI agents at scale.
Key Responsibilities
Agent Deployment & OperationsDeploy and maintain LLM-powered and multi-agent systems across cloud and on-prem environments.Integrate agents with enterprise APIs, knowledge bases, and third-party systems.Monitoring & ObservabilityImplement agentic observability frameworks to track performance, latency, cost, and accuracy.Monitor execution traces, context windows, and agent interactions for anomalies.Optimization & ReliabilityFine-tune agent configurations for cost efficiency, scalability, and response quality.Implement fallbacks, guardrails, and redundancy mechanisms to ensure reliability.Evaluation & Feedback LoopsBuild automated pipelines to evaluate agent performance, safety, and compliance.Feed test results into continuous improvement loops with ML / AI engineers.Security & ComplianceEnsure agents follow enterprise security, governance, and compliance standards.Collaborate with Responsible AI teams to implement trust, safety, and audit mechanisms.Cross-Functional CollaborationWork with AI engineers, DevOps, and product managers to align operational reliability with business outcomes.Support customer deployments with customized monitoring and tuning.Skills & Qualifications
Must-Have
2–5 years of experience in DevOps, MLOps, or AI systems engineering.Hands-on with LLM orchestration frameworks (LangChain, LangGraph, LlamaIndex).Familiarity with AgentOps tools (LangSmith, PromptLayer, Weights & Biases, Arize AI).Proficiency in Python and scripting for automation.Experience with cloud platforms (AWS, GCP, Azure) and containerization (Docker, Kubernetes).Knowledge of monitoring & observability tools (Prometheus, Grafana, ELK, OpenTelemetry).Understanding of RAG pipelines, vector databases, and context orchestration.Good-to-Have
Exposure to multi-agent orchestration (MCP, A2A messaging, AgentBridge).Experience in Responsible AI / model evaluation frameworks.Familiarity with CI / CD pipelines for AI models and agents.Background in BFSI, GRC, SOC, or enterprise SaaS systems.WHY SHOULD YOU JOIN US
Agentic AI Product CompanyWork on next-gen AI platforms where agent reliability and observability define enterprise adoption.
A Fast-Growing Category LeaderJoin one of the fastest-growing AI Foundries , powering mission-critical AI agents for global enterprises.
Career Mobility & GrowthGrow into roles such as AgentOps Lead, Reliability Engineer, or AI Systems Architect .
Global ExposureManage enterprise-scale AgentOps deployments across regulated industries worldwide.
Create Real ImpactEnsure that AI agents in production deliver measurable business outcomes .
Culture of ExcellenceOur values — Agency, Taste, Ownership, Mastery, Impatience, and Customer Obsession — empower you to build, innovate, and own outcomes.
Responsible AI FirstContribute to trustworthy, explainable, and compliant AI agents that enterprises can rely on.
XENONSTACK CULTURE – JOIN US & MAKE AN IMPACT!
At XenonStack, we believe in shaping the future of intelligent systems . We foster a culture of cultivation built on bold, human-centric leadership principles, where deep work, simplicity, and adoption define everything we do.
Our Cultural Values
Agency – Be self-directed and proactive.Taste – Sweat the details and build with precision.Ownership – Take responsibility for outcomes.Mastery – Commit to continuous learning and growth.Impatience – Move fast and embrace progress.Customer Obsession – Always put the customer first.Our Product Philosophy
Obsessed with Adoption – Making AI agents reliable and enterprise-ready.Obsessed with Simplicity – Turning complex agent operations into seamless, intuitive workflows.Be a part of our mission to accelerate the world's transition to AI + Human Intelligence — by ensuring AI agents are reliable, secure, and production-ready .
Skills Required
Elk, Prometheus, Grafana, Docker, containerization , Python, Aws, Gcp, Azure, Kubernetes