Description :
Job Title : Platform Engineer / Distributed Systems Engineer
Location : Full Time, In Office (Gurugram / Bengaluru)
About Us :
We are disrupting the Observability domain by leveraging AI Agents and Large Language Models (LLMs) to revolutionize monitoring, troubleshooting, and automation for applications, cloud, and on-prem infrastructure. Our mission is to build the next-generation intelligent observability platform that proactively detects, diagnoses, and resolves issues at scale. Our team consists of experienced engineers and entrepreneurs who have built multiple billion-dollar products. As a well-funded, US-based company backed by top-tier VCs, we have offices in the US, India, and Europe. Join us in our fast-paced environment where you'll help shape the future of AI-driven observability solutions.
What You'll Work On :
- Design and develop a next-generation scalable observability platform for modern cloud-native and hybrid infrastructures that
works in tandem with AI agents.
Create intelligent AI agents to analyze logs, traces, and metrics in real time, delivering automated insights and remediation.Build scalable and fault tolerant AI agent frameworksEngineer and optimize large-scale analytics pipelines to process high-velocity telemetry data.Build resilient distributed systems with high reliability, performance, and fault tolerance.Implement and fine-tune LLMs for natural language querying and automated troubleshooting.Partner with ML engineers to streamline AI model deployment and management.What We're Looking For :
Strong programming skills in Python and Golang (experience with Rust is a plus)Track record of building distributed systems and large-scale analytics pipelinesHands-on experience with cloud infrastructure (AWS, GCP, or Azure) and KubernetesDeep understanding of observability technologies (Prometheus, OpenTelemetry, Grafana, Elastic, etc.)Knowledge of LLMs, AI agents , agent frameworks liks langchain, autogen is a plusExperience with stream processing and real-time data processing frameworksProficiency in database technologies (SQL & NoSQL, Clickhouse, Time-Series DBs)7+ years of relevant experienceBachelor's degree in Computer Science, Engineering, or related field (Master's / PhD is a plus)Our Values :
Loyalty & Long-term Commitment We invest in people who invest in us.Opinionated yet Open-Minded We value strong perspectives but encourage constructive discussions.Passion We seek individuals who are passionate about their craft.Humility & Integrity Honest, transparent, and accountable team members are key.Adaptability & Self-Sufficiency Ability to thrive in a fast-paced and evolving environment.Build Fast and Break Fast We believe in rapid iteration and learning from failures.What Youll Work On : You will be instrumental in building the next-generation Observability platform that use AI agents to do resolve issues at high accuracy. You will be rethinking observability platform from scratch with AI agents in picture. Youll have the opportunity to work with an experienced team, gain deep insights into how startups are built, and be at the forefront of disruptive innovation in Observability.
If youre excited about working in an environment that values innovation, speed, and quality, wed love to hear from you!
Experience : 5 to 13 years
Compensation - upto 2Cr
(ref : hirist.tech)