Senior Observability Engineer
Location : Remote
Employment Type : 6 Month Extendable Contract
I am seeking a highly experienced Senior Observability Engineer to lead the development and implementation of a unified observability strategy across a modern, cloud-native technology stack. This is a strategic role focused on enabling full-stack visibility, optimizing performance, and ensuring reliability across distributed systems.
Key Responsibilities :
Design and implement a comprehensive observability blueprint covering frontend (React / Next.js), backend (API / GraphQL), Kubernetes infrastructure, and alert routing.
Enable Real User Monitoring (RUM) and frontend metrics, linking browser performance to backend traces.
Instrument backend services to trace user actions and identify performance bottlenecks.
Define and standardize SLO / SLI templates for latency, availability, and error rates across environments.
Develop and manage alerting strategies, including severity levels, team-based routing, and escalation workflows.
Standardize observability practices across services, including labels, dashboards, alert rules, and collector configurations.
Operate OpenTelemetry and Grafana Alloy pipelines for metrics, logs, and traces, with sampling and cardinality controls.
Maintain observability configurations as code using GitOps tools (Helm, Kustomize, Terraform).
Deliver “golden dashboards” for frontend UX, API performance, and Kubernetes / service health.
Required Experience & Skills :
Minimum 7 years in Observability, SRE, Platform Engineering, or Backend roles with production systems.
Strong Kubernetes expertise, including agent deployment and metadata enrichment.
Hands-on experience with OpenTelemetry for Go, Node.js, and browser instrumentation.
RUM experience with Datadog (preferred), Grafana, or Dynatrace.
Advanced skills in Prometheus / Grafana or Datadog, including histograms, recording rules, and alerting.
Proven ability to reduce alert noise and standardize SLOs across teams.
Ideal Candidate Profile :
Strategic thinker with a passion for clean observability architecture.
Strong communicator and collaborator across engineering and infrastructure teams.
Comfortable working in a GitOps-driven environment.
Experience in Fintech, Crypto, or E-Commerce is advantageous but not essential.
Observability • Hyderabad, Telangana, India