Talent.com
Observability Specialist

Observability Specialist

Halian | Managed Services, Recruitment Agency & Contract StaffingThoothukudi, IN
4 hours ago
Job description

Senior Observability Engineer

Location : Remote

Employment Type : 6 Month Extendable Contract

I am seeking a highly experienced Senior Observability Engineer to lead the development and implementation of a unified observability strategy across a modern, cloud-native technology stack. This is a strategic role focused on enabling full-stack visibility, optimizing performance, and ensuring reliability across distributed systems.

Key Responsibilities :

  • Design and implement a comprehensive observability blueprint covering frontend (React / Next.js), backend (API / GraphQL), Kubernetes infrastructure, and alert routing.
  • Enable Real User Monitoring (RUM) and frontend metrics, linking browser performance to backend traces.
  • Instrument backend services to trace user actions and identify performance bottlenecks.
  • Define and standardize SLO / SLI templates for latency, availability, and error rates across environments.
  • Develop and manage alerting strategies, including severity levels, team-based routing, and escalation workflows.
  • Standardize observability practices across services, including labels, dashboards, alert rules, and collector configurations.
  • Operate OpenTelemetry and Grafana Alloy pipelines for metrics, logs, and traces, with sampling and cardinality controls.
  • Maintain observability configurations as code using GitOps tools (Helm, Kustomize, Terraform).
  • Deliver “golden dashboards” for frontend UX, API performance, and Kubernetes / service health.

Required Experience & Skills :

  • Minimum 7 years in Observability, SRE, Platform Engineering, or Backend roles with production systems.
  • Strong Kubernetes expertise, including agent deployment and metadata enrichment.
  • Hands-on experience with OpenTelemetry for Go, Node.js, and browser instrumentation.
  • RUM experience with Datadog (preferred), Grafana, or Dynatrace.
  • Advanced skills in Prometheus / Grafana or Datadog, including histograms, recording rules, and alerting.
  • Proven ability to reduce alert noise and standardize SLOs across teams.
  • Ideal Candidate Profile :

  • Strategic thinker with a passion for clean observability architecture.
  • Strong communicator and collaborator across engineering and infrastructure teams.
  • Comfortable working in a GitOps-driven environment.
  • Experience in Fintech, Crypto, or E-Commerce is advantageous but not essential.
  • Create a job alert for this search

    Observability • Thoothukudi, IN