SRE Observability Engineer

Evnek Technologies Pvt Ltd, Hyderabad, TG, IN

2 days ago

Job Description

Job Title: SRE Observability Engineer

Experience: 6 Years

Location: Hyderabad

Notice Period: Immediate Joiners Only

About the Role

We are seeking a highly skilled and motivated SRE Observability Engineer to design, build, and scale observability platforms across our distributed systems. The ideal candidate will have deep expertise in monitoring, logging, tracing, and alerting frameworks along with hands-on experience in Prometheus, Grafana, and Loki.

This role involves close collaboration with Development, DevOps, Infrastructure, and SRE teams to ensure end-to-end visibility, reliability, performance, and availability of critical systems.

Mandatory Skills

Observability
Grafana
Prometheus & Loki (including strong query-writing skills)

Key Responsibilities

Lead the design and implementation of observability solutions spanning monitoring, logging, and distributed tracing across cloud and on-prem environments.

Develop and maintain advanced monitoring frameworks using Prometheus, Grafana, Datadog, New Relic, AppDynamics, and other observability platforms.

Implement and optimize distributed tracing using OpenTelemetry, Jaeger, or Zipkin to enhance application visibility and performance diagnostics.

Improve log management pipelines using tools such as Elasticsearch, Splunk, Loki, and Fluentd, ensuring efficient log ingestion, parsing, storage, and analysis.

Build advanced alerting and anomaly detection mechanisms for proactive issue resolution and reduced MTTR.

Work with development and SRE teams to enhance observability integration within CI/CD pipelines, microservices, and cloud-native architectures.

Automate observability processes using Python, Bash, or Golang to scale operations and reduce manual effort.

Ensure observability platforms are resilient, scalable, and cost-effective for large-scale distributed systems.

Lead incident response efforts, offering actionable insights through logs, metrics, and traces for rapid troubleshooting.

Stay updated on evolving observability, SRE, and monitoring practices to continuously strengthen observability posture.

Required Qualifications

5+ years of hands-on experience in Observability, SRE, DevOps, or similar roles, managing large-scale distributed systems.

Strong experience designing and implementing solutions using Prometheus, Grafana, Datadog, New Relic, and AppDynamics.

Expertise in log management tools such as Elasticsearch, Splunk, Loki, and Fluentd, including performance optimization.

Deep proficiency in distributed tracing frameworks (OpenTelemetry, Jaeger, Zipkin).

Hands-on experience with cloud platforms (Azure, AWS, or GCP) and Kubernetes-based environments.

Strong scripting skills in Python, Bash, or Golang, and experience with IaC tools such as Terraform and Ansible.

Solid understanding of system architecture, performance tuning, scalability, and high availability.

Proven experience in guiding teams, providing technical leadership, and enforcing observability best practices.

Excellent problem-solving skills with the ability to provide data-driven, actionable insights.

Strong stakeholder management, communication, and collaboration abilities.

Preferred Qualifications

Experience with AI-driven observability and automated anomaly detection.

Familiarity with microservices, serverless, and event-driven architectures.

Prior experience in on-call rotations and incident management in high-availability environments.

Certifications in cloud platforms, SRE, or observability tools.

Requirements

SRE
