Talent.com
No longer accepting applications
Senior Site Reliability Engineer (SRE)

Senior Site Reliability Engineer (SRE)

Voya IndiaHyderabad, Telangana, India
1 day ago
Job description

About the position

We are seeking a strategic and technically adept leader to drive the scalability, resilience, and operational excellence of our enterprise systems. This role will set the vision for site reliability engineering (SRE) practices, observability frameworks, and performance optimization, ensuring our digital platforms are robust, measurable, and aligned to business priorities. You will collaborate across product, engineering, and infrastructure teams to deliver highly available, high-performing systems that meet the demands of a modern digital enterprise.

Responsibilities

Set strategy and lead delivery of scalable, resilient systems across cloud and on-premise environments.

Define and govern reliability standards (SLAs, SLOs, error budgets) and embed them into development practices.

Implement observability at scale (logs, metrics, traces) to drive real-time visibility and actionable insights.

Lead performance engineering initiatives including capacity planning, load testing, and tuning of critical applications.

Drive incident management practices — proactive detection, streamlined response, and a culture of learning through postmortems.

Champion automation in monitoring, alerting, CI / CD pipelines, and infrastructure provisioning.

Partner across functions (product, engineering, DevOps, security, architecture) to align reliability goals with business priorities.

Influence enterprise architecture decisions with a reliability-first perspective, including platform modernization efforts.

Mentor and develop engineers, fostering a culture of technical excellence, accountability, and continuous improvement.

Represent reliability in executive forums, providing clear insights into system health, risks, and roadmap implications.

Qualifications

10+ years of experience in systems engineering, site reliability engineering, or infrastructure architecture.

Expertise in distributed systems and cloud platforms (AWS, Azure, GCP).

Deep knowledge of observability tooling (Datadog, Prometheus, Grafana, OpenTelemetry, etc.).

Strong programming background (e.g., Java, Python, Node.js, or similar).

Proven leadership of cross-functional technical initiatives at scale.

Experience with CI / CD, infrastructure-as-code (Terraform, Ansible, etc.), and automation frameworks.

Strong communicator with the ability to translate technical reliability goals into business outcomes.

Create a job alert for this search

Senior Site Reliability Engineer • Hyderabad, Telangana, India

Related jobs
  • Promoted
Senior Site Reliability Engineer

Senior Site Reliability Engineer

Elios TalentHyderabad, Telangana, India
Senior Site Reliability Engineer Key Highlights ️ Build, scale, and optimize cloud-native infrastructure powering global, high-availability platforms ⚡ Drive automation-first engineering across ...Show moreLast updated: 2 days ago
  • Promoted
Senior Site Reliability Engineer

Senior Site Reliability Engineer

Zyoin GroupHyderabad
Description : As the most senior technical individual contributor within an entire division of Engine...Show moreLast updated: 23 days ago
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

Elios TalentHyderabad, Telangana, India
Site Reliability Engineer Key Highlights ️ Build, automate, and support cloud-native infrastructure powering high-availability platforms ⚡ Contribute to automation-first engineering across AWS, Te...Show moreLast updated: 2 days ago
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

Tata Consultancy ServicesHyderabad, Telangana, India
GKE(Preferable); Kubernetes (Any cloud) + PostgresSQL, SQL(Must) Linux (Optional), Java (Optional) , Kubernetes (CLI), Prior Production support experience, Release Management, Prior Deployment exp...Show moreLast updated: 30+ days ago
  • Promoted
Sr Engineer, Site Reliability [T500-20425]

Sr Engineer, Site Reliability [T500-20425]

TMUS Global SolutionsHyderabad, Telangana, India
NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 30+ days ago
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

Prometheus consultingHyderabad
WHAT YOU'LL DO : - Support, maintain, and enhance the reliability, scalability, and performance of our Azure-based Data Analytics Platform. Collaborate closely with Data En...Show moreLast updated: 27 days ago
  • Promoted
  • New!
Site Reliability Engineer

Site Reliability Engineer

Awign ExpertGreater Hyderabad Area, India
Position : SRE Observability Engineer.Mandatory Skills : Observability, Grafana and Writing queries using Prometheus and Loki. We are seeking a highly experienced and driven Senior Observability Engin...Show moreLast updated: 14 hours ago
  • Promoted
Senior Site Reliability Engineer

Senior Site Reliability Engineer

AutoRABITHyderabad, Telangana, India
AutoRABIT is the leader in DevSecOps for SaaS platforms such as Salesforce.Its unique metadata-aware capability makes Release Management, Version Control, and Backup & Recovery complete, reliable, ...Show moreLast updated: 30+ days ago
  • Promoted
Site Reliability Engineer (SRE) – Infrastructure & Automation

Site Reliability Engineer (SRE) – Infrastructure & Automation

InstaServiceHyderabad, IN
InstaService is revolutionizing the home services industry through AI-driven technology, connecting customers with trusted professionals instantly. We’re growing fast across 23+ states and expanding...Show moreLast updated: 16 days ago
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

Inspire Brands Hyderabad Support CenterHyderabad, India
Inspire Brands is disrupting the restaurant industry through digital transformation and operational efficiencies.The companys technology hub, Inspire Brands Hyderabad Support Center, India, will le...Show moreLast updated: 26 days ago
  • Promoted
Sr Engineer, Site Reliability

Sr Engineer, Site Reliability

TMUS Global SolutionsHyderabad, India
As a Senior Site Reliability Engineer, you will be a key member of the CFL Platform Engineering and Operations team you will play a pivotal role in building and scaling intelligent infrastructure t...Show moreLast updated: 30+ days ago
  • Promoted
Principal Engineer, Site Reliability

Principal Engineer, Site Reliability

TMUS Global SolutionsHyderabad, India
The Principal Engineer, Site Reliability (SRE) will play a critical role in ensuring the stability, scalability, and operational excellence of Accounting and Finance platforms.This role is focused ...Show moreLast updated: 30+ days ago
  • Promoted
Site Reliability Engineer [T500-21132]

Site Reliability Engineer [T500-21132]

InspireHyderabad, Telangana, India
Inspire Brands is disrupting the restaurant industry through digital transformation and operational efficiencies.The company’s technology hub, Inspire Brands Hyderabad Support Center, India, will l...Show moreLast updated: 17 days ago
  • Promoted
Site Reliability Engineer - Cloud Solutions

Site Reliability Engineer - Cloud Solutions

SMARTWORK IT SERVICESHyderabad
Description : Role : Site Reliability Engineer (SRE).Job Summary : The Site Reliability E...Show moreLast updated: 24 days ago
  • Promoted
Engineer - Site Relibility - FPT

Engineer - Site Relibility - FPT

Talent500 INCHyderabad, India
Engineer - Site Reliability - FPT.As a Site Reliability Engineer, youll play a crucial role in keeping our digital backbone running seamlessly for millions of customers. Your mission : reduce inciden...Show moreLast updated: 30+ days ago
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

VXI Global SolutionsHyderabad, Telangana, India
We are looking for a Site Reliability Engineer with 3+ years for Experience into design, implement, and manage robust observability solutions across our cloud infrastructure and applications.The id...Show moreLast updated: 30+ days ago
  • Promoted
Engineer, Site Reliability

Engineer, Site Reliability

TMUS Global SolutionsHyderabad, India
Engineer reliability : Identify potential system issues early, implement preventive measures, and boost system resilience. Automate for speed : Build tools, pipelines, and scripts that eliminate manua...Show moreLast updated: 30+ days ago
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

NationsBenefits IndiaHyderabad, Telangana, India
Site Reliability Engineer (SRE) | Fintech | Kubernetes | Datadog |.SRE team focused on maintaining the performance, reliability, and availability of our fintech platforms.Triage and resolve product...Show moreLast updated: 30+ days ago