Senior Site Reliability EngineerConfidential • Hyderabad / Secunderabad, Telangana

Senior Site Reliability Engineer

Confidential • Hyderabad / Secunderabad, Telangana

30+ days ago

Job description

Key Responsibilities :

Lead incident management , monitoring, and alerting processes to ensure timely detection and resolution of production issues.
Ensure reliability, availability, and performance of systems by defining and maintaining SLIs, SLOs, and SLAs.
Design and implement fault-tolerant, scalable architectures to minimize downtime and improve resiliency.
Develop automation and tooling for monitoring, incident remediation, and infrastructure management.
Participate in a 24x7 on-call rotation to manage production incidents and maintain system uptime.
Create and maintain SOPs and technical documentation for processes, tools, and incident management protocols.
Implement and manage Infrastructure as Code (IaC) using tools such as Terraform and Ansible to automate provisioning and deployments.
Work with cloud platforms —primarily AWS (EC2, S3, VPC, RDS, EKS, ECS, CloudWatch, CloudFormation)—to support scalable system operations.
Integrate and manage CI / CD pipelines using tools like Jenkins to enable seamless deployments.
Utilize monitoring and alerting tools (Datadog, Site24x7, Grafana, CloudWatch) to proactively identify issues.
Conduct performance tuning and optimization , addressing bottlenecks and improving efficiency.
Drive cost optimization strategies while maintaining performance and reliability standards.
Adhere to security best practices and ensure infrastructure compliance with organizational standards.
Collaborate with development, product, and security teams to enhance system reliability and service delivery.
Mentor junior engineers and promote a culture of reliability engineering across the organization.

Qualifications :

5–8 years of experience in Site Reliability Engineering, DevOps, or Cloud Infrastructure roles.

Strong hands-on expertise with AWS (experience with GCP or Azure is a plus).

Proficiency in Infrastructure as Code (IaC) tools such as Terraform and Ansible .

Experience with monitoring and alerting tools including Datadog, Site24x7, Grafana, and CloudWatch.

Solid understanding of CI / CD tools such as Jenkins.

Proven ability in incident management, root cause analysis , and implementing long-term reliability improvements.

Familiarity with automation scripting (Python, Bash, or Shell scripting preferred).

Knowledge of security best practices , networking , and cloud cost management .

Excellent problem-solving, analytical, and collaboration skills.

AWS certification or equivalent cloud certification is an advantage.

Skills Required

Aws, Rds, ECS, Vpc, Cloud, Ci

Create a job alert for this search

Senior Site Reliability Engineer • Hyderabad / Secunderabad, Telangana

Related jobs

Engineer, Site Reliability [T500-20517]

TMUS Global Solutions • Hyderabad, Telangana, India

NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show more

Last updated: 30+ days ago • Promoted

Senior Site Reliability Engineer

Blue Spire Inc • Hyderabad, Republic Of India, IN

We are seeking a highly skilled Senior L2 Ops Engineer to join our dynamic team.You will play a critical role in maintaining the stability, performance, and reliability of our systems through robus...Show more

Last updated: 4 days ago • Promoted

Senior Site Reliability Engineer

Zyoin Group • Hyderabad

Description : As the most senior technical individual contributor within an entire division of Engine...Show more

Last updated: 16 days ago • Promoted

Sr Engineer, Site Reliability Engineer [T500-20464]

TMUS Global Solutions • Hyderabad, Telangana, India

Last updated: 30+ days ago • Promoted

Lead Site Reliability Engineer

AutoRABIT • Hyderabad, Republic Of India, IN

AutoRABIT is the leader in DevSecOps for SaaS platforms such as Salesforce.Its unique metadata-aware capability makes Release Management, Version Control, and Backup & Recovery complete, reliable, ...Show more

Last updated: 30+ days ago • Promoted

Senior Site Reliability Engineer

IntraEdge • Hyderabad, IN

Strong leadership and people management skills.Exceptional technical proficiency in Pearson's technology stack.Strategic thinking with a focus on long-term operational excellence.Champion operation...Show more

Last updated: 24 days ago • Promoted

SRE (Site Reliability Engineer)

Tata Consultancy Services • Hyderabad, Republic Of India, IN

Kubernetes (Any cloud) + PostgresSQL, SQL(Must).Linux (Optional), Java (Optional), Kubernetes (CLI), Prior Production support experience, Release Management, Prior Deployment experience,.Show more

Last updated: 1 day ago • Promoted

Senior Site Reliability Engineer

AutoRABIT • Hyderabad, Telangana, India

Last updated: 30+ days ago • Promoted

Site Reliability Engineer

Tata Consultancy Services • Hyderabad, Telangana, India

GKE(Preferable); Kubernetes (Any cloud) + PostgresSQL, SQL(Must).Linux (Optional), Java (Optional) , Kubernetes (CLI), Prior Production support experience, Release Management, Prior Deployment expe...Show more

Last updated: 30+ days ago • Promoted

Site Reliability Senior Engineer

Confidential • Hyderabad / Secunderabad, Telangana, India

Veeva Systems is a mission-driven organization and pioneer in industry cloud, helping life sciences companies bring therapies to patients faster. As one of the fastest-growing SaaS companies in hist...Show more

Last updated: 15 days ago • Promoted

Site Reliability Engineer

HRhelpdesk • secunderabad, telangana, in

Company is a rapidly growing, private equity backed SaaS product company and provides cloud-based solutions.As a Site Reliability Engineer (SRE), you will be responsible for building and maintainin...Show more

Last updated: 1 day ago • Promoted

Site Reliability Engineer

Capgemini • Hyderabad, IN

Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues...Show more

Last updated: 21 days ago • Promoted

Senior Site Reliability Engineer

Nebula Tech Solutions • secunderabad, telangana, in

SRE team supporting mission-critical applications for our.We’re now looking for engineers who can go beyond operations — those who can. Enhance application reliability through code.Add or modify cod...Show more

Last updated: 11 days ago • Promoted

Site Reliability Engineer

Infosys • Hyderabad, Republic Of India, IN

We are seeking a skilled and motivated Site Reliability Engineer with hands-on expertise.DevOps tools, and SRE principles. Provide production support for Production applications, ensuring the stabil...Show more

Last updated: 25 days ago • Promoted

Senior Site Reliability Engineer

TMUS Global Solutions • Hyderabad, Republic Of India, IN

Last updated: 30+ days ago • Promoted

Site Reliability Engineer

Foodsmart • Hyderabad, Republic Of India, IN

Foodsmart is the leading telenutrition and foodcare solution, backed by a robust network of Registered Dietitians.Our platform is designed to foster healthier food choices, drive lasting behavior c...Show more

Last updated: 30+ days ago • Promoted

Senior Site Reliability Engineer

o9 Solutions, Inc. • secunderabad, India

Be part of something revolutionary.At o9 Solutions, our mission is clear : be the Most Valuable Platform (MVP) for enterprises. With our AI-driven platform — the o9 Digital Brain — we integrate globa...Show more

Last updated: 6 hours ago • Promoted • New!

Senior Site Reliability Engineer

Confidential • Hyderabad / Secunderabad, Telangana, India

As a senior site reliability engineer will work in our global organization to provide operational support for all Thomson Reuters products, including development tools and infrastructure used by en...Show more

Last updated: 30+ days ago • Promoted