🚀 We're Hiring: Senior Site Reliability Engineer
📍 Location: Onsite (Office: Hyderabad – Mandatory from Day 1)
🕒 Employment Type: Full-time
📅 Notice Period: Immediate to 15 Days Only
📈 Experience: 8+ Years
🔧 About the Role
We’re looking for a Senior Site Reliability Engineer (SRE) to lead reliability initiatives across our production systems. This is a high-impact role where you’ll design scalable architectures, drive automation, and ensure system resilience at scale. You’ll collaborate across engineering, infrastructure, and leadership teams to build robust, high-performing platforms.
💼 Key Responsibilities
- Architect and implement highly available, scalable, and resilient infrastructure.
- Define and enforce SLIs, SLOs, and SLAs across services.
- Lead incident management: root cause analysis (RCA), blameless postmortems, and remediation.
- Automate infrastructure provisioning and deployments using Terraform, Ansible, and Helm.
- Enhance observability platforms (metrics, tracing, logging).
- Collaborate on scaling strategies, capacity planning, and performance tuning.
- Implement disaster recovery and business continuity strategies.
- Mentor junior and mid-level SREs.
- Partner with security teams on zero-trust, RBAC, IAM, and compliance.
Required Skills
- 8+ years of experience, with a minimum of 5 years in SRE roles.
- Expert in Kubernetes, AWS, and distributed systems.
- Strong experience with Docker and containerized applications.
- Deep knowledge of Linux systems, networking, and storage.
- Hands-on with IaC tools (Terraform, Ansible), CI/CD, and GitOps.
- Strong observability and incident management practices.
- Basic scripting/development in Python.
- Excellent monitoring and troubleshooting skills.
⚠️ Important Notes
- Profiles with less than 8 years of experience will not be considered.
- A minimum of 5 years of relevant SRE experience is mandatory.
- Only candidates with an Immediate to 15 Days Notice Period will be considered.
- Office presence is mandatory from Day 1.
📩 Interested?
Apply now or DM me for more details. Let’s build resilient systems together!