Job Title : Site Reliability Engineer (SRE)
Experience Required : 7+ Years
Location : Bangalore / Chennai
Employment Type : Full-Time
Work Mode : Onsite
Role Overview :
We are seeking a highly skilled Site Reliability Engineer (SRE) with 7+ years of experience to ensure the reliability, scalability, and performance of our systems. The ideal candidate will bring deep technical expertise, problem-solving abilities, and a passion for driving system efficiency and resilience at scale.
Key Responsibilities :
- Design, implement, and maintain reliable, scalable, and highly available systems.
- Monitor and troubleshoot production systems to proactively prevent incidents.
- Collaborate with development and operations teams to improve deployment, monitoring, and system automation.
- Build and maintain CI/CD pipelines for faster, more reliable software delivery.
- Implement best practices for observability, including logging, monitoring, and alerting.
- Participate in incident response and root cause analysis, and drive long-term fixes.
- Optimize performance and resource utilization across cloud infrastructure.
Mandatory Skills & Experience :
- 7+ years of experience in Site Reliability Engineering or related roles.
- Proven expertise in monitoring, troubleshooting, and system performance optimization.
- Strong hands-on experience with cloud platforms (AWS, Azure, or GCP).
- Proficiency in scripting languages such as Python, Shell, or similar.
- Experience with CI/CD pipelines and deployment automation.
- Solid knowledge of containerization and orchestration (Docker, Kubernetes).
- Familiarity with incident management processes and root cause analysis.
Good to Have :
- Experience with Infrastructure as Code (Terraform, Ansible).
- Knowledge of security best practices in cloud-native environments.
- Exposure to large-scale distributed systems and microservices architecture.
(ref : hirist.tech)