Hello Everyone,
We're looking for an experienced Site Reliability Engineer who excels in automation, cloud infrastructure, and observability solutions. The right candidate will combine technical depth with a proactive mindset to drive system reliability and performance.
Location: Hyderabad (hybrid role, 2-3 days in office)
Experience level: Senior (7 years and above)
Schedule: Support the US EST time zone (2 PM – 11 PM IST)
Key Qualifications:
Cloud & Automation
Experience with Python for cloud service automation
Ability to integrate cloud services into monitoring systems
Infrastructure Management
Strong Terraform skills for cross-cloud infrastructure deployment
Infrastructure as Code expertise
Cloud Architecture Knowledge
Proficiency with resilient design patterns including failover and disaster recovery
Practical experience with multi-cloud environments (primarily GCP and Azure)
Monitoring Systems
Comprehensive understanding of observability frameworks
Ability to implement monitoring as code
Experience with Dynatrace, Elastic, Splunk, and PagerDuty
Knowledge of integrating observability tools to enhance reliability
Logging & Alert Systems
Proficiency with Splunk and native cloud logging solutions
Deployment Pipelines
Jenkins and Groovy implementation experience
Strong CI/CD pipeline background
Problem-Solving & Security
Advanced troubleshooting capabilities
Security vulnerability remediation experience
Networking fundamentals
Security compliance knowledge (including Azure VM vulnerability assessment)
Additional Valuable Skills:
Java and .NET application development background
Enhanced networking expertise
Site Reliability Engineer • Hyderabad, India