Site Reliability Engineer
Role : Site Reliability Engineer
Location : Kolkata / Mumbai / Remote
Experience : 5 years
Education : Bachelors or masters degree in Computer Science, Engineering, or related field
Number of Positions : 1
Skills Required Essential Skills (Two top skills)
- AWS Ecosystem EKS, EC2, DynamoDB, Lambda, etc.
- Dynatrace (or similar)
The SRE team should include some members with Dynatrace experience, while the rest can have experience with similar tools.
Roles & Responsibilities Experience
5 years of experience in Site Reliability Engineering or related roles.Design, implement, and maintain scalable and reliable infrastructure on AWS.Utilize Dynatrace for monitoring, performance tuning, and troubleshooting of applications and services.Develop automation scripts to streamline deployment processes and enhance operational efficiency.Lead chaos engineering initiatives to proactively identify weaknesses in our systems and improve resilience.Collaborate with development teams to integrate reliability into the software development lifecycle.Automate operational processes to enhance efficiency and reduce manual intervention.Participate in on-call rotations to support incident response and resolution.Strong proficiency in AWS services (EC2, S3, RDS, Lambda, etc.) and cloud architecture best practices.Experience with Dynatrace or similar monitoring tools for application performance management.Familiarity with chaos engineering principles and tools.Solid understanding of load testing methodologies and tools.Proficient in scripting languages and configuration management tools.Excellent problem-solving skills and the ability to work under pressure.SPOC : Human Resources
Mail to : careers@tcgdigital.com