Key Job Responsibilities and Duties :
The core premise for the SRE lies in treating operational issues as a software problem.
We code our way out of problems where operations are concerned addressing availability,
scalability, latency, and efficiency challenges within the vast infrastructure here.
You will impact millions of people all over the globe with your creative solutions
You work in one of the biggest e-commerce companies in the world
You will solve exciting problems at scale by writing and deploying code across tens of thousands of servers
You will have the opportunity to collaborate with many of the world’s leading SREs
You will be free to launch your own ideas and solutions within our sophisticated production environment
Here are some of the tools and technologies we use to achieve this : Python, Go, Puppet, Kubernetes, Elasticsearch, Prometheus, HAProxy, Cassandra, Kafka etc
What you’ll be Doing :
Share the on-call rotation and be an escalation contact for incidents (depending on level of role)
What you’ll bring :
Solid experience in at least one programming language.
Good interpersonal skills
Proficient command of the English language, both written and spoken
Site Reliability Engineer • India