Sr Site Reliability Engineer

Multi Recruit, Hyderabad, IN
30+ days ago
Job description

Roles and Responsibility

The Cloud Platform team is looking for engineers to build and operate the data pipelines that power the full range of the company’s products and analytics. Because many of our use cases have massive scale and performance requirements, you will solve challenging problems daily using a variety of cutting-edge technologies.

What you will do :

  • Focus on production operations and on-call duties.
  • Provision and scale multi-datacenter Kubernetes infrastructure and applications (EKS)
  • Deploy Software in multiple Production Environments
  • Own monitoring and alerting for production systems, including improvements and changes.
  • Contribute improvements to the current automation.
  • Contribute improvements to our on-call process and alerting.

What You’ll Bring :

  • Availability to be in on-call rotation for Production issues.
  • Availability to work with a distributed team in different time zones.

Desired Skill Set :

  • Bachelor’s degree in an engineering-related discipline or equivalent work experience.
  • 3+ years of experience with production troubleshooting
  • 3+ years of programming / scripting experience in at least one language, e.g., Perl, Python, PHP, Go, Java
  • Experience both setting up and using monitoring and observability tools, e.g., New Relic, Nagios / Icinga, Grafana, Prometheus
  • Experience operating Kubernetes
  • Basic Terraform Knowledge
  • Expert-level working knowledge of modern Linux operating systems (Enterprise Linux or Debian-based)
  • Experience with modern cloud infrastructure, preferably AWS

Differentiators :

  • Troubleshooting production performance / service degradation or outage issues at scale
  • Experience with infrastructure troubleshooting in VMs and / or bare metal (ssh / Linux)
  • Advanced Kubernetes knowledge
  • Advanced Terraform knowledge.
  • Experience operating NoSQL Databases in Production
  • Experience operating Relational Databases in Production
  • Generic Configuration Management experience