Talent.com
This job offer is not available in your country.
Sr Engineer, Site Reliability Engineer

Sr Engineer, Site Reliability Engineer

TMUS Global SolutionsHyderabad, India
13 days ago
Job description

About the Role

The Senior Systems Reliability Engineer (SRE) ensures the stability, performance, and reliability of IT services and infrastructure. This role combines software engineering and operations expertise to build and maintain highly available, scalable systems. As a leader in DevOps and cloud reliability practices, the engineer supports continuous improvement of automation, deployment pipelines, observability, and incident management, while mentoring junior engineers and optimizing production workflows.

The position plays a critical part in enabling software to be delivered faster, better, and more reliably to support business and customer needs.

What Youll Do

  • Design and maintain CI / CD pipelines and DevOps automation solutions
  • Guide incident response and improve system resiliency and performance
  • Build monitoring tools, dashboards, and proactive alerting for non-production environments
  • Create and maintain infrastructure as code (IaC) for scalable environments
  • Work with containerization and microservices in cloud-native platforms
  • Mentor junior engineers and collaborate across teams on cloud and DevOps initiatives
  • Improve software delivery processes through automation, cloud migration, and service orchestration
  • Perform other duties and technical projects as assigned

What Youll Bring

  • Bachelors degree in Computer Science, Software Engineering, or related field (masters preferred)
  • 47 years of experience in systems reliability, DevOps, or cloud infrastructure engineering
  • Experience with CI / CD tools like Jenkins, GitLab CI, or CloudBees
  • Familiarity with infrastructure and configuration management tools (Ansible, Chef, Puppet)
  • Hands-on knowledge of public and private cloud platforms
  • Experience with application performance monitoring (APM) and log aggregation tools
  • Proven experience working in Agile and DevOps environments
  • Must Have Skills

  • Kubernetes Fundamentals Deploys and manages pods, services, scaling.
  • CI / CD pipelines.
  • Monitoring tools - Grafana , Splunk etc.
  • Ability to script experience - Python(preferred), Bash, Shell
  • Agile Practices Works effectively in Agile teams; collaborates with developers and DBAs.
  • Create a job alert for this search

    Site Reliability Engineer • Hyderabad, India