Talent.com
This job offer is not available in your country.
Site Reliability Engineer - AWS / Azure

Site Reliability Engineer - AWS / Azure

Squareroot Consulting Pvt. Ltd.Bangalore
30+ days ago
Job description

Job Title : Site Reliability Engineer (SRE)

Experience : 5 to 8 years

Location : Bangalore, India (Work from Office 5 Days a Week)

Joining : Immediate Joiners Only

About Us :

We are a fast-growing tech startup building scalable and reliable solutions to solve real-world problems. As part of our next growth phase, we are looking to strengthen our infrastructure and reliability engineering team with top-notch talent who thrive in fast-paced environments.

Key Responsibilities :

  • Design, implement, and manage scalable, secure, and highly available infrastructure systems.
  • Build automation tools for system provisioning, configuration, monitoring, and deployment.
  • Maintain and improve CI / CD pipelines to support rapid development and deployment.
  • Drive observability through monitoring, alerting, and dashboards using tools like Prometheus, Grafana, ELK, etc.
  • Respond to production incidents, perform root cause analysis, and implement long-term fixes.
  • Collaborate with development and QA teams to ensure system reliability, scalability, and performance.
  • Own SLAs, uptime, and availability metrics and ensure compliance with them.
  • Implement security best practices across infrastructure components.

Key Requirements :

  • 58 years of experience as an SRE, DevOps Engineer, or in a similar role.
  • Proven track record in a fast-paced startup environment.
  • Strong experience with cloud platforms like AWS, GCP, or Azure.
  • Expertise in containerization and orchestration tools such as Docker and Kubernetes.
  • Hands-on experience with infrastructure-as-code tools like Terraform or Ansible.
  • Proficient in scripting languages (Python, Bash, or Go preferred).
  • Experience with monitoring tools (Prometheus, Grafana, ELK, etc.).
  • Solid understanding of system architecture, networking, and Linux internals.
  • Strong problem-solving and debugging skills.
  • Good to Have :

  • Experience with incident management and chaos engineering.
  • Exposure to security and compliance for infrastructure.
  • Familiarity with cost optimization techniques in cloud environments.
  • ref : hirist.tech)

    Create a job alert for this search

    Site Reliability Engineer • Bangalore