Talent.com
Site Reliability Engineer (SRE) – Infrastructure & Automation

InstaService • Hyderabad, Republic of India, IN
1 day ago
Job description

About InstaService

InstaService is revolutionizing the home services industry through AI-driven technology, connecting customers with trusted professionals instantly. We’re growing fast across 23+ states and expanding nationwide — backed by strong traction, rapid adoption, and a mission to simplify how people get work done at home.

We’re looking for a Senior Site Reliability Engineer (SRE) to join our core engineering team and scale our infrastructure to serve millions of users reliably.

What You’ll Do

  • Lead incident response, conduct root cause analysis, and put permanent preventive measures in place.
  • Design and optimize CI/CD pipelines, automate deployments, and enforce release stability.
  • Build and manage scalable infrastructure on AWS, GCP, or Azure using Terraform, Ansible, and Kubernetes.
  • Continuously monitor system health with Prometheus, Grafana, ELK, and CloudWatch.
  • Conduct load and performance testing (k6, JMeter, Locust) and optimize systems for high-traffic events.
  • Improve observability, reduce alert noise, and enhance signal clarity for faster debugging.
  • Collaborate with developers and architects to ensure systems meet SLOs, SLIs, and SLAs.
  • Develop automation scripts and tools in Python, Go, Node.js, or Shell to streamline operations.
  • Manage distributed systems and message queues like Kafka or RabbitMQ.
  • Drive a culture of reliability, automation, and scalability across teams.

What We’re Looking For

  • 4–7 years of experience in SRE or DevOps roles (preferably in high-scale or e-commerce environments).
  • Strong hands-on experience with Kubernetes, Docker, Terraform, Ansible, and CI/CD pipelines.
  • Deep understanding of Linux systems, networking, and distributed architecture.
  • Solid programming skills in Python, Go, or Node.js.
  • Experience managing cloud platforms (AWS, GCP, or Azure).
  • Proven track record of maintaining production uptime and optimizing system performance.

Nice to Have

  • Experience with observability stacks, distributed tracing, and incident automation.
  • Familiarity with microservices and event-driven systems.
  • Exposure to cost optimization and capacity planning in multi-cloud environments.

Why Join InstaService?

  • Fast-growing startup reshaping a massive industry
  • Work on high-scale systems and impactful technology
  • Collaborative and innovation-driven team
  • Competitive compensation and growth opportunities