Site Reliability Engineer (SRE) – Infrastructure & Automation

InstaService • Vadodara, IN
13 hours ago
Job description

About InstaService

InstaService is revolutionizing the home services industry through AI-driven technology, connecting customers with trusted professionals instantly. We’re growing fast across 23+ states and expanding nationwide, backed by strong traction, rapid adoption, and a mission to simplify how people get work done at home.

We’re looking for a Senior Site Reliability Engineer (SRE) to join our core engineering team and scale our infrastructure to serve millions of users reliably.

What You’ll Do

  • Lead incident response, conduct root cause analysis, and ensure permanent preventive measures are put in place.
  • Design and optimize CI/CD pipelines, automate deployments, and enforce release stability.
  • Build and manage scalable infrastructure on AWS, GCP, or Azure using Terraform, Ansible, and Kubernetes.
  • Continuously monitor system health with Prometheus, Grafana, ELK, and CloudWatch.
  • Conduct load and performance testing (k6, JMeter, Locust) and optimize systems for high-traffic events.
  • Improve observability, reduce alert noise, and enhance signal clarity for faster debugging.
  • Collaborate with developers and architects to ensure systems meet SLOs, SLIs, and SLAs.
  • Develop automation scripts and tools in Python, Go, Node.js, or Shell to streamline operations.
  • Manage distributed systems and message queues like Kafka or RabbitMQ.
  • Drive a culture of reliability, automation, and scalability across teams.

What We’re Looking For

  • 4–7 years of experience in SRE or DevOps roles (preferably in high-scale or e-commerce environments).
  • Strong hands-on experience with Kubernetes, Docker, Terraform, Ansible, and CI/CD pipelines.
  • Deep understanding of Linux systems, networking, and distributed architecture.
  • Solid programming skills in Python, Go, or Node.js.
  • Experience managing cloud platforms (AWS, GCP, or Azure).
  • Proven track record of maintaining production uptime and optimizing system performance.

Nice to Have

  • Experience with observability stacks, distributed tracing, and incident automation.
  • Familiarity with microservices and event-driven systems.
  • Exposure to cost optimization and capacity planning in multi-cloud environments.

Why Join InstaService?

  • Fast-growing startup reshaping a massive industry
  • Work on high-scale systems and impactful technology
  • Collaborative and innovation-driven team
  • Competitive compensation and growth opportunities