Talent.com
Site Reliability Engineer (SRE) – Infrastructure & Automation

Site Reliability Engineer (SRE) – Infrastructure & Automation

InstaServiceagra, uttar pradesh, in
6 days ago
Job description

About InstaService

InstaService is revolutionizing the home services industry through AI-driven technology, connecting customers with trusted professionals instantly. We’re growing fast across 23+ states and expanding nationwide — backed by strong traction, rapid adoption, and a mission to simplify how people get work done at home.

We’re looking for a Senior Site Reliability Engineer (SRE) to join our core engineering team and scale our infrastructure to serve millions of users reliably.

What You’ll Do

  • Lead incident response , conduct root cause analysis , and ensure permanent preventive measures.
  • Design and optimize CI / CD pipelines , automate deployments, and enforce release stability.
  • Build and manage scalable infrastructure on AWS, GCP, or Azure using Terraform , Ansible , and Kubernetes .
  • Continuously monitor system health with Prometheus , Grafana , ELK , and CloudWatch .
  • Conduct load and performance testing (k6, JMeter, Locust) and optimize systems for high-traffic events.
  • Improve observability , reduce alert noise, and enhance signal clarity for faster debugging.
  • Collaborate with developers and architects to ensure systems meet SLOs, SLIs, and SLAs .
  • Develop automation scripts and tools in Python, Go, Node.js, or Shell to streamline operations.
  • Manage distributed systems and message queues like Kafka or RabbitMQ .
  • Drive a culture of reliability, automation, and scalability across teams.

What We’re Looking For

  • 4–7 years of experience in SRE or DevOps roles (preferably in high-scale or e-commerce environments).
  • Strong hands-on experience with Kubernetes , Docker , Terraform , Ansible , and CI / CD pipelines .
  • Deep understanding of Linux systems , networking , and distributed architecture .
  • Solid programming skills in Python , Go , or Node.js .
  • Experience managing cloud platforms (AWS, GCP, or Azure).
  • Proven track record of maintaining production uptime and optimizing system performance .
  • Nice to Have

  • Experience with observability stacks , distributed tracing , and incident automation .
  • Familiarity with microservices and event-driven systems .
  • Exposure to cost optimization and capacity planning in multi-cloud environments.
  • Why Join InstaService?

  • Fast-growing startup reshaping a massive industry
  • Work on high-scale systems and impactful technology
  • Collaborative and innovation-driven team
  • Competitive compensation and growth opportunities
  • Create a job alert for this search

    Site Reliability Engineer • agra, uttar pradesh, in

    Related jobs
    • Promoted
    Senior Site Reliability Engineer- ELK Expert

    Senior Site Reliability Engineer- ELK Expert

    iVedha Inc.agra, uttar pradesh, in
    Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone. Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 30+ days ago
    • Promoted
    Lead Engineer

    Lead Engineer

    HyqooAgra, IN
    Design, deploy, and manage AWS cloud infrastructure, including EC2 instances, S3 buckets, VPCs, RDS databases, and Lambda functions. Assist in the design, implementation, and maintenance of backup, ...Show moreLast updated: 3 days ago
    • Promoted
    Senior AppDynamics Observability SME

    Senior AppDynamics Observability SME

    Dexian IndiaAgra, IN
    Position Title : Senior AppDynamics Observability SME.IT operations, system administration, or engineering.Ansible, Jenkins, Terraform, Python to develop configuration, deployment, and orchestration...Show moreLast updated: 18 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Synamediaagra, uttar pradesh, in
    At Synamedia, the world’s most talented innovators and trailblazers are shaping the way the world is entertained and informed. We are backed by the Permira funds and Sky.This is the age of infinite ...Show moreLast updated: 1 day ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Nebula Tech Solutionsagra, uttar pradesh, in
    SRE team supporting mission-critical applications for our.We’re now looking for engineers who can go beyond operations — those who can. Enhance application reliability through code.Add or modify cod...Show moreLast updated: 8 days ago
    • Promoted
    Senior Site Reliability Engineer (Sre) – Datadog Observability

    Senior Site Reliability Engineer (Sre) – Datadog Observability

    Jade GlobalBharatpur, Republic Of India, IN
    Senior Site Reliability Engineer (SRE) – Datadog Observability.SRE and Infrastructure Operations with minimum 3.Hyderabad preferable but open for Pune and remote. Site Reliability Engineer (SRE).SRE...Show moreLast updated: 8 days ago
    • Promoted
    Infrastructure Solutions Architect

    Infrastructure Solutions Architect

    BayOne SolutionsAgra, IN
    Systems or Solutions Architect.IaaS), and cloud-scale system design.The ideal candidate combines strong fundamentals in.Kubernetes, observability, and automation. You’ll design scalable systems that...Show moreLast updated: 8 days ago
    • Promoted
    Regional Cloud Infrastructure Engineer

    Regional Cloud Infrastructure Engineer

    Argyll ScottAgra, IN
    This position offers an opportunity to lead and support a diverse hybrid IT landscape across the APAC region.The Regional IT and Cloud Specialist will be responsible for managing, optimizing, and s...Show moreLast updated: 8 days ago
    • Promoted
    Senior Site Reliability Engineer (SRE) – Datadog Observability

    Senior Site Reliability Engineer (SRE) – Datadog Observability

    Jade Globalagra, uttar pradesh, in
    Senior Site Reliability Engineer (SRE) – Datadog Observability.SRE and Infrastructure Operations with minimum 3.Hyderabad preferable but open for Pune and remote. Site Reliability Engineer (SRE).SRE...Show moreLast updated: 8 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CodeKarmaagra, uttar pradesh, in
    Site Reliability Engineer (Multi-Cloud Deployments).CodeKarma is redefining how engineering teams understand and evolve complex systems — bringing production context directly into the developer’s w...Show moreLast updated: 28 days ago
    • Promoted
    Sr Engineer, Site Reliability [T500-21295]

    Sr Engineer, Site Reliability [T500-21295]

    TMUS Global Solutionsagra, India
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 5 days ago
    • Promoted
    Site Reliability Engineer (SRE) – Infrastructure & Automation

    Site Reliability Engineer (SRE) – Infrastructure & Automation

    InstaServiceAgra, IN
    InstaService is revolutionizing the home services industry through AI-driven technology, connecting customers with trusted professionals instantly. We’re growing fast across 23+ states and expanding...Show moreLast updated: 6 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CapgeminiAgra, IN
    Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues...Show moreLast updated: 18 days ago
    • Promoted
    Senior Site Reliability Engineer- Elk Expert

    Senior Site Reliability Engineer- Elk Expert

    iVedha Inc.Bharatpur, Republic Of India, IN
    Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone. Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 22 days ago
    • Promoted
    Sr Engineer, Site Reliability T500-21295

    Sr Engineer, Site Reliability T500-21295

    TMUS Global SolutionsBharatpur, Republic Of India, IN
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 6 days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    IntraEdgeAgra, IN
    Strong leadership and people management skills.Exceptional technical proficiency in Pearson's technology stack.Strategic thinking with a focus on long-term operational excellence.Champion operation...Show moreLast updated: 21 days ago
    • Promoted
    Site Reliability Engineer (Sre) – Infrastructure & Automation

    Site Reliability Engineer (Sre) – Infrastructure & Automation

    InstaServiceBharatpur, Republic Of India, IN
    InstaService is revolutionizing the home services industry through AI-driven technology, connecting customers with trusted professionals instantly. We’re growing fast across 23+ states and expanding...Show moreLast updated: 5 days ago
    • Promoted
    Systems & Automation Specialist

    Systems & Automation Specialist

    White Tiger Connections Inc.Agra, IN
    We’re looking for someone who thrives at the intersection of IT, systems design, and automation — someone who can help us build, connect, and maintain the tools that keep our business running smoot...Show moreLast updated: 7 days ago