This job offer is not available in your country.

Site Reliability Engineer

ITHENAPune, Maharashtra, India

11 days ago

Job description

We're Hiring : Senior Site Reliability Engineer (SRE) – Backend Systems (Remote-Friendly)

Are you the go-to person when things break in production? Do you love solving deep infrastructure issues, debugging Kafka lag, and reviewing backend code that keeps services resilient and fast?

We’re looking for a Senior Site Reliability Engineer (SRE) to join our backend team and help scale our real-time, event-driven platform. This isn’t a traditional DevOps or infrastructure-only role — this is for engineers who can debug complex systems, write high-quality code, and design for reliability at scale.

Location : India – Remote-Friendly

Experience : 8+ years in SRE, backend, or platform engineering roles

Interview Process Includes :

High-level system design discussions

A coding round focused on problem solving & debugging skills

What You’ll Do

Investigate and resolve reliability issues like Kafka lag, queue bottlenecks, timeouts, memory spikes, and more
Debug and review production code (Python, Go, Node.js, etc.) for performance and reliability
Design scalable, distributed backend systems that are fault-tolerant and observable
Build tools and automation to detect, fix, and prevent incidents
Own the monitoring, alerting, and SLOs for critical systems
Collaborate closely with backend developers and infrastructure teams

What We’re Looking For

Strong debugging skills : Kafka lag, distributed system failures, log tracing, profiling

Proven experience with observability, monitoring, and alerting tools (Prometheus, Grafana, ELK, etc.)

Deep understanding of message brokers and data pipelines : Kafka, RabbitMQ, Redis

Strong backend coding ability in any modern language (Python, Go, Rust, Node.js, etc.)

Familiar with production-grade system design patterns : retries, backpressure, eventual consistency

Experience with microservices, distributed systems, and containerized deployments

Nice to Have

Exposure to streaming platforms (Apache Pulsar, Flink)

Familiarity with Agentic Architecture, LLM

Familiarity with DevSecOps practices and GitOps workflows

Knowledge of resilience engineering, chaos testing, or load testing

Experience working in agile, product-centric teams

Why Join Us?

You’ll be a core part of building resilient, high-scale systems from day one

Modern architecture with no legacy baggage

Remote flexibility and a team that values deep technical work

Direct impact on platform reliability, uptime, and performance

Interested? Let’s Talk.

Send your resume to [email protected] or apply directly here on LinkedIn.

Create a job alert for this search

Site Reliability Engineer • Pune, Maharashtra, India

Related jobs

Promoted

Site Reliability Engineer

ITHENAPune, Maharashtra, India

We're Hiring : Senior Site Reliability Engineer (SRE) – Backend Systems (Remote-Friendly).Are you the go-to person when things break in production? Do you love solving deep infrastructure issues, de...Show moreLast updated: 19 days ago

Site Reliability Engineer

BP EnergyPune, MH, India

Technology .IT&S Group .Experience- 4- 7 years (excluding internship), Required 2-3 years of experience in Azure.A multi-disciplinary squad, enga...Show moreLast updated: 4 days ago

Senior Site Reliability Engineer

OnitPune, Maharashtra, IN

Quick Apply

Role : Senior Site Reliability Engineer Location : Pune Onit, Inc.Site Reliability Engineer L2 to join our Core Infrastructure team. This role will help to ensure the reliability of a diverse s...Show moreLast updated: 15 days ago

Site Reliability Engineer

trellixINDIA

Trellix, the trusted CISO ally, is redefining the future of cybersecurity and soulful work.Our comprehensive, GenAI-powered platform helps organizations confronted by todays most advanced threats g...Show moreLast updated: 30+ days ago

Promoted

Senior DevOps Engineer - Site Reliability

DashhirePune

We are looking for a skilled and experienced Senior DevOps Engineer to lead the design, implementation, and management of our CI / CD infrastructure, cloud operations, and automation frameworks.You w...Show moreLast updated: 15 days ago

Promoted

Site Reliability Engineer

SynechronPune, Maharashtra, India

We have immediate opportunity for.Site Reliability Engineer 5 to 9 years.SRE (Senior Site Reliability Engineer).We began life in 2001 as a small, self-funded team of technology specialists.Since th...Show moreLast updated: 11 days ago

Site Reliability Engineer

Talent WorxPune, MH, IN

Quick Apply

NET based application support like Issues Resolution and Incident management.Strong trouble shooting skills in debugging multiarchitecture systems and experience with microservices architecture&nbs...Show moreLast updated: 18 days ago

Promoted

Site Reliability Engineer

noonPune, IN

Job Title : Site Reliability Engineer.In doing this we hope to accelerate the digital economy of the Middle East, empowering regional talent and businesses to meet the full range of consumers' onlin...Show moreLast updated: 18 days ago

Site Reliability Engineer

VistexPune, Maharashtra, IND

The Vistex Site Reliability Engineer will be primarily responsible for service availability, performance, monitoring, incident response, and capacity planning. This is a highly technical, hands-on r...Show moreLast updated: 15 days ago

Site Reliability Engineer

NatWest GroupINDIA

Join us as a Site Reliability Engineer.In this key role, youll support the improvement of non-functional and operational characteristics such as availability, performance, efficiency, change manage...Show moreLast updated: 30+ days ago

Site Reliability Engineer

PhonepeINDIA

PhonePe is Indias leading digital payments company with 50 crore (500 Million) registered users and 3.Million) merchants covering over 99 PERCENT of the postal codes across India.On the back of it...Show moreLast updated: 30+ days ago

Site Reliability Engineer

Global Payments Asia-Pacific India Private LimitedPune, Maharashtra, India

Every day, Global Payments makes it possible for millions of people to move money between buyers and sellers using our payments solutions for credit, debit, prepaid and merchant services.Our worldw...Show moreLast updated: 30+ days ago

Promoted

Site Reliability Engineer - Azure / AWS

Innova ESIPune

Role : Site Reliability Engineer with (Azure OR AWS) Location : Bangalore, Pune, Noida Show moreLast updated: 30+ days ago

Promoted

Senior Site Reliability Engineer

Idox plcPune, Maharashtra, India

Design, build, and maintain scalable, highly available, and secure AWS environments.Manage and automate infrastructure as code (IaC) using tools like Terraform, OpenTofu, CloudFormation, Ansible.Op...Show moreLast updated: 28 days ago

Site Reliability Engineer

tcg digital solutions pvt ltdINDIA

Bachelors or masters degree in Computer Science, Engineering, or related field.Essential Skills (Two top skills).AWS Ecosystem EKS, EC2, DynamoDB, Lambda, etc. The SRE team should include some memb...Show moreLast updated: 30+ days ago

Promoted

Site Reliability Engineer - Docker / Kubernetes

Shorlist ProfessionalsPune

Job Title : Site Reliability Engineer (SRE) Company : Talent Worx Location : Pune, Maharashtra, India<...Show moreLast updated: 15 days ago

Site Reliability Engineer

Qure.aiINDIA

AI is one of the fastest-growing startups in India, which develops Artificial intelligence-enabled products and platforms for healthcare diagnostics. We create cutting-edge solutions that positively...Show moreLast updated: 30+ days ago

Site Reliability Engineer

KiboPune, MH, IND

As an SRE, your primary responsibility is to ensure the reliability, scalability, and availability of the systems that power Kibo’s products and services. You will work closely with cross-functional...Show moreLast updated: 15 days ago

Site Reliability Engineer

GSPANNPune, IN

Description GSPANN is hiring a Site Reliability Engineer (SRE) for its Pune or Hyderabad location.This full-time role focuses on enhancing the reliability of global eCommerce platforms through auto...Show moreLast updated: 30+ days ago

Site Reliability Engineer

Deutsche BankMargarpatta, Pune

We are looking for a candidate to join a multi-functional SRE team.You should be having cloud engineering experience in such area acting as the SME on operation automation and monitoring, identifyi...Show moreLast updated: 30+ days ago