Site Reliability Engineer (SRE)

ConfidentialGurugram, Gurgaon / Gurugram, India

20 days ago

Job description

We are seeking a seasoned Site Reliability Engineer (SRE) with a solid background in payment systems and high-availability architectures. The ideal candidate will have hands-on experience managing large-scale, distributed systems in production, with a deep understanding of reliability, scalability, and performance tuning in the financial services or payments industry.

Key Responsibilities

Design, build, and maintain scalable, resilient, and secure infrastructure for high-volume payment platforms.
Ensure system uptime, reliability, and performance through effective monitoring, alerting, and incident response strategies.
Collaborate with software engineering and DevOps teams to implement CI / CD pipelines and improve deployment efficiency.
Automate infrastructure management tasks using Infrastructure-as-Code (IaC) tools (Terraform, Ansible, etc.).
Proactively identify and mitigate system bottlenecks, failures, and potential points of failure.
Manage disaster recovery strategies, failover planning, and performance testing for critical payment services.
Work with development teams to ensure services are designed for reliability, scalability, and observability from the ground up.
Participate in root cause analysis and post-incident reviews to prevent future outages.

Required Skills & Experience

8+ years of overall experience in infrastructure engineering or SRE roles, with at least 3+ years in the payments / fintech domain.

Strong understanding of payment protocols (UPI, IMPS, RTGS, NEFT, SWIFT, etc.) and transaction processing systems.

Proven expertise in Linux systems administration, cloud platforms (AWS, GCP, or Azure), and container orchestration (Kubernetes).

Solid experience with monitoring / logging tools like Prometheus, Grafana, ELK Stack, Splunk, etc.

Proficiency in one or more scripting languages (Python, Shell, Go, etc.) for automation.

Experience with incident management, SLAs, and system troubleshooting in high-pressure environments.

Familiarity with security and compliance practices in the financial sector (e.g., PCI-DSS, ISO 27001).

Preferred Qualifications

Previous experience supporting mission-critical applications in banking or financial services.

Exposure to Kafka, Redis, or other real-time streaming and caching technologies.

Experience with Site Reliability Engineering principles and implementing SLOs / SLIs.

Understanding of the Error Budget (EL) concept and how it ties into availability and release decisions.

Experience on any performance testing tool like K6, JMeter, LoadRunner.

Familiarity with mocking tools like Mockito, WireMock, Microcks.

Skills Required

Terraform, Ansible, Incident Management

Create a job alert for this search

Site Reliability Engineer • Gurugram, Gurgaon / Gurugram, India

Related jobs

Promoted
New!

Senior Site Reliability Engineer (SRE)

Voya IndiaDelhi, IN

We are seeking a strategic and technically adept leader to drive the scalability, resilience, and operational excellence of our enterprise systems. This role will set the vision for site reliability...Show moreLast updated: 8 hours ago

Promoted

Site Reliability Engineer

PhonePeDelhi, Delhi, India

SRE We are looking for engineers who are passionate about reliability, performance, and efficiency, and with experience in building tools, services, and automation to manage and improve production ...Show moreLast updated: 15 days ago

Promoted

Site Reliability Engineer

WhiteLotus Talent PartnersDelhi, India

L0 and L1 Site Reliability Engineer (SRE) Support.Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by. In this role, you will focu...Show moreLast updated: 30+ days ago

Promoted

Site Reliability Engineer

super.moneyDelhi, India

Site Reliability Engineer (SRE) Level 3.Overview : A Site Reliability Engineer (SRE) Level 3 is a senior technical leadership role focused on designing, implementing, and maintaining large-scale, co...Show moreLast updated: 15 days ago

Promoted

Site Reliability Engineer (SRE) – Infrastructure & Automation

InstaServiceDelhi, IN

InstaService is revolutionizing the home services industry through AI-driven technology, connecting customers with trusted professionals instantly. We’re growing fast across 23+ states and expanding...Show moreLast updated: 14 days ago

Promoted
New!

Site Reliability Engineer

inTune Systems IncDelhi, India

SRE / App Support Engineer Location Hyderabad.We are looking for a Senior Site Reliability Engineer (SRE) to join our growing Engineering team. As an SRE, you will play a key role in ensuring the reli...Show moreLast updated: 21 hours ago

Promoted
New!

Site Reliability Engineer

Awign ExpertDelhi, IN

Position : SRE Observability Engineer.Mandatory Skills : Observability, Grafana and Writing queries using Prometheus and Loki. We are seeking a highly experienced and driven Senior Observability Engin...Show moreLast updated: 9 hours ago

Promoted
New!

Site Reliability Engineer (SRE) / DevOps Engineer

Stoopa AIGhaziabad, IN

AI is building next-generation AI-driven platforms for ports and is focused on reliability, speed, and intelligent automation. As we scale our next generation smart port product Turi, we are hiring ...Show moreLast updated: 9 hours ago

Promoted
New!

Site Reliability Engineer

Elios TalentDelhi, India

Key Highlights ️ Build, automate, and support cloud-native infrastructure powering high-availability platforms ⚡ Contribute to automation-first engineering across AWS, Terraform, CI / CD, and observa...Show moreLast updated: 21 hours ago

Promoted

Site Reliability Engineer

GREYTIP SOFTWARE PRIVATE LIMITEDDelhi, India

We are looking for a skilled Site Reliability Engineer II to join our SRE team.The ideal candidate will have hands-on experience in production monitoring, alert handling, and L1 production support....Show moreLast updated: 3 days ago

Promoted

Site Reliability Engineer

SynamediaDelhi, India

At Synamedia, the world’s most talented innovators and trailblazers are shaping the way the world is entertained and informed. We are backed by the Permira funds and Sky.This is the age of infinite ...Show moreLast updated: 9 days ago

Promoted

Site Reliability Engineer

Datum Technologies GroupDelhi, Delhi, India

Job Title : Site Reliability Engineer (SRE) – AWS Experience : 8+ years Location : Chennai / Mumbai Work Mode : Hybrid Key Skills : AWS, Terraform, Kubernetes, Docker, Grafana, Prometheus, Datadog J...Show moreLast updated: 8 days ago

Promoted
New!

Site Reliability Engineer

KarixDelhi, Delhi, India

Role : Site Reliability Engineer Location : Bangalore (WFO) About the role : We are seeking an experienced professional Site Reliability Engineer who acts as a bridge between development and IT o...Show moreLast updated: 2 hours ago

Promoted

Site Reliability Engineer

FlipkartDelhi, India

Hiring Site Reliability Engineers.Excluding internship] Location : Bangalore.The engineer will work in the Reliability and Productivity Engineering team and is responsible for building industry sta...Show moreLast updated: 5 days ago

Promoted

Senior Site Reliability Engineer

OneAdvancedDelhi, India

We’re looking for a Senior SRE Automation Engineer to lead and drive automation across the operations lifecycle.The ideal candidate will be responsible for identifying and implementing automation o...Show moreLast updated: 11 days ago

Promoted

Site Reliability Engineer

VXI Global SolutionsDelhi, India

We are looking for a Site Reliability Engineer with 3+ years for Experience into design, implement, and manage robust observability solutions across our cloud infrastructure and applications.The id...Show moreLast updated: 1 day ago

Promoted

Site Reliability Engineer

People Prime WorldwideDelhi, IN

Our client is a French multinational information technology (IT) services and consulting company, headquartered in Paris, France. Founded in 1967, It has been a leader in business transformation for...Show moreLast updated: 30+ days ago

Promoted

Site Reliability Engineer

HRhelpdeskDelhi, India

Company is a rapidly growing, private equity backed SaaS product company and provides cloud-based solutions.Job Summary : As a Site Reliability Engineer (SRE), you will be responsible for building ...Show moreLast updated: 5 days ago