Site Reliability Engineer

Zyoin Group, Chennai

1 day ago

Job description

Description:

MoneyForward is seeking a Site Reliability Engineer (SRE) to lead the reliability, scalability, and performance of our products. This role involves making critical technical decisions, collaborating with development and platform engineering teams, and ensuring that our systems remain resilient and scalable to support stable business growth.

Responsibilities:

Service Reliability and Scalability: Design, build, and maintain highly available production services; define and implement SLOs/SLIs; perform capacity planning and remove bottlenecks.
Incident Management: Lead incident response, conduct postmortems and root cause analysis, and improve on-call operations.
Automation and Operational Efficiency: Automate tasks with Infrastructure as Code (Terraform); implement self-healing and auto-scaling systems; optimize CI/CD pipelines.
Observability and Monitoring: Implement monitoring, logging, and tracing strategies using tools like Prometheus, OpenTelemetry, Grafana, and Datadog.
Leadership: Drive SRE practices across teams, act as a technical advisor, and guide developers in adopting reliability best practices.
Collaboration: Work closely with SREs, platform engineers, and developers to improve infrastructure, reliability, and operational efficiency.

Requirements:

Experience in Site Reliability Engineering, DevOps, or Infrastructure Engineering.

Strong coding skills (e.g., Python, Go, Java, Rust, C++, Ruby); shell scripting alone is not sufficient.

Experience operating Kubernetes in production environments.

Hands-on experience with Infrastructure as Code (Terraform, Crossplane) and CI/CD automation tools (ArgoCD, CircleCI, GitHub Actions).

Familiarity with cloud platforms (AWS or others) and cloud-native architectures.

Strong knowledge of observability tools (Prometheus, OpenTelemetry, Grafana, Datadog).

Experience in incident management, disaster recovery, and high-availability strategies.

Proven technical leadership and project management skills.

Preferred Qualifications:

Experience fostering SRE best practices within organizations.

Deep understanding of microservice architectures.

Proficiency in Go or Python for automation and tooling.

Contributions to CNCF or open-source projects.

(ref: hirist.tech)
