Talent.com
Site Reliability Engineer

Site Reliability Engineer

Zyoin GroupChennai
1 day ago
Job description

Description :

MoneyForward is seeking a Site Reliability Engineer (SRE) to lead the reliability, scalability, and performance of our products. This role involves making critical technical decisions, collaborating with development and platform engineering teams, and ensuring that our systems remain resilient and scalable to support stable business growth.

Responsibilities :

  • Service Reliability and Scalability : Design, build, and maintain highly available production services; define and implement SLOs / SLIs; perform capacity planning and optimize bottlenecks.
  • Incident Management : Lead incident response, conduct postmortems / root cause analysis, and improve on-call operations.
  • Automation and Operational Efficiency : Automate tasks with Infrastructure as Code (Terraform); implement self-healing and auto-scaling systems; optimize CI / CD pipelines.
  • Observability and Monitoring : Implement monitoring, logging, and tracing strategies using tools like Prometheus, OpenTelemetry, Grafana, and Datadog.
  • Leadership : Drive SRE practices across teams, act as a technical advisor, and guide developers in adopting reliability best practices.
  • Collaboration : Work closely with SREs, platform engineers, and developers to improve infrastructure, reliability, and operational efficiency.

Requirements :

  • Experience in Site Reliability Engineering, DevOps, or Infrastructure Engineering.
  • Strong coding skills (e.g., Python, Go, Java, Rust, C++, Ruby, etc.) - shell scripting alone is not sufficient.
  • Experience operating Kubernetes in production environments.
  • Hands-on with Infrastructure as Code (Terraform, Crossplane) and CI / CD automation tools (ArgoCD, CircleCI, GitHub Actions).
  • Familiarity with cloud platforms (AWS or others) and cloud-native architectures.
  • Strong knowledge of observability tools (Prometheus, OpenTelemetry, Grafana, Datadog).
  • Experience in incident management, disaster recovery, and high-availability strategies.
  • Proven technical leadership and project management skills.
  • Preferred Qualifications :

  • Experience fostering SRE best practices within organizations.
  • Deep understanding of microservice architectures.
  • Proficiency in Go or Python for automation / tooling.
  • Contributions to CNCF or open-source projects.
  • (ref : hirist.tech)

    Create a job alert for this search

    Site Reliability Engineer • Chennai

    Related jobs
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Tata Consultancy ServicesChennai, Tamil Nadu, India
    Role : Site Reliability Engineer.Locations : Chennai / Pune / Kolkata.Show moreLast updated: 11 days ago
    • Promoted
    Sr Engineer, Site Reliability [T500-21295]

    Sr Engineer, Site Reliability [T500-21295]

    TMUS Global Solutionschennai, tamil nadu, in
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 1 day ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    ConfidentialChennai
    A Site Reliability Engineer is a professional who plays a crucial role in maintaining the reliability and performance of computer systems in an organization. They bridge the gap between development ...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer / Architect - CI / CD Pipeline

    Site Reliability Engineer / Architect - CI / CD Pipeline

    Cling Multi SolutionsChennai
    Job Description : Role : Site Reliability Engineer (SRE) Location : Bangalore / Chennai / Pune (Hybrid) Experience : 5+ y...Show moreLast updated: 24 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CapgeminiChennai, IN
    Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues...Show moreLast updated: 12 days ago
    • Promoted
    AWS Site Reliability Engineer

    AWS Site Reliability Engineer

    HTC Global ServicesChennai, Tamil Nadu, India
    Troy, Michigan, is a leading global Information Technology solution and BPO provider.HTC assists clients across multiple industry verticals, offering turnkey project lifecycle in, e-business, data ...Show moreLast updated: 2 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CodeKarmachennai, tamil nadu, in
    Site Reliability Engineer (Multi-Cloud Deployments).CodeKarma is redefining how engineering teams understand and evolve complex systems — bringing production context directly into the developer’s w...Show moreLast updated: 23 days ago
    • Promoted
    • New!
    Principal Site Reliability Engineer

    Principal Site Reliability Engineer

    ConfidentialChennai, India
    Optum is a global organization that delivers care, aided by technology to help millions of people live healthier lives.The work you do with our team will directly improve health outcomes by connect...Show moreLast updated: 9 hours ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CitNOW GroupChennai, IN
    Founded in 2008, CitNOW is an innovative, enterprise-level software product suite that allows automotive dealerships globally to sell more vehicles and parts more profitably.CitNOW’s app-based plat...Show moreLast updated: 1 day ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    ConfidentialChennai, India
    We're looking for an experienced Site Reliability Engineer to fill the mission-critical role of ensuring that our complex, web-scale systems are healthy, monitored, automated, and designed to scale...Show moreLast updated: 30+ days ago
    • Promoted
    Staff Site Reliability Engineer

    Staff Site Reliability Engineer

    PoshmarkChennai, Tamil Nadu, India
    We’re looking for an experienced.You will use your background as an operations generalist to work closely with our development teams from the early stages of design all the way through identifying ...Show moreLast updated: 15 days ago
    • Promoted
    Senior Site Reliability Engineer- ELK Expert

    Senior Site Reliability Engineer- ELK Expert

    iVedha Inc.Chennai, IN
    Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone. Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 30+ days ago
    • Promoted
    Poshmark - Senior Site Reliability Engineer - Cloud Infrastructure

    Poshmark - Senior Site Reliability Engineer - Cloud Infrastructure

    POSHMARKChennai
    Job Description : Were looking for an experienced Site Reliability Engineer to fill the mission-critical role of ensuring that our complex, web-scale systems ...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    IntraEdgeChennai, IN
    Strong leadership and people management skills.Exceptional technical proficiency in Pearson's technology stack.Strategic thinking with a focus on long-term operational excellence.Champion operation...Show moreLast updated: 15 days ago
    • Promoted
    Lead Site Reliability Engineer

    Lead Site Reliability Engineer

    ConfidentialChennai, India
    Join our software, system, and test engineering group as a.Lead Site Reliability Engineer.AWS infrastructure, automating CI / CD pipelines, and ensuring scalable, reliable deployments.You will levera...Show moreLast updated: 6 days ago
    • Promoted
    • New!
    Site Reliability Engineer (SRE) – Infrastructure & Automation

    Site Reliability Engineer (SRE) – Infrastructure & Automation

    InstaServiceChennai, IN
    InstaService is revolutionizing the home services industry through AI-driven technology, connecting customers with trusted professionals instantly. We’re growing fast across 23+ states and expanding...Show moreLast updated: 11 hours ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    ElgebraChennai
    Role Overview : We are seeking a highly experienced and technically proficient Site Reliability Engineer (SRE) to join our team in support of our c...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer - Elastic Kubernetes Service

    Site Reliability Engineer - Elastic Kubernetes Service

    MNR SolutionsChennai
    Description : Site Reliability Engineer (SRE) Kubernetes & Cloud Position Summary : We are seeking a...Show moreLast updated: 11 days ago