Talent.com
Site Reliability Engineering Manager

Site Reliability Engineering Manager

People Hire ConsultingChennai, IN
5 hours ago
Job description

Looking for a Manager, Site Reliability Engineering to help us scale our systems and ensure

stability, reliability and performance and rapid deployments of our platform. We build teams that

are inclusive, collaborative, and have a strong sense of ownership for the things they build. If you

have a passion and track record for solving problems; moreover, have strong leadership skills, this is a great fit for you.

As Manager, SRE you will demonstrate both emerging and current technologies, methods, and

processes contributing to the evolution of software deployment processes, enhancing security,

reducing risk, and improving the overall end-user experience. As part of the Technology R&D Team, you will play an integral part in advancing DevOps maturity and be a part of a new culture of quality and site reliability. You will continually improve our CI / CD tools, processes, and procedures. You will also be responsible for regular reporting to Senior Technology Leaders and providing updates on organizational risk exposure and risk related issues.

What You Will Be Doing :

  • Set the direction and strategy for your team, and help shape the overall SRE program for the

company

  • Support the growth by ensuring a robust, scalable, cloud-first infrastructure
  • Own site stability, performance and capacity planning
  • Participate early in the SDLC to ensure reliability is built in from the beginning, and creating
  • plans for successful implementations / launches

  • Foster a learning and ownership culture within the team and the larger organization
  • Ensure best engineering practices through automation, infrastructure as code, robust system
  • monitoring, alerting, auto scaling, self-healing, etc...

  • Manage complex technical projects and a team of SREs
  • Recruit and develop staff; build a culture of excellence in site reliability and automation
  • Lead by example – roll up your sleeves by debugging and coding; participate in on-call rotation
  • & occasional travel

  • Represent the technology perspective and priorities to leadership and other stakeholders by
  • continuously communicating timeline, scope, risks, and technical road map

    What You Will Need for this Position :

  • 10+ years of hands-on technical leadership and people management experience
  • 3+ years of demonstrable experience leading site reliability and performance in large-scale,
  • high-traffic environments

  • Strong leadership, communication and interpersonal skills geared to getting things done
  • Developing themselves and the talent within their charge – fostering and creating
  • opportunity for the team

  • Architect-level understanding of one or more of the major public cloud services (AWS, GCP or
  • Azure), using them to effectively design secure and scalable services

  • Strong understanding of SRE concepts and the DevOps culture, with a focus on leveraging
  • software engineering tools, methodologies and concepts

  • In-depth understanding of automation and CI / CD processes to go along with excellent
  • reasoning and problem-solving skills

  • Experience with Unix / Linux environments with a deep grasp on system internals
  • Worked on large-scale distributed systems including multi-tiered architecture
  • Strong knowledge of modern platforms like Fargate, Docker, Kubernetes etc.
  • Experience working with monitoring tools (Datadog, NewRelic, ELK stack, etc) and Database
  • technologies (SQL Server, Postgres and Couchbase preferred)

  • Validated breadth of understanding and development of solutions based on multiple
  • technologies, including networking, cloud, database, and scripting languages.

  • Experience in prompt engineering, building AI Agents, or MCP is a plus.
  • Create a job alert for this search

    Engineering Manager • Chennai, IN

    Related jobs
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Tata Consultancy ServicesChennai, Tamil Nadu, India
    Role : Site Reliability Engineer.Locations : Chennai / Pune / Kolkata.Show moreLast updated: 15 days ago
    • Promoted
    Sr Engineer, Site Reliability [T500-21295]

    Sr Engineer, Site Reliability [T500-21295]

    TMUS Global Solutionschennai, tamil nadu, in
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 5 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CapgeminiChennai, IN
    Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues...Show moreLast updated: 16 days ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    SynamediaChennai, IN
    At Synamedia, the world’s most talented innovators and trailblazers are shaping the way the world is entertained and informed. We are backed by the Permira funds and Sky.This is the age of infinite ...Show moreLast updated: 5 hours ago
    • Promoted
    AWS Site Reliability Engineer

    AWS Site Reliability Engineer

    HTC Global ServicesChennai, Tamil Nadu, India
    Troy, Michigan, is a leading global Information Technology solution and BPO provider.HTC assists clients across multiple industry verticals, offering turnkey project lifecycle in, e-business, data ...Show moreLast updated: 5 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CodeKarmachennai, tamil nadu, in
    Site Reliability Engineer (Multi-Cloud Deployments).CodeKarma is redefining how engineering teams understand and evolve complex systems — bringing production context directly into the developer’s w...Show moreLast updated: 26 days ago
    • Promoted
    Senior Site Reliability Engineer (SRE) – Datadog Observability

    Senior Site Reliability Engineer (SRE) – Datadog Observability

    Jade Globalchennai, tamil nadu, in
    Senior Site Reliability Engineer (SRE) – Datadog Observability.SRE and Infrastructure Operations with minimum 3.Hyderabad preferable but open for Pune and remote. Site Reliability Engineer (SRE).SRE...Show moreLast updated: 6 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CitNOW GroupChennai, IN
    Founded in 2008, CitNOW is an innovative, enterprise-level software product suite that allows automotive dealerships globally to sell more vehicles and parts more profitably.CitNOW’s app-based plat...Show moreLast updated: 5 days ago
    • Promoted
    Senior Manager, Site Reliability Engineering

    Senior Manager, Site Reliability Engineering

    ConfidentialChennai, India
    Sam's Club In Club Systems team is seeking a senior engineering leader to drive the transformation of Club systems, improving associate and member experience. The role involves close collaboration w...Show moreLast updated: 2 days ago
    • Promoted
    • New!
    Sr Manager, Site Reliability Engineering

    Sr Manager, Site Reliability Engineering

    ConfidentialChennai, India
    PayPal has been revolutionizing commerce globally for more than 25 years.Creating innovative experiences that make moving money, selling, and shopping simple, personalized, and secure, PayPal empow...Show moreLast updated: 4 hours ago
    • Promoted
    Staff Site Reliability Engineer

    Staff Site Reliability Engineer

    PoshmarkChennai, Tamil Nadu, India
    We’re looking for an experienced.You will use your background as an operations generalist to work closely with our development teams from the early stages of design all the way through identifying ...Show moreLast updated: 19 days ago
    • Promoted
    Senior Site Reliability Engineer- ELK Expert

    Senior Site Reliability Engineer- ELK Expert

    iVedha Inc.Chennai, IN
    Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone. Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Manager - Site Reliability Engineering

    Senior Manager - Site Reliability Engineering

    ConfidentialChennai, India
    Lead the implementation and advocacy for SRE (Support Site Reliability Engineer) principles to improve the reliability and availability of our applications. Drive work on setting and maintaining SLI...Show moreLast updated: 10 days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    IntraEdgeChennai, IN
    Strong leadership and people management skills.Exceptional technical proficiency in Pearson's technology stack.Strategic thinking with a focus on long-term operational excellence.Champion operation...Show moreLast updated: 19 days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Nebula Tech Solutionschennai, tamil nadu, in
    SRE team supporting mission-critical applications for our.We’re now looking for engineers who can go beyond operations — those who can. Enhance application reliability through code.Add or modify cod...Show moreLast updated: 6 days ago
    • Promoted
    Site Reliability Engineering Manager

    Site Reliability Engineering Manager

    ConfidentialChennai, India
    Canonical is a leading provider of open-source software and operating systems for global enterprise and technology markets. Our platform, Ubuntu, is very widely used in breakthrough enterprise initi...Show moreLast updated: 10 days ago
    • Promoted
    Site Reliability Engineer (SRE) – Infrastructure & Automation

    Site Reliability Engineer (SRE) – Infrastructure & Automation

    InstaServiceChennai, IN
    InstaService is revolutionizing the home services industry through AI-driven technology, connecting customers with trusted professionals instantly. We’re growing fast across 23+ states and expanding...Show moreLast updated: 4 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    ElgebraChennai
    Role Overview : We are seeking a highly experienced and technically proficient Site Reliability Engineer (SRE) to join our team in support of our c...Show moreLast updated: 30+ days ago