Talent.com
Site Reliability Engineering Manager

People Hire Consulting, Ludhiana, Punjab, IN
12 hours ago
Job description

We are looking for a Manager, Site Reliability Engineering, to help us scale our systems and ensure the stability, reliability, performance, and rapid deployment of our platform. We build teams that are inclusive, collaborative, and have a strong sense of ownership for the things they build. If you have a passion and track record for solving problems, along with strong leadership skills, this is a great fit for you.

As Manager, SRE, you will demonstrate both emerging and current technologies, methods, and processes, contributing to the evolution of software deployment processes, enhancing security, reducing risk, and improving the overall end-user experience. As part of the Technology R&D Team, you will play an integral part in advancing DevOps maturity and be part of a new culture of quality and site reliability. You will continually improve our CI / CD tools, processes, and procedures. You will also be responsible for regular reporting to Senior Technology Leaders and providing updates on organizational risk exposure and risk-related issues.

What You Will Be Doing:

  • Set the direction and strategy for your team, and help shape the overall SRE program for the company

  • Support the growth by ensuring a robust, scalable, cloud-first infrastructure
  • Own site stability, performance and capacity planning
  • Participate early in the SDLC to ensure reliability is built in from the beginning, and create plans for successful implementations / launches

  • Foster a learning and ownership culture within the team and the larger organization
  • Ensure best engineering practices through automation, infrastructure as code, robust system monitoring, alerting, auto scaling, self-healing, etc.

  • Manage complex technical projects and a team of SREs
  • Recruit and develop staff; build a culture of excellence in site reliability and automation
  • Lead by example – roll up your sleeves by debugging and coding; participate in the on-call rotation and occasional travel

  • Represent the technology perspective and priorities to leadership and other stakeholders by continuously communicating timeline, scope, risks, and technical road map

What You Will Need for This Position:

  • 10+ years of hands-on technical leadership and people management experience
  • 3+ years of demonstrable experience leading site reliability and performance in large-scale, high-traffic environments

  • Strong leadership, communication and interpersonal skills geared to getting things done
  • Commitment to developing yourself and the talent within your charge, fostering and creating opportunity for the team

  • Architect-level understanding of one or more of the major public cloud services (AWS, GCP or Azure), using them to effectively design secure and scalable services

  • Strong understanding of SRE concepts and the DevOps culture, with a focus on leveraging software engineering tools, methodologies and concepts

  • In-depth understanding of automation and CI / CD processes, along with excellent reasoning and problem-solving skills

  • Experience with Unix / Linux environments with a deep grasp of system internals
  • Experience working on large-scale distributed systems, including multi-tiered architectures
  • Strong knowledge of modern platforms like Fargate, Docker, Kubernetes, etc.
  • Experience working with monitoring tools (Datadog, New Relic, ELK stack, etc.) and database technologies (SQL Server, Postgres and Couchbase preferred)

  • Demonstrated breadth of understanding and development of solutions based on multiple technologies, including networking, cloud, databases, and scripting languages

  • Experience in prompt engineering, building AI Agents, or MCP is a plus.