Site Reliability Manager

Skedara Technology, Bengaluru, India
2 days ago
Job description

We are looking for an experienced Operations Manager to lead a 24/7 support team responsible for the continuous availability and reliability of critical business applications and data pipelines across cloud and on-premises environments.

The ideal candidate is technically hands-on, process-driven, and capable of maintaining high operational standards through strong incident management, governance, and team leadership.

Key Responsibilities

  • Lead and manage the 24/7 operations team, ensuring proactive monitoring, incident resolution, and seamless shift handovers.
  • Drive Root Cause Analysis (RCA) processes and ensure all incidents are followed through to closure with preventive measures.
  • Establish structured reporting and governance mechanisms, including daily status updates, trend analysis, and performance dashboards.
  • Ensure SLA compliance, operational stability, and continuous improvement across all supported applications.
  • Act as the primary escalation point for production issues, coordinating with cross-functional teams to restore services promptly.
  • Collaborate with development, cloud, and data engineering teams to identify recurring issues and implement permanent fixes.
  • Maintain updated runbooks, SOPs, and shift documentation to ensure consistency and operational readiness.
  • Mentor and guide team members, fostering a culture of ownership, accountability, and service excellence.

Required Skills & Experience

  • 7–10 years of experience in IT operations, production support, or service delivery roles.
  • Proven expertise in incident management, RCA, and operational reporting.
  • Strong understanding of monitoring frameworks, alerting systems, and escalation processes in a 24/7 environment.
  • Hands-on experience with SQL, scripting (Python/PowerShell), or automation tools for issue triage and process improvement.
  • Excellent communication and stakeholder management skills, with experience collaborating across multiple teams and time zones.
  • Demonstrated ability to lead a multi-shift operations team with a focus on quality, discipline, and reliability.

Nice to Have

  • Experience with Power BI, Grafana, or similar tools for performance and incident dashboards.
  • Exposure to data flow monitoring (SFTP/API), job scheduling, or alert automation platforms.
  • Familiarity with ITIL practices (Incident, Problem, Change Management).
  • Working knowledge of cloud environments (Azure, AWS, or GCP) and ETL/data pipeline operations.