Talent.com
Manager - Site Reliability

ZORTECH SOLUTIONS PRIVATE LIMITED, Hyderabad
13 days ago
Job description

Job Title : Site Reliability Engineering (SRE) Manager

Location : Hyderabad

Employment Type : Full-Time

Work Model : Hybrid (3 days from office)

Summary :

The SRE Manager will lead the reliability engineering function, ensuring infrastructure resiliency and optimal operational performance. This hybrid role blends technical leadership with team mentorship and cross-functional coordination.

Experience Required :

10+ years of total experience, including 3+ years in a leadership role in SRE or Cloud Operations.

Technical Knowledge and Skills :

Mandatory :

  • Deep understanding of Kubernetes, GKE, Prometheus, Terraform
  • Cloud : Advanced GCP administration
  • CI/CD : Jenkins, Argo CD, GitHub Actions
  • Incident Management : Full lifecycle, with tools such as OpsGenie

Nice to Have :

  • Knowledge of service mesh and observability stacks
  • Strong scripting skills (Python, Bash)
  • BigQuery / Dataflow exposure for telemetry

Scope :

  • Build and lead a team of SREs
  • Standardize practices for reliability, alerting, and response
  • Engage with Engineering and Product leaders

Roles and Responsibilities :

  • Establish and lead the implementation of organizational reliability strategies, aligning SLAs, SLOs, and Error Budgets with business goals and customer expectations.
  • Develop and institutionalize incident response frameworks, including escalation policies, on-call scheduling, service ownership mapping, and RCA process governance.
  • Lead technical reviews for infrastructure reliability design, high-availability architectures, and resiliency patterns across distributed cloud services.
  • Champion observability and monitoring culture by standardizing tooling, alert definitions, dashboard templates, and telemetry data schemas across all product teams.
  • Drive continuous improvement through operational maturity assessments, toil elimination initiatives, and SRE OKRs aligned with product objectives.
  • Collaborate with cloud engineering and platform teams to introduce self-healing systems, capacity-aware autoscaling, and latency-optimized service mesh patterns.
  • Act as the principal escalation point for reliability-related concerns and ensure incident retrospectives lead to measurable improvements in uptime and MTTR.
  • Own runbook standardization, capacity planning, failure mode analysis, and production readiness reviews for new feature launches.
  • Mentor and develop a high-performing SRE team, fostering a proactive ownership culture, encouraging cross-functional knowledge sharing, and establishing technical career pathways.
  • Collaborate with leadership, delivery, and customer stakeholders to define reliability goals, track performance, and demonstrate ROI on SRE investments.

(ref : hirist.tech)
