Talent.com
Senior Service Reliability Engineer

Senior Service Reliability Engineer

Elios TalentHyderabad, Republic Of India, IN
15 hours ago
Job description

Senior Service Delivery Engineer

☀️ Key Highlights

🔧 Own service delivery operations across incident response, change control, and reliability for high-availability platforms

⚡ Lead major incident management , driving rapid triage, clear communication, and cross-team coordination

📊 Improve operational maturity through KPIs, reporting, RCA processes, playbooks, and structured workflows

🔄 Champion DevOps culture , automation, tooling integrations, and process efficiency across engineering

🚀 Drive continuous improvement across ITIL, Agile, and SRE practices to enhance performance, stability, and scalability

We’re seeking a Senior Service Delivery Engineer to drive operational excellence across reliability, incident response, change management, automation, and cross-team coordination. This role sits at the center of availability, performance, and process governance—partnering with engineering teams to keep mission-critical platforms stable, secure, and predictable.

What You’ll Do

Service Delivery & Coordination

  • Act as the central point of contact for the SRE practice, unblocking teams and supporting smooth delivery
  • Own change management processes, lead CAB meetings, and approve production changes
  • Define escalation paths, operational workflows, and ITIL severity guidelines
  • Lead, communicate, and coordinate all high-severity incidents during on-call rotations

Operational Excellence

  • Track, document, and organize work intake, project progress, timelines, and risks
  • Create RunBooks, Playbooks, release plans, and validate processes through simulations
  • Define KPIs, service metrics, and produce weekly / monthly reporting
  • Ensure effective RCA execution, track action items, and prevent repeat incidents
  • Incident, Change & Problem Management

  • Respond to major incidents with <
  • 5-minute reaction time, providing clear and frequent updates

  • Collaborate with cross-functional teams through a follow-the-sun support model
  • Host CAB, review / approve emergency and scheduled changes, and maintain 24 / 7 readiness
  • Lead the RCA process, analyze patterns, close problem tickets, and share monthly reports
  • Automation & Process Improvement

  • Build automation scripts and integrations to reduce toil and improve reliability
  • Research new operational methodologies and apply them to improve agility and alignment
  • Enhance configuration and usage of tools like ServiceNow, PagerDuty, Slack, Exigence, and JIRA
  • Drive initiatives such as Slack–ServiceNow integrations, workflow optimization, and monitoring enhancements
  • Tooling, Reporting & Roster Management

  • Maintain incident and major-incident tools, ensuring accuracy and proper configuration
  • Provide daily / weekly / monthly service reports to leadership and stakeholders
  • Keep on-call rosters accurate and up-to-date across multiple teams
  • What You Bring

  • Deep experience in service delivery, incident response, change management, or SRE operations
  • Strong understanding of ITIL, Agile, DevOps, and high-availability environments
  • Proven ability to lead high-severity bridges and coordinate multi-team resolutions
  • Hands-on experience with ServiceNow, Slack, PagerDuty, JIRA, and automated workflows
  • Excellent communication, documentation, and cross-team collaboration skills
  • About Us

    We are a global engineering organization focused on building scalable, reliable, and high-performing digital systems for enterprise clients. Our teams operate in complex environments where performance, uptime, and customer experience matter. We combine technical rigor with structured operational practices to support systems used by millions every day.

    Why Join Us

    You’ll work at the heart of mission-critical operations, shaping reliability, incident strategy, and delivery processes. You’ll collaborate with strong engineers, influence tooling and automation, and play a key role in improving service health across the organization. This role offers ownership, visibility, and the ability to drive meaningful, measurable impact.

    Create a job alert for this search

    Senior Reliability Engineer • Hyderabad, Republic Of India, IN

    Related jobs
    • Promoted
    • New!
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Elios TalentHyderabad, Telangana, India
    Senior Site Reliability Engineer Key Highlights ️ Build, scale, and optimize cloud-native infrastructure powering global, high-availability platforms ⚡ Drive automation-first engineering across AW...Show moreLast updated: 15 hours ago
    • Promoted
    Senior DevOps & Database Reliability Engineer – 100% Remote

    Senior DevOps & Database Reliability Engineer – 100% Remote

    Hyly.AIHyderabad, IN
    Remote
    AI, we’re building the first AI + Data Fabric for the multifamily industry, transforming how clients manage, secure, and scale their marketing and operational data. As the industry moves toward a co...Show moreLast updated: 7 days ago
    • Promoted
    Service Reliability Expert

    Service Reliability Expert

    NationsBenefits IndiaHyderabad, Republic Of India, IN
    Site Reliability Engineer (SRE) | Fintech | Kubernetes | Datadog |.SRE team focused on maintaining the performance, reliability, and availability of our fintech platforms.Triage and resolve product...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    inTune Systems IncHyderabad, Telangana, India
    SRE / App Support Engineer Location Hyderabad Job Summary : We are looking for a Senior Site Reliability Engineer (SRE) to join our growing Engineering team. As an SRE, you will play a key role in en...Show moreLast updated: 15 hours ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    Elios TalentHyderabad, Telangana, India
    Site Reliability Engineer Key Highlights ️ Build, automate, and support cloud-native infrastructure powering high-availability platforms ⚡ Contribute to automation-first engineering across AWS, Te...Show moreLast updated: 15 hours ago
    • Promoted
    • New!
    Reliability Engineer

    Reliability Engineer

    Elios TalentHyderabad, Republic Of India, IN
    Build, automate, and support cloud-native infrastructure powering high-availability platforms.Contribute to automation-first engineering across AWS, Terraform, CI / CD, and observability tooling.Impr...Show moreLast updated: 15 hours ago
    • Promoted
    • New!
    Senior Service Delivery Engineer

    Senior Service Delivery Engineer

    Elios TalentHyderabad, Telangana, India
    Senior Service Delivery Engineer ☀️ Key Highlights Own service delivery operations across incident response, change control, and reliability for high-availability platforms ⚡ Lead major inciden...Show moreLast updated: 15 hours ago
    • Promoted
    • New!
    Reliability Operations Engineer

    Reliability Operations Engineer

    Elios TalentHyderabad, Republic Of India, IN
    Intermediate Service Delivery Engineer.Support incident response, change control, and day-to-day service delivery operations. Assist triage and coordination during high-severity incidents.Contribute...Show moreLast updated: 15 hours ago
    • Promoted
    • New!
    Senior Infrastructure Reliability Engineer

    Senior Infrastructure Reliability Engineer

    Elios TalentHyderabad, Republic Of India, IN
    Senior Site Reliability Engineer.Build, scale, and optimize cloud-native infrastructure powering global, high-availability platforms. Drive automation-first engineering across AWS, Terraform, CI / CD,...Show moreLast updated: 15 hours ago
    • Promoted
    SRE (Site Reliability Engineer)

    SRE (Site Reliability Engineer)

    Tata Consultancy ServicesHyderabad, Republic Of India, IN
    Kubernetes (Any cloud) + PostgresSQL, SQL(Must).Linux (Optional), Java (Optional), Kubernetes (CLI), Prior Production support experience, Release Management, Prior Deployment experience,.Show moreLast updated: 5 days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    AutoRABITHyderabad, Telangana, India
    AutoRABIT is the leader in DevSecOps for SaaS platforms such as Salesforce.Its unique metadata-aware capability makes Release Management, Version Control, and Backup & Recovery complete, reliable, ...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Infrastructure Reliability Engineer

    Senior Infrastructure Reliability Engineer

    AutoRABITHyderabad, Republic Of India, IN
    AutoRABIT is the leader in DevSecOps for SaaS platforms such as Salesforce.Its unique metadata-aware capability makes Release Management, Version Control, and Backup & Recovery complete, reliable, ...Show moreLast updated: 16 days ago
    • Promoted
    • New!
    Platform Reliability Engineer

    Platform Reliability Engineer

    Elios TalentHyderabad, Republic Of India, IN
    Build, automate, and support cloud-native infrastructure powering high-availability platforms.Contribute to automation-first engineering across AWS, Terraform, CI / CD, and observability tooling.Impr...Show moreLast updated: 15 hours ago
    • Promoted
    • New!
    Lead Reliability Engineer

    Lead Reliability Engineer

    Elios TalentHyderabad, Republic Of India, IN
    Senior Site Reliability Engineer.Build, scale, and optimize cloud-native infrastructure powering global, high-availability platforms. Drive automation-first engineering across AWS, Terraform, CI / CD,...Show moreLast updated: 15 hours ago
    • Promoted
    • New!
    Intermediate Service Delivery Engineer

    Intermediate Service Delivery Engineer

    Elios TalentHyderabad, Telangana, India
    Intermediate Service Delivery Engineer.Support incident response, change control, and day-to-day service delivery operations. Assist triage and coordination during high-severity incidents.Contribute...Show moreLast updated: 17 hours ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    TMUS Global SolutionsHyderabad, Republic Of India, IN
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 30+ days ago
    • Promoted
    Reliability Engineer

    Reliability Engineer

    InspireHyderabad, Republic Of India, IN
    Inspire Brands is disrupting the restaurant industry through digital transformation and operational efficiencies.The company’s technology hub, Inspire Brands Hyderabad Support Center, India, will l...Show moreLast updated: 16 days ago
    • Promoted
    Hosting Reliability Engineer

    Hosting Reliability Engineer

    MathWorksHyderabad, Telangana, India
    Would you like to join a team making a positive impact at MathWorks? IT Hosting is modernizing our infrastructure and the way we operate it. You will be responsible for designing, deploying, maintai...Show moreLast updated: 6 days ago