Talent.com
Senior Site Reliability Engineer (SRE)

Voya India • Kollam, IN
4 hours ago
Job description

About the position

We are seeking a strategic and technically adept leader to drive the scalability, resilience, and operational excellence of our enterprise systems. This role will set the vision for site reliability engineering (SRE) practices, observability frameworks, and performance optimization, ensuring our digital platforms are robust, measurable, and aligned to business priorities. You will collaborate across product, engineering, and infrastructure teams to deliver highly available, high-performing systems that meet the demands of a modern digital enterprise.

Responsibilities

  • Set strategy and lead delivery of scalable, resilient systems across cloud and on-premises environments.
  • Define and govern reliability standards (SLAs, SLOs, error budgets) and embed them into development practices.
  • Implement observability at scale (logs, metrics, traces) to drive real-time visibility and actionable insights.
  • Lead performance engineering initiatives including capacity planning, load testing, and tuning of critical applications.
  • Drive incident management practices: proactive detection, streamlined response, and a culture of learning through postmortems.
  • Champion automation in monitoring, alerting, CI/CD pipelines, and infrastructure provisioning.
  • Partner across functions (product, engineering, DevOps, security, architecture) to align reliability goals with business priorities.
  • Influence enterprise architecture decisions with a reliability-first perspective, including platform modernization efforts.
  • Mentor and develop engineers, fostering a culture of technical excellence, accountability, and continuous improvement.
  • Represent reliability in executive forums, providing clear insights into system health, risks, and roadmap implications.

Qualifications

  • 10+ years of experience in systems engineering, site reliability engineering, or infrastructure architecture.
  • Expertise in distributed systems and cloud platforms (AWS, Azure, GCP).
  • Deep knowledge of observability tooling (Datadog, Prometheus, Grafana, OpenTelemetry, etc.).
  • Strong programming background (e.g., Java, Python, Node.js, or similar).
  • Proven leadership of cross-functional technical initiatives at scale.
  • Experience with CI/CD, infrastructure-as-code (Terraform, Ansible, etc.), and automation frameworks.
  • Strong communicator with the ability to translate technical reliability goals into business outcomes.