Talent.com
No longer accepting applications
Senior Site Reliability Engineer (SRE)

Senior Site Reliability Engineer (SRE)

Voya Indiakollam, kerala, in
1 day ago
Job description

About the position

We are seeking a strategic and technically adept leader to drive the scalability, resilience, and operational excellence of our enterprise systems. This role will set the vision for site reliability engineering (SRE) practices, observability frameworks, and performance optimization, ensuring our digital platforms are robust, measurable, and aligned to business priorities. You will collaborate across product, engineering, and infrastructure teams to deliver highly available, high-performing systems that meet the demands of a modern digital enterprise.

Responsibilities

  • Set strategy and lead delivery of scalable, resilient systems across cloud and on-premise environments.
  • Define and govern reliability standards (SLAs, SLOs, error budgets) and embed them into development practices.
  • Implement observability at scale (logs, metrics, traces) to drive real-time visibility and actionable insights.
  • Lead performance engineering initiatives including capacity planning, load testing, and tuning of critical applications.
  • Drive incident management practices — proactive detection, streamlined response, and a culture of learning through postmortems.
  • Champion automation in monitoring, alerting, CI / CD pipelines, and infrastructure provisioning.
  • Partner across functions (product, engineering, DevOps, security, architecture) to align reliability goals with business priorities.
  • Influence enterprise architecture decisions with a reliability-first perspective, including platform modernization efforts.
  • Mentor and develop engineers, fostering a culture of technical excellence, accountability, and continuous improvement.
  • Represent reliability in executive forums, providing clear insights into system health, risks, and roadmap implications.

Qualifications

  • 10+ years of experience in systems engineering, site reliability engineering, or infrastructure architecture.
  • Expertise in distributed systems and cloud platforms (AWS, Azure, GCP).
  • Deep knowledge of observability tooling (Datadog, Prometheus, Grafana, OpenTelemetry, etc.).
  • Strong programming background (e.g., Java, Python, Node.js, or similar).
  • Proven leadership of cross-functional technical initiatives at scale.
  • Experience with CI / CD, infrastructure-as-code (Terraform, Ansible, etc.), and automation frameworks.
  • Strong communicator with the ability to translate technical reliability goals into business outcomes.
  • Create a job alert for this search

    Senior Site Reliability Engineer • kollam, kerala, in

    Related jobs
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Awign Expertalappuzha, India
    Position : SRE Observability Engineer.Mandatory Skills : Observability, Grafana and Writing queries using Prometheus and Loki. We are seeking a highly experienced and driven Senior Observability Engin...Show moreLast updated: 1 day ago
    • Promoted
    Senior Site Reliability Engineer - Azure Kubernetes Service

    Senior Site Reliability Engineer - Azure Kubernetes Service

    PeoplefyTrivandrum
    Description : Site Reliability Engineer (SRE) - Azure / AKS Lead Role Overview : This is a senior technical leadership role fo...Show moreLast updated: 2 days ago
    • Promoted
    Lead Engineer

    Lead Engineer

    HyqooThiruvananthapuram, IN
    Design, deploy, and manage AWS cloud infrastructure, including EC2 instances, S3 buckets, VPCs, RDS databases, and Lambda functions. Assist in the design, implementation, and maintenance of backup, ...Show moreLast updated: 12 days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    PeoplefyThiruvananthapuram, Kerala, India
    We’re looking for an SRE who can.Define SLIs / SLOs for Tier-0 / Tier-1 services & review quarterly.Change gating via CI / CD based on error budgets. Azure Monitor / Grafana / Prometheus / App Insights da...Show moreLast updated: 2 days ago
    • Promoted
    • New!
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Voya IndiaAlleppey, Republic Of India, IN
    We are seeking a strategic and technically adept leader to drive the scalability, resilience, and operational excellence of our enterprise systems. This role will set the vision for site reliability...Show moreLast updated: 21 hours ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    ConfidentialThiruvananthapuram, Thiruvananthapuram / Trivandrum, India
    Job Title : Senior Site Reliability Engineer (SRE II).Location : Thiruvananthapuram, KL (Hybrid 3 days Onsite).We're looking for an experienced. Senior Site Reliability Engineer.The ideal candidate ha...Show moreLast updated: 17 days ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    PhonePethiruvananthapuram, India
    SRE We are looking for engineers who are passionate about reliability, performance, and efficiency, and with experience in building tools, services, and automation to manage and improve production ...Show moreLast updated: 16 hours ago
    • Promoted
    Site Reliability Engineer - DevOps

    Site Reliability Engineer - DevOps

    Aim Plus Staffing SolutionsThiruvananthapuram
    Mandatory skills : We are seeking a highly skilled Site Reliability Engineer (SRE) with strong expertise in Google Cloud Platform (GCP) and CI / CD automation to lead cloud infra...Show moreLast updated: 15 days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    ConfidentialThiruvananthapuram, Thiruvananthapuram / Trivandrum, India
    We're looking for an SRE who can.Define SLIs / SLOs for Tier-0 / Tier-1 services & review quarterly.Change gating via CI / CD based on error budgets. Azure Monitor / Grafana / Prometheus / App Insights da...Show moreLast updated: 3 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Datum Technologies GroupKollam, Republic Of India, IN
    Job Title : Site Reliability Engineer (SRE) – AWS.AWS, Terraform, Kubernetes, Docker, Grafana, Prometheus, Datadog.We are looking for a skilled Site Reliability Engineer (SRE) with strong AWS experi...Show moreLast updated: 9 days ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    KarixKollam, Republic Of India, IN
    We are seeking an experienced professional Site Reliability Engineer who acts as a bridge between development and IT operations, taking operational tasks to ensure the efficient functioning of Serv...Show moreLast updated: 21 hours ago
    • Promoted
    Sr Support Engineer

    Sr Support Engineer

    McLaren Strategic Solutions (MSS)Kollam, IN
    We are looking for a Java + AWS DevOps Support Engineer with strong technical expertise and hands-on experience in both development and support roles. The ideal candidate will have a solid understan...Show moreLast updated: 1 day ago
    • Promoted
    Senior Site Reliability Engineer (C# / Python)

    Senior Site Reliability Engineer (C# / Python)

    EntechThiruvananthapuram, IN
    Senior Software Site Reliability Engineer (C# / Python).You’ll ensure enterprise systems are reliable, scalable, and performant - driving improvements, leading SRE initiatives, and mentoring teams on...Show moreLast updated: 2 days ago
    • Promoted
    Senior DevOps & Database Reliability Engineer – 100% Remote

    Senior DevOps & Database Reliability Engineer – 100% Remote

    Hyly.AIThiruvananthapuram, IN
    Remote
    AI, we’re building the first AI + Data Fabric for the multifamily industry, transforming how clients manage, secure, and scale their marketing and operational data. As the industry moves toward a co...Show moreLast updated: 9 days ago
    • Promoted
    Senior Dell Boomi Integration Engineer

    Senior Dell Boomi Integration Engineer

    MaitsysThiruvananthapuram, IN
    Job Description : Senior Boomi Integration Engineer.Atom migration (on-prem → cloud), integration development, and ongoing support. Senior Dell Boomi Integration Engineer.Boomi Atom to a cloud-hosted...Show moreLast updated: 3 days ago
    • Promoted
    Site Reliability Engineer (SRE) – Infrastructure & Automation

    Site Reliability Engineer (SRE) – Infrastructure & Automation

    InstaServiceAlappuzha, IN
    InstaService is revolutionizing the home services industry through AI-driven technology, connecting customers with trusted professionals instantly. We’re growing fast across 23+ states and expanding...Show moreLast updated: 15 days ago
    • Promoted
    • New!
    Solutions Engineer - SRE - Remote

    Solutions Engineer - SRE - Remote

    datavrutiAlappuzha, IN
    Remote
    Role : Solutions Engineer (SRE / DevOps).A fast-growing AI-driven reliability engineering startup helping organizations reduce downtime by improving incident investigation, root-cause analysis, and ...Show moreLast updated: 3 hours ago
    • Promoted
    Senior GenAI Engineer

    Senior GenAI Engineer

    Mitra AIKollam, IN
    AI System Design & Development : .Architect, develop, and deploy large-scale Generative AI, LLM-based systems, including intelligent agents and automation workflows. LLM Integration & Optimization : .In...Show moreLast updated: 1 day ago