Talent.com
Senior Systems Reliability Engineer

Senior Systems Reliability Engineer

PeoplefyThiruvananthapuram, Republic Of India, IN
21 hours ago
Job description

Greetings from Peoplefy!

We’re looking for an SRE who can own reliability for mission-critical services on Azure , shape standards, lead incidents with calm clarity, and drive engineering excellence across teams

Experience : 10+ years

Location : Trivandrum

Responsibilities :

  • Strong site reliability experience
  • Previously worked as DevOps engineer and at present working as SRE
  • Strong experience in Azure
  • Strong experience with AKS
  • Experience working in docker
  • Experience with observability (Any tool)
  • Experience working on PostgreSQL

SLIs / SLOs & Error Budgets

  • Define SLIs / SLOs for Tier-0 / Tier-1 services & review quarterly
  • Implement multi-window, multi-burn-rate alerts
  • Change gating via CI / CD based on error budgets
  • Maintain Azure Monitor / Grafana / Prometheus / App Insights dashboards
  • Conduct weekly SLO reviews & drive reliability roadmap
  • Incident Management

  • Lead SEV1 / SEV2 incidents , own communication & postmortems
  • Ensure corrective actions are implemented
  • Reliability Engineering

  • Implement DR, multi-AZ / region patterns, HPA / VPA / KEDA, resilient rollouts
  • Cluster hardening (network, identity, policy), optimize density
  • Ingress : AGIC / Nginx
  • Observability

  • Metrics, traces, logs via Azure Monitor, App Insights, Log Analytics, Prometheus, Grafana, OpenTelemetry
  • Alerts on symptoms, not noise
  • Automation & IaC

  • Terraform / Bicep , GitOps (Flux / Argo) , Azure Policy / OPA Gatekeeper
  • Automate toil & build self-service runbooks / chatops
  • CI / CD Reliability

  • Azure DevOps / GitHub Actions with canary, blue-green, rollback
  • Key Vault-backed secrets
  • Performance & Capacity

  • Load testing, autoscaling, FinOps collaboration
  • Disaster Recovery

  • Define RTO / RPO , run chaos drills & game days
  • Security

  • Entra ID, Key Vault rotation, VNets / NSGs, shift-left security in CI
  • Documentation

  • Runbooks, SLOs, postmortems, architectures — kept current & accessible
  • Interested candidates please share your updated resumes on amruta.bu@peoplefy.com

    Create a job alert for this search

    Senior System Engineer • Thiruvananthapuram, Republic Of India, IN

    Related jobs
    • Promoted
    Lead Engineer

    Lead Engineer

    HyqooThiruvananthapuram, IN
    Design, deploy, and manage AWS cloud infrastructure, including EC2 instances, S3 buckets, VPCs, RDS databases, and Lambda functions. Assist in the design, implementation, and maintenance of backup, ...Show moreLast updated: 10 days ago
    • Promoted
    Systems Engineer

    Systems Engineer

    Reflections Info SystemsThiruvananthapuram, Republic Of India, IN
    We’re Hiring for IT Infrastructure Engineer (4+ Years).Manage and maintain Linux servers and related infrastructure.Perform system patching, updates, and security hardening.Monitor performance, imp...Show moreLast updated: 13 days ago
    • Promoted
    Senior Site Reliability Engineer - Azure Kubernetes Service

    Senior Site Reliability Engineer - Azure Kubernetes Service

    PeoplefyTrivandrum
    Description : Site Reliability Engineer (SRE) - Azure / AKS Lead Role Overview : This is a senior technical leadership role fo...Show moreLast updated: 23 hours ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    PeoplefyThiruvananthapuram, Kerala, India
    We’re looking for an SRE who can.Define SLIs / SLOs for Tier-0 / Tier-1 services & review quarterly.Change gating via CI / CD based on error budgets. Azure Monitor / Grafana / Prometheus / App Insights da...Show moreLast updated: 1 day ago
    • Promoted
    Site Reliability Engineer II

    Site Reliability Engineer II

    ConfidentialThiruvananthapuram, Thiruvananthapuram / Trivandrum, India
    The world's top banks use Zafin's integrated platform to drive transformative customer value.Powered by an innovative AI-powered architecture, Zafin's platform seamlessly unifies data from across t...Show moreLast updated: 19 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    ConfidentialThiruvananthapuram, Thiruvananthapuram / Trivandrum, India
    Job Title : Senior Site Reliability Engineer (SRE II).Location : Thiruvananthapuram, KL (Hybrid 3 days Onsite).We're looking for an experienced. Senior Site Reliability Engineer.The ideal candidate ha...Show moreLast updated: 15 days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    ConfidentialThiruvananthapuram / Trivandrum, India
    Site Reliability Engineering (SRE).Equifax is a discipline that combines software and systems engineering for building and running large-scale, distributed, fault-tolerant systems.SRE ensures that ...Show moreLast updated: 19 days ago
    • Promoted
    Site Reliability Engineer (SRE)

    Site Reliability Engineer (SRE)

    ConfidentialThiruvananthapuram / Trivandrum
    As a Site Reliability Engineer (SRE) you will be responsible for improving the overall reliability of applications by ensuring its availability, performance, and scalability.Should be able to gathe...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer - DevOps

    Site Reliability Engineer - DevOps

    Aim Plus Staffing SolutionsThiruvananthapuram
    Mandatory skills : We are seeking a highly skilled Site Reliability Engineer (SRE) with strong expertise in Google Cloud Platform (GCP) and CI / CD automation to lead cloud infra...Show moreLast updated: 13 days ago
    • Promoted
    Site Reliability Engineer (SRE) – Infrastructure & Automation

    Site Reliability Engineer (SRE) – Infrastructure & Automation

    InstaServiceThiruvananthapuram, IN
    InstaService is revolutionizing the home services industry through AI-driven technology, connecting customers with trusted professionals instantly. We’re growing fast across 23+ states and expanding...Show moreLast updated: 13 days ago
    • Promoted
    • New!
    Senior Site Reliability Engineer (C / Python)

    Senior Site Reliability Engineer (C / Python)

    EntechThiruvananthapuram, Republic Of India, IN
    Senior Software Site Reliability Engineer (C# / Python).You’ll ensure enterprise systems are reliable, scalable, and performant - driving improvements, leading SRE initiatives, and mentoring teams on...Show moreLast updated: 5 hours ago
    • Promoted
    Senior Workspace Support Engineer

    Senior Workspace Support Engineer

    ConfidentialThiruvananthapuram, Thiruvananthapuram / Trivandrum, India
    Join SADA, an Insight Company as a Senior Workspace Support Engineer!.As a Sr Workspace Support Engineer at SADA, you will ensure our customers' support issues are handled effectively.You will work...Show moreLast updated: 7 days ago
    • Promoted
    • New!
    Senior Site Reliability Engineer (C# / Python)

    Senior Site Reliability Engineer (C# / Python)

    EntechKollam, IN
    Senior Software Site Reliability Engineer (C# / Python).You’ll ensure enterprise systems are reliable, scalable, and performant - driving improvements, leading SRE initiatives, and mentoring teams on...Show moreLast updated: 14 hours ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    IntraEdgeKollam, IN
    Strong leadership and people management skills.Exceptional technical proficiency in Pearson's technology stack.Strategic thinking with a focus on long-term operational excellence.Champion operation...Show moreLast updated: 28 days ago
    • Promoted
    Senior DevOps & Database Reliability Engineer – 100% Remote

    Senior DevOps & Database Reliability Engineer – 100% Remote

    Hyly.AIThiruvananthapuram, IN
    Remote
    AI, we’re building the first AI + Data Fabric for the multifamily industry, transforming how clients manage, secure, and scale their marketing and operational data. As the industry moves toward a co...Show moreLast updated: 7 days ago
    • Promoted
    • New!
    Principal Reliability Engineer

    Principal Reliability Engineer

    PeoplefyThiruvananthapuram, Republic Of India, IN
    We’re looking for an SRE who can.Define SLIs / SLOs for Tier-0 / Tier-1 services & review quarterly.Change gating via CI / CD based on error budgets. Azure Monitor / Grafana / Prometheus / App Insights da...Show moreLast updated: 21 hours ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    CodeKarmaKollam, Republic Of India, IN
    About Insta Service Insta Service is revolutionizing the home services industry through AI-driven technology, connecting customers with trusted professionals instantly. We’re growing fast across 23+...Show moreLast updated: 2 days ago
    • Promoted
    Senior Deployment Engineer

    Senior Deployment Engineer

    ConfidentialThiruvananthapuram, Thiruvananthapuram / Trivandrum, India
    Join SADA, An Insight Company as a Senior Workspace Deployment Engineer!.As a Senior Workspace Deployment Engineer at SADA, you will work collaboratively with architects, change managers, and proje...Show moreLast updated: 19 days ago