Senior Site Reliability Engineer

EmbarkGCC, Bengaluru, Karnataka, India
26 days ago
Job description

Senior Site Reliability Engineer (SRE) – Job Description

Key Responsibilities

SRE & Application Reliability

  • Implement and tune SLOs/SLIs, build reliability dashboards, and respond to incidents using Grafana IRM, JSM, and escalation workflows.
  • Monitor application performance and availability across Kubernetes clusters using Grafana, Prometheus, Loki, Mimir, and Tempo.
  • Participate in on-call rotation, postmortems, and continual improvement processes.

Application Support & Troubleshooting

  • Act as the primary escalation point for production issues, whether internal or client-facing.
  • Monitor logs, traces, and alerts to proactively identify and resolve incidents.
  • Debug issues across the stack: Kubernetes, Helm releases, application logs, API errors, database bottlenecks.
  • Coordinate with development, QA, and client teams to ensure timely and effective resolution of issues.

DevOps & Infrastructure Automation

  • Implement GitOps workflows using FluxCD and ArgoCD to manage Kubernetes deployments.
  • Manage and maintain infrastructure-as-code using Terraform and Terragrunt, with Azure as the preferred cloud platform.
  • Automate CI/CD pipelines with GitHub Actions for Docker image builds, Helm-based deployments, release tagging, etc.

Post-QA & Release Validation

  • Work closely with QA engineers to validate release branches, tag images, and verify integration across services.
  • Test application functionality after deployments (sanity and product functional tests).
  • Assist in defining performance benchmarks (e.g., pgbench for PostgreSQL clusters) and validate pre-production readiness.

Must-Have Qualifications

  • 6–8 years of experience in DevOps, SRE, or Production Support roles.
  • Strong hands-on experience with Azure and Kubernetes (AKS preferred) and Helm/Kustomize.
  • Solid knowledge of GitHub Actions, GitOps (FluxCD/ArgoCD), and Terraform/Terragrunt.
  • Experience with monitoring/logging stacks (Grafana, Prometheus, Loki, Tempo, Mimir) and incident response tools.
  • Experience debugging microservices written in Node.js, Go, or similar.
  • Excellent troubleshooting and debugging skills across the stack.