Talent.com
This job offer is not available in your country.
Senior Site Reliability Engineer - IAC Terraform

Senior Site Reliability Engineer - IAC Terraform

EMBARKBangalore
30+ days ago
Job description

Job Description :

Key Responsibilities :

SRE & Application Reliability :

  • Implement and tune SLOs / SLIs, build reliability dashboards, and respond to incidents using Grafana IRM, JSM, and escalation workflows.
  • Monitor application performance and availability across Kubernetes clusters using Grafana, Prometheus, Loki, Mimir, and Tempo.
  • Participate in on-call rotation, postmortems, and continual improvement processes.

Application Support & Troubleshooting :

  • Act as the primary escalation point for production issues whether internal or client-facing.
  • Monitor logs, traces, and alerts to proactively identify and resolve incidents.
  • Debug issues across the stack : Kubernetes, Helm releases, application logs, API errors, database bottlenecks.
  • Coordinate with development, QA, and client teams to ensure timely and effective resolution of issues.
  • DevOps & Infrastructure Automation :

  • Implement GitOps workflows using FluxCD and ArgoCD to manage Kubernetes deployments.
  • Manage and maintain infrastructure-as-code using Terraform, Terragrunt, and Azure (Preferred).
  • Automate CI / CD pipelines with GitHub Actions for Docker image builds, Helm-based deployments, release tagging, etc.
  • Post-QA & Release Validation :

  • Work closely with QA engineers to validate release branches, tag images, and verify integration across services.
  • Test application functionality post deployments (sanity and product functional tests).
  • Assist in defining performance benchmarks (e.g., pgBench for PostgreSQL clusters) and validate pre-
  • production Qualifications :

  • 6- 8 years of experience in DevOps, SRE, or Production Support roles.
  • Strong hands-on experience with Azure and Kubernetes (AKS preferred) and Helm / Kustomize.
  • Solid knowledge of GitHub Actions, GitOps (FluxCD / ArgoCD), and Terraform / Terragrunt.
  • Experience with monitoring / logging stacks : Grafana, Prometheus, Loki, Tempo, Mimir, and Incident Response tools.
  • Experience debugging microservices written in Node.js, Go, or similar.
  • Excellent troubleshooting and debugging skills across the stack.
  • (ref : hirist.tech)

    Create a job alert for this search

    Senior Site Reliability Engineer • Bangalore

    Related jobs
    Site Reliability Engineer

    Site Reliability Engineer

    AIONBengaluru, KA, IN
    Quick Apply
    AION is building the next generation of AI cloud platform by transforming the future of high-performance computing (HPC) through its decentralized AI cloud. Purpose-built for bare-metal performance,...Show moreLast updated: 30+ days ago
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    ScaleneWorksBengaluru, Karnataka, India
    Quick Apply
    Experience in C++ / Java : if one of the two it is ok.Knowledge of cloud would be appreciated.Knowledge of software development life cycle : nice to have. Has working experience and advanced and speci...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    BayOne Solutionshosur, tamil nadu, in
    Role : Site Reliability Engineer.The CXE Site Reliability Engineering (SRE) team manages the CI / CD pipelines and cloud infrastructure, ensuring seamless deployment, monitoring, and maintenance.Howev...Show moreLast updated: 9 hours ago
    • Promoted
    Senior Site Reliability Engineer- ELK Expert

    Senior Site Reliability Engineer- ELK Expert

    iVedha Inc.hosur, tamil nadu, in
    Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone. Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 30+ days ago
    • Promoted
    LSEG - Site Reliability Engineer

    LSEG - Site Reliability Engineer

    REFINITIV INDIA SHARED SERVICES PRIVATE LIMITEDBangalore
    LSEG is a leading global financial markets infrastructure and data provider.Our purpose is driving financial stability, empowering economies and enabling customers to create sustainable growth.Our ...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    ElgebraBangalore
    Role Overview : We are seeking a highly experienced and technically proficient Site Reliability Engineer (SRE) to join our team in support of our c...Show moreLast updated: 4 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Core Minds Tech SOlutionsHosur
    Job Description : - Engage with our product teams to understand requirements, design, and implement resilient and scalable infrastructure solutions&l...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    WSO2Bengaluru, Karnataka, India
    Founded in 2005, WSO2 is the largest independent software vendor providing open-source API management, integration, and identity and access management (IAM) to thousands of enterprises in over 90 c...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    TavantBengaluru, Karnataka, India
    With 25+ years of experience building innovative digital products and solutions, Tavant provides impactful results to its customers. It has been the frontrunner in driving digital innovation and tec...Show moreLast updated: 26 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    XebiaBengaluru, Karnataka, India
    AWS DevOps Engineer with strong expertise in Observability and Site Reliability Engineering (SRE).The role requires hands-on experience with AWS services, Infrastructure as Code (IaC), CI / CD, monit...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    ExasoftBangalore, IN
    Responsibilities and Requirements : .Experience must be at least 10+ years in SRE.Multi Cloud, Hybrid Cloud – on Data center sites. Experience with multiple operating systems (.Operating Systems, Kern...Show moreLast updated: 12 hours ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    WhiteLotus Talent PartnersBengaluru, Karnataka, India
    L0 and L1 Site Reliability Engineer (SRE) Support.Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by. In this role, you will focu...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    EmbarkGCCbangalore, karnataka, in
    Senior Site Reliability Engineer (SRE) – Job Description.Implement and tune SLOs / SLIs, build reliability dashboards, and respond to incidents using Grafana IRM, JSM, and escalation workflows.Monito...Show moreLast updated: 26 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Uplershosur, tamil nadu, in
    Uplers is hiring for one of the clients.SRE (Oracle Cloud Infrastructure).Remote | Mon–Fri | 10 : 30 AM – 7 : 30 PM IST.Use of personal device required. OCI cloud infrastructure using Terraform and GitL...Show moreLast updated: 24 days ago
    • Promoted
    Senior Site Reliability Engineer [T500-20117]

    Senior Site Reliability Engineer [T500-20117]

    Delta Air LinesBengaluru, Karnataka, India
    Delta Air Lines (NYSE : DAL) is the U.Powered by our employees around the world, Delta has for a decade led the airline industry in operational excellence while maintaining our reputation for award-...Show moreLast updated: 19 days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    ViewSonicBengaluru, Karnataka, India
    At ViewSonic Technologies, we’re passionate about building software that solves problems.We count on our site reliability engineers (SREs) to empower users with a rich feature set, high availabilit...Show moreLast updated: 30+ days ago
    • Promoted
    Sr. Site Reliability Engineer [T500-20179]

    Sr. Site Reliability Engineer [T500-20179]

    Delta Air Linesbangalore, karnataka, in
    Delta Air Lines (NYSE : DAL) is the U.Powered by our employees around the world, Delta has for a decade led the airline industry in operational excellence while maintaining our reputation for award-...Show moreLast updated: 18 days ago
    • Promoted
    Site Reliability Engineer - Chaos Management

    Site Reliability Engineer - Chaos Management

    Xebiahosur, tamil nadu, in
    AWS Engineer with strong Python development and Chaos Engineering expertise.The ideal candidate will combine cloud engineering, DevOps, and chaos experimentation to improve reliability, fault toler...Show moreLast updated: 8 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Amicon Hub ServicesBengaluru, Karnataka, India
    Key Responsibilities Manage and scale production systems hosted on Google Cloud Platform (GCP) Implement SRE best practices : monitoring, alerting, SLAs, SLOs, and error budgets Automate operatio...Show moreLast updated: 6 days ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    ACL DigitalBengaluru, Karnataka, India
    Service Management : Maintain application uptime / performance, manage system enhancements and defects, oversee daily operational activities, and ensure continuous improvement and adherence to ITIL be...Show moreLast updated: 7 hours ago