Talent.com
Site Reliability Engineer - Elastic Kubernetes Service

Site Reliability Engineer - Elastic Kubernetes Service

MNR SolutionsChennai
3 days ago
Job description

Description :

Site Reliability Engineer (SRE) Kubernetes & Cloud

Position Summary :

We are seeking a highly skilled Site Reliability Engineer (SRE) with deep expertise in Kubernetes and cloud technologies (AWS, Azure, or GCP). The SRE will be responsible for designing, deploying, automating, and supporting highly available, scalable, and secure containerized applications in cloud-native environments. You will work closely with development, operations, and security teams to ensure the reliability, performance, and efficiency of our production systems.

Key Responsibilities :

  • Design, deploy, and manage Kubernetes clusters (on-premises and / or cloud-managed such as EKS, AKS, GKE) to support scalable microservices architectures.
  • Automate infrastructure provisioning and application deployment using Infrastructure as Code (IaC) tools such as Terraform, Helm, or CloudFormation.
  • Monitor, troubleshoot, and optimize system performance using observability tools (Prometheus, Grafana, ELK, Datadog, etc.).
  • Implement and manage CI / CD pipelines to ensure rapid, repeatable, and reliable software delivery.
  • Ensure system reliability, availability, and security through proactive monitoring, incident response, and root cause analysis.
  • Develop and maintain runbooks, dashboards, and documentation for operational procedures and system architectures.
  • Participate in on-call rotations and respond to production incidents, ensuring minimal downtime and fast recovery.
  • Collaborate with development and operations teams to drive DevOps and SRE best practices, including capacity planning, scaling, and cost optimization.
  • Continuously improve automation, tooling, and processes to reduce manual work and increase system reliability.

Required Skills & Experience :

  • 3+ years experience as an SRE, DevOps Engineer, or similar role supporting large-scale, production-grade environments.
  • Expertise in Kubernetes (deployment, scaling, upgrades, troubleshooting, networking, RBAC, etc.).
  • Hands-on experience with at least one major cloud provider : AWS, Azure, or GCP.
  • Proficiency in scripting / programming (Python, Bash, Go, etc.).
  • Experience with IaC tools (Terraform, Helm, CloudFormation, ARM, etc.).
  • Strong knowledge of Linux systems administration and networking concepts.
  • Familiarity with monitoring, logging, and alerting tools (Prometheus, Grafana, ELK / EFK, Datadog, etc.).
  • Experience with CI / CD tools (Jenkins, GitLab CI, ArgoCD, etc.).
  • Understanding of security best practices in cloud and containerized environments.
  • Excellent troubleshooting and problem-solving skills.
  • Strong communication and collaboration skills.
  • Preferred Qualifications :

  • Certified Kubernetes Administrator (CKA) or similar certification.
  • Experience with service mesh (Istio, Linkerd), ingress controllers, and API gateways.
  • (ref : hirist.tech)

    Create a job alert for this search

    Site Reliability Engineer • Chennai

    Related jobs
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CodeKarmachennai, India
    Site Reliability Engineer (Multi-Cloud Deployments).CodeKarma is redefining how engineering teams understand and evolve complex systems — bringing production context directly into the developer’s w...Show moreLast updated: 9 days ago
    • Promoted
    Site Reliability Engineer / Architect - CI / CD Pipeline

    Site Reliability Engineer / Architect - CI / CD Pipeline

    Cling Multi SolutionsChennai
    Job Description : Role : Site Reliability Engineer (SRE) Location : Bangalore / Chennai / Pune (Hybrid) Experience : 5+ y...Show moreLast updated: 16 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CapgeminiChennai, IN
    Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues...Show moreLast updated: 4 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Tata Consultancy ServicesChennai, Tamil Nadu, India
    TCS has been a great pioneer in feeding the fire of young techies like you.We are a global leader in the technology arena and there’s nothing that can stop us from growing together.Role : Digital : ...Show moreLast updated: 2 days ago
    • Promoted
    DevOps / Platform Engineer

    DevOps / Platform Engineer

    iVedha Inc.Chennai, IN
    Hiring a seasoned DevOps / Platform Engineer to drive automation, platform reliability, and robust.Design, deploy, and manage CI / CD pipelines and infrastructure automation, leveraging AI for.Implemen...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer- ELK Expert

    Senior Site Reliability Engineer- ELK Expert

    iVedha Inc.Chennai, IN
    Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone. Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 30+ days ago
    • Promoted
    Staff Site Reliability Engineer

    Staff Site Reliability Engineer

    PoshmarkChennai, Tamil Nadu, India
    We’re looking for an experienced.You will use your background as an operations generalist to work closely with our development teams from the early stages of design all the way through identifying ...Show moreLast updated: 7 days ago
    • Promoted
    Lead - Cloud Reliability Engineer

    Lead - Cloud Reliability Engineer

    Searce Incchennai, tamil nadu, in
    The ‘process-first’ AI-native modern tech consultancy that's rewriting the rules.As an engineering-led consultancy, we are dedicated to relentlessly improving the real business outcomes.Our solvers...Show moreLast updated: 30+ days ago
    • Promoted
    Deployment Engineer

    Deployment Engineer

    Tata Consultancy ServicesChennai, India
    Experience in deploying complex, multi-component systems, preferably in cloud environments (Azure, AWS).Deep understanding of AI models, especially LLMs, and the infrastructure required to support ...Show moreLast updated: 4 days ago
    • Promoted
    Poshmark - Senior Site Reliability Engineer - Cloud Infrastructure

    Poshmark - Senior Site Reliability Engineer - Cloud Infrastructure

    POSHMARKChennai
    Job Description : Were looking for an experienced Site Reliability Engineer to fill the mission-critical role of ensuring that our complex, web-scale systems ...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    o9 Solutions, Inc.mount, India
    Be part of something revolutionary.At o9 Solutions, our mission is clear : be the Most Valuable Platform (MVP) for enterprises. With our AI-driven platform — the o9 Digital Brain — we integrate globa...Show moreLast updated: 9 days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    IntraEdgeChennai, IN
    Strong leadership and people management skills.Exceptional technical proficiency in Pearson's technology stack.Strategic thinking with a focus on long-term operational excellence.Champion operation...Show moreLast updated: 7 days ago
    • Promoted
    Sr DevOps Engineer

    Sr DevOps Engineer

    Nebula Tech Solutionschennai, tamil nadu, in
    DevOps / SRE professionals (5+ years).Kubernetes, monitoring / metrics, and coding.This is a role for engineers who thrive on. Kubernetes clusters (EKS / GKE / AKS).Jenkins, ArgoCD, FluxCD, Harness, GitHub ...Show moreLast updated: 12 days ago
    • Promoted
    Site Reliability Engineer 2

    Site Reliability Engineer 2

    ConfidentialChennai
    Work with team to plan, design and deploy new cloud technologies.Create, Maintain , and Enhance Automated Product Deployments. Develop, Modify, Support and maintain AWS based components through Infr...Show moreLast updated: 30+ days ago
    • Promoted
    Keuro Life - Senior Site Reliability Engineer - DevOps

    Keuro Life - Senior Site Reliability Engineer - DevOps

    Keuro LifeChennai
    Site Reliability Engineer / DevOps We are seeking an experienced Site Reliability Engineer / DevOps professional with a minimum of 6 years in the industry.The ideal c...Show moreLast updated: 30+ days ago
    • Promoted
    RELX - Site Reliability Engineer - IAC Terraform

    RELX - Site Reliability Engineer - IAC Terraform

    REED ELSEVIER INDIA (a part of RELX India Pvt Ltd)Chennai
    Job Description : - Lead initiatives to identify and eliminate manual, repetitive tasks through automation and tooling.Develop s...Show moreLast updated: 30+ days ago
    • Promoted
    Senior DevOps / Site Reliability Engineer

    Senior DevOps / Site Reliability Engineer

    Scoop Technologies Pvt LtdChennai
    Job Title : Senior DevOps Engineer / Site Reliability Engineer (SRE) Experience : 5 to 8 Years &...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Loyalytics AIChennai
    Site Reliability / DevOps Engineer to be our first hire in this function, responsible for owning and scaling the reliability, observability, and infrastructure of our platform running entirely on M...Show moreLast updated: 30+ days ago