Talent.com
Site Reliability Engineer - Elastic Kubernetes Service

MNR Solutions • Chennai
30+ days ago
Job description

Description :

Site Reliability Engineer (SRE) - Kubernetes & Cloud

Position Summary :

We are seeking a highly skilled Site Reliability Engineer (SRE) with deep expertise in Kubernetes and cloud technologies (AWS, Azure, or GCP). The SRE will be responsible for designing, deploying, automating, and supporting highly available, scalable, and secure containerized applications in cloud-native environments. You will work closely with development, operations, and security teams to ensure the reliability, performance, and efficiency of our production systems.

Key Responsibilities :

  • Design, deploy, and manage Kubernetes clusters (on-premises and/or cloud-managed, such as EKS, AKS, or GKE) to support scalable microservices architectures.
  • Automate infrastructure provisioning and application deployment using Infrastructure as Code (IaC) tools such as Terraform, Helm, or CloudFormation.
  • Monitor, troubleshoot, and optimize system performance using observability tools (Prometheus, Grafana, ELK, Datadog, etc.).
  • Implement and manage CI/CD pipelines to ensure rapid, repeatable, and reliable software delivery.
  • Ensure system reliability, availability, and security through proactive monitoring, incident response, and root cause analysis.
  • Develop and maintain runbooks, dashboards, and documentation for operational procedures and system architectures.
  • Participate in on-call rotations and respond to production incidents, ensuring minimal downtime and fast recovery.
  • Collaborate with development and operations teams to drive DevOps and SRE best practices, including capacity planning, scaling, and cost optimization.
  • Continuously improve automation, tooling, and processes to reduce manual work and increase system reliability.

Required Skills & Experience :

  • 3+ years of experience as an SRE, DevOps Engineer, or in a similar role supporting large-scale, production-grade environments.
  • Expertise in Kubernetes (deployment, scaling, upgrades, troubleshooting, networking, RBAC, etc.).
  • Hands-on experience with at least one major cloud provider: AWS, Azure, or GCP.
  • Proficiency in scripting/programming (Python, Bash, Go, etc.).
  • Experience with IaC tools (Terraform, Helm, CloudFormation, ARM, etc.).
  • Strong knowledge of Linux systems administration and networking concepts.
  • Familiarity with monitoring, logging, and alerting tools (Prometheus, Grafana, ELK/EFK, Datadog, etc.).
  • Experience with CI/CD tools (Jenkins, GitLab CI, ArgoCD, etc.).
  • Understanding of security best practices in cloud and containerized environments.
  • Excellent troubleshooting and problem-solving skills.
  • Strong communication and collaboration skills.

Preferred Qualifications :

  • Certified Kubernetes Administrator (CKA) or similar certification.
  • Experience with service mesh (Istio, Linkerd), ingress controllers, and API gateways.

(ref : hirist.tech)
