Talent.com
Site Reliability Engineer - Elastic Kubernetes Service
Site Reliability Engineer - Elastic Kubernetes ServiceMNR Solutions • Chennai
Site Reliability Engineer - Elastic Kubernetes Service

Site Reliability Engineer - Elastic Kubernetes Service

MNR Solutions • Chennai
30+ days ago
Job description

Description :

Site Reliability Engineer (SRE) Kubernetes & Cloud

Position Summary :

We are seeking a highly skilled Site Reliability Engineer (SRE) with deep expertise in Kubernetes and cloud technologies (AWS, Azure, or GCP). The SRE will be responsible for designing, deploying, automating, and supporting highly available, scalable, and secure containerized applications in cloud-native environments. You will work closely with development, operations, and security teams to ensure the reliability, performance, and efficiency of our production systems.

Key Responsibilities :

  • Design, deploy, and manage Kubernetes clusters (on-premises and / or cloud-managed such as EKS, AKS, GKE) to support scalable microservices architectures.
  • Automate infrastructure provisioning and application deployment using Infrastructure as Code (IaC) tools such as Terraform, Helm, or CloudFormation.
  • Monitor, troubleshoot, and optimize system performance using observability tools (Prometheus, Grafana, ELK, Datadog, etc.).
  • Implement and manage CI / CD pipelines to ensure rapid, repeatable, and reliable software delivery.
  • Ensure system reliability, availability, and security through proactive monitoring, incident response, and root cause analysis.
  • Develop and maintain runbooks, dashboards, and documentation for operational procedures and system architectures.
  • Participate in on-call rotations and respond to production incidents, ensuring minimal downtime and fast recovery.
  • Collaborate with development and operations teams to drive DevOps and SRE best practices, including capacity planning, scaling, and cost optimization.
  • Continuously improve automation, tooling, and processes to reduce manual work and increase system reliability.

Required Skills & Experience :

  • 3+ years experience as an SRE, DevOps Engineer, or similar role supporting large-scale, production-grade environments.
  • Expertise in Kubernetes (deployment, scaling, upgrades, troubleshooting, networking, RBAC, etc.).
  • Hands-on experience with at least one major cloud provider : AWS, Azure, or GCP.
  • Proficiency in scripting / programming (Python, Bash, Go, etc.).
  • Experience with IaC tools (Terraform, Helm, CloudFormation, ARM, etc.).
  • Strong knowledge of Linux systems administration and networking concepts.
  • Familiarity with monitoring, logging, and alerting tools (Prometheus, Grafana, ELK / EFK, Datadog, etc.).
  • Experience with CI / CD tools (Jenkins, GitLab CI, ArgoCD, etc.).
  • Understanding of security best practices in cloud and containerized environments.
  • Excellent troubleshooting and problem-solving skills.
  • Strong communication and collaboration skills.
  • Preferred Qualifications :

  • Certified Kubernetes Administrator (CKA) or similar certification.
  • Experience with service mesh (Istio, Linkerd), ingress controllers, and API gateways.
  • (ref : hirist.tech)

    Create a job alert for this search

    Site Reliability Engineer • Chennai

    Related jobs
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Poshmark • Chennai, Tamil Nadu, India
    We’re looking for an experienced Site Reliability Engineer to fill the mission-critical role of ensuring that our complex, web-scale systems are healthy, monitored, automated, and designed to scale...Show more
    Last updated: 30+ days ago • Promoted
    Cloud Site Reliability Engineer

    Cloud Site Reliability Engineer

    Ford Motor • Chennai, Tamil Nadu, India
    Be at the Forefront of Mobilitys Future : Join Ford as a Site Reliability Engineer!.Enterprise Technology is the engine driving the future of transportation and were looking for a talented Site Reli...Show more
    Last updated: 14 days ago • Promoted
    Senior Cloud Engineer, Site Reliability Engineering

    Senior Cloud Engineer, Site Reliability Engineering

    Kinaxis • Chennai, Tamil Nadu, India
    Elevate your career journey by embracing a new challenge with Kinaxis.We are experts in tech but its really our people who give us passion to always seek ways to do things better.As such were serio...Show more
    Last updated: 23 days ago • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Miratech • Chennai, Tamil Nadu, India
    Join us in revolutionizing customer experiences with our client a global leader in cloud contact center software.Senior Site Reliability Engineer. You will design dashboards work with observability ...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Capgemini • Chennai, IN
    Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Intellistaff Services Pvt. Ltd • Chennai, Tamil Nadu, India
    Role : Cloud Engineer - SRE Experience : 6+ Location : Chennai Fulltime - Hybrid Required Skills : - 6+ years' experience SRE Public Cloud & Cloud Engineering - GCP experience (preferred) - Docker...Show more
    Last updated: 17 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Pagos Consultants • Chennai, IN
    This team will play a pivotal role in spearheading innovation.As such, you will have the opportunity to shape the early architecture and design of the system and set the trajectory for its future d...Show more
    Last updated: 7 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Tata Consultancy Services • Chennai, Tamil Nadu, India
    Role : Site Reliability Engineer.Location : Chennai / Bangalore / Hyderabad.Exposure to any APM tool like Dynatrace, Appdynamics, Splunk, etc. Gremlin or Chaos Monkey or Simian Army or Litmus expertise.Ex...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    NielsenIQ • Chennai, Tamil Nadu, India
    NIQ Activate is the leading provider of AI-powered customer analytics personalization and brand collaboration platform.Serving dozens of retailers and brands across the world using cutting edge big...Show more
    Last updated: 30+ days ago • Promoted
    Senior Site Reliability Engineer (Middleware)

    Senior Site Reliability Engineer (Middleware)

    Nextiva • Chennai, Tamil Nadu, India
    Redefine the future of customer experiences.At Nextiva were reimagining how businesses connect bringing together customer experience and team collaboration on a single conversation centric platform...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Datum Technologies Group • Chennai, Tamil Nadu, India
    Job Title : Site Reliability Engineer (SRE) – AWS.AWS, Terraform, Kubernetes, Docker, Grafana, Prometheus, Datadog.We are looking for a skilled Site Reliability Engineer (SRE) with strong AWS experi...Show more
    Last updated: 21 days ago • Promoted
    Lead Site Reliability Engineer

    Lead Site Reliability Engineer

    Datum Technologies Group • Chennai, Tamil Nadu, India
    Job Details : Job Title : Lead Site Reliability Engineer (SRE) Duration : Contract to Hire (On the Payroll of Datum Technology Group) Location : Chennai || Mumbai || Gurugram Interview Process : Vir...Show more
    Last updated: 2 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Insight Global • Chennai, IN
    Contract with Insight Global Client.Join our Site Reliability Engineering (SRE) team as a Fullstack Developer, focused on building and maintaining highly reliable, automated, and scalable systems.Y...Show more
    Last updated: 30+ days ago • Promoted
    Sr. Site Reliability Engineer (SRE)

    Sr. Site Reliability Engineer (SRE)

    Datum Technologies Group • Chennai, Tamil Nadu, India
    Site Reliability Engineer (SRE).Duration : Contract to Hire (On the Payroll of Datum Technology Group).Location : Chennai || Mumbai || Gurugram. Interview Process : Virtual (2 Rounds) + 1 Technical scr...Show more
    Last updated: 6 days ago • Promoted
    Lead Site Reliability Engineer (SRE)

    Lead Site Reliability Engineer (SRE)

    Datum Technologies Group • Chennai, Tamil Nadu, India
    Job Title : Lead Site Reliability Engineer (SRE).Duration : Contract to Hire (On the Payroll of Datum Technology Group).Location : Chennai || Mumbai || Gurugram. Interview Process : Virtual (2 Rounds) +...Show more
    Last updated: 6 days ago • Promoted
    Site Reliability Engineer (SRE) – Infrastructure & Automation

    Site Reliability Engineer (SRE) – Infrastructure & Automation

    InstaService • Chennai, Tamil Nadu, India
    About InstaService InstaService is revolutionizing the home services industry through AI-driven technology, connecting customers with trusted professionals instantly. We’re growing fast across 23+ ...Show more
    Last updated: 28 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Grootan Technologies • Chennai, Tamil Nadu, India
    About the Role We are seeking a skilled Site Reliability Engineer (SRE) with 4–5 years of hands-on experience to join our engineering team. In this role, you will be responsible for building and m...Show more
    Last updated: 20 days ago • Promoted
    Sr. Site Reliability Engineer

    Sr. Site Reliability Engineer

    Datum Technologies Group • Chennai, Tamil Nadu, India
    Job Details : Job Title : Sr.Site Reliability Engineer (SRE) Duration : Contract to Hire (On the Payroll of Datum Technology Group) Location : Chennai || Mumbai || Gurugram Interview Process : Virtu...Show more
    Last updated: 2 days ago • Promoted