Talent.com
Site Reliability Engineer - Elastic Kubernetes Service
Site Reliability Engineer - Elastic Kubernetes ServiceD2KSS • Hyderabad
Site Reliability Engineer - Elastic Kubernetes Service

Site Reliability Engineer - Elastic Kubernetes Service

D2KSS • Hyderabad
30+ days ago
Job description

Description :

Key Responsibilities :

  • Manage and maintain Kubernetes clusters (EKS) and ensure high system reliability and scalability.
  • Implement and manage AWS services including IAM, EC2, EKS, CloudWatch, and S3.
  • Build automation tools to enable self-healing and self-monitoring systems.
  • Develop and maintain monitoring solutions to track system performance and alert for low-latency applications.
  • Troubleshoot application-specific, network, system, and performance issues in real time.
  • Perform Linux debugging, performance tuning, and optimization for production systems.
  • Apply SRE principles monitoring, alerting, error budgets, fault analysis, capacity planning, and toil reduction.
  • Collaborate with cross-functional teams to improve reliability, performance, and deployment processes.

Must-Have Qualifications :

  • Bachelors degree in Computer Science or a related field.
  • Minimum 5+ years of experience in DevOps / Site Reliability Engineering roles.
  • Strong hands-on experience with Kubernetes and container orchestration.
  • In-depth knowledge of AWS services (IAM, EC2, EKS, CloudWatch, S3).
  • Proficiency in at least one programming / scripting language Python or Shell.
  • Excellent understanding of Linux systems, debugging tools, and performance tuning.
  • Strong problem-solving, troubleshooting, and analytical skills.
  • Ability to work collaboratively in a fast-paced, evolving technology environment.
  • Preferred Skills :

  • Experience with CI / CD pipelines and automation frameworks.
  • Familiarity with Infrastructure as Code (IaC) tools such as Terraform or CloudFormation.
  • Understanding of networking concepts, system architecture, and distributed systems.
  • Key Traits :

  • Strong ownership and accountability.
  • Excellent communication and collaboration skills.
  • Willingness to continuously learn and adapt to new technologies.
  • (ref : hirist.tech)

    Create a job alert for this search

    Site Reliability Engineer • Hyderabad

    Related jobs
    SRE (Site Reliability Engineer)

    SRE (Site Reliability Engineer)

    Tata Consultancy Services • Hyderabad, Republic Of India, IN
    Kubernetes (Any cloud) + PostgresSQL, SQL(Must).Linux (Optional), Java (Optional), Kubernetes (CLI), Prior Production support experience, Release Management, Prior Deployment experience,.Show more
    Last updated: 14 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Tata Consultancy Services • Hyderabad, Telangana, India
    GKE(Preferable); Kubernetes (Any cloud) + PostgresSQL, SQL(Must).Linux (Optional), Java (Optional) , Kubernetes (CLI), Prior Production support experience, Release Management, Prior Deployment expe...Show more
    Last updated: 30+ days ago • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    AutoRABIT Holding Inc. • Hyderabad, Telangana, IN
    Quick Apply
    AutoRABIT Profile AutoRABIT is the leader in DevSecOps for SaaS platforms such as Salesforce.Its unique metadata-aware capability makes Release Management, Version Control, and Backup & Recover...Show more
    Last updated: 30+ days ago
    Site Reliability Engineer

    Site Reliability Engineer

    Inspire Brands Hyderabad Support Center • Hyderabad, India
    Inspire Brands is disrupting the restaurant industry through digital transformation and operational efficiencies.The companys technology hub, Inspire Brands Hyderabad Support Center, India, will le...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer - Cloud and On-Premise Infrastructure

    Site Reliability Engineer - Cloud and On-Premise Infrastructure

    MathWorks • Hyderabad, Republic Of India, IN
    Would you like to join a team making a positive impact at MathWorks? IT Hosting is modernizing our infrastructure and the way we operate it. You will be responsible for designing, deploying, maintai...Show more
    Last updated: 12 days ago • Promoted
    Sr Engineer, Site Reliability

    Sr Engineer, Site Reliability

    TMUS Global Solutions • Hyderabad, India
    As a Senior Site Reliability Engineer, you will be a key member of the CFL Platform Engineering and Operations team you will play a pivotal role in building and scaling intelligent infrastructure t...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    LTIMindtree • Hyderabad, Telangana, India
    Hi Talent, If interested, kindly share your updated resume along with below details on - Pradyumn.Role - SRE Experience required 5 to 9 years Location - Hyderabad Mandatory to work all 5 days f...Show more
    Last updated: 30+ days ago • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Elios Talent • Hyderabad, Telangana, India
    Senior Site Reliability Engineer.Build, scale, and optimize cloud-native infrastructure powering global, high-availability platforms. Drive automation-first engineering across AWS, Terraform, CI / CD,...Show more
    Last updated: 10 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Capgemini • Hyderabad, IN
    Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Elios Talent • Hyderabad, Telangana, India
    Build, automate, and support cloud-native infrastructure powering high-availability platforms.Contribute to automation-first engineering across AWS, Terraform, CI / CD, and observability tooling.Impr...Show more
    Last updated: 10 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Pagos Consultants • Hyderabad, IN
    This team will play a pivotal role in spearheading innovation.As such, you will have the opportunity to shape the early architecture and design of the system and set the trajectory for its future d...Show more
    Last updated: 2 days ago • Promoted
    Site Reliability Engineer [T500-21132]

    Site Reliability Engineer [T500-21132]

    Inspire • Hyderabad, Telangana, India
    About Inspire Brands : Inspire Brands is disrupting the restaurant industry through digital transformation and operational efficiencies. The company’s technology hub, Inspire Brands Hyderabad Support...Show more
    Last updated: 25 days ago • Promoted
    Engineer, Site Reliability [T500-20266]

    Engineer, Site Reliability [T500-20266]

    TMUS Global Solutions • Hyderabad, Telangana, India
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    TMUS Global Solutions • Hyderabad, Republic Of India, IN
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show more
    Last updated: 30+ days ago • Promoted
    Engineer, Site Reliability

    Engineer, Site Reliability

    TMUS Global Solutions • Hyderabad, India
    Engineer reliability : Identify potential system issues early, implement preventive measures, and boost system resilience. Automate for speed : Build tools, pipelines, and scripts that eliminate manua...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer L1

    Site Reliability Engineer L1

    APTO SOLUTIONS - EXECUTIVE SEARCH & CONSULTANTS • Hyderabad, Telangana, India
    We’re Hiring | Site Reliability Engineer – L1.Build scalable & reliable systems through automation.Bridge between Dev & Ops teams for faster delivery. Work on CI / CD, Docker, Kubernetes, Jenkins, Git...Show more
    Last updated: 18 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    NationsBenefits India • Hyderabad, Telangana, India
    Site Reliability Engineer (SRE) | Fintech | Kubernetes | Datadog |.SRE team focused on maintaining the performance, reliability, and availability of our fintech platforms.Triage and resolve product...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    VXI Global Solutions • Hyderabad, Telangana, India
    We are looking for a Site Reliability Engineer with 3+ years for Experience into design, implement, and manage robust observability solutions across our cloud infrastructure and applications.The id...Show more
    Last updated: 30+ days ago • Promoted