Platform Reliability Engineer

Elios Talent, Hyderabad, India
2 days ago
Job description

Site Reliability Engineer

Key Highlights

🛠️ Build, automate, and support cloud-native infrastructure powering high-availability platforms

⚡ Contribute to automation-first engineering across AWS, Terraform, CI/CD, and observability tooling

📊 Improve reliability, uptime, system health, and performance across production environments

🔐 Strengthen DevSecOps workflows—enhancing security, delivery consistency, and operational excellence

🚨 Support major incident response, troubleshoot complex issues, and ensure platform stability at scale

Position Overview

We are seeking a Site Reliability Engineer to enhance reliability, automation, and performance across cloud-based platforms. This role blends hands-on engineering, systems thinking, and strong cross-team collaboration within a modern DevOps environment.

You will help build scalable infrastructure, evolve observability, improve resiliency, contribute to incident response, and support secure delivery practices across engineering. This role is essential to supporting high-volume user traffic while ensuring systems remain stable, fast, and secure.

Key Responsibilities

Cloud Engineering

  • Deploy, maintain, and improve AWS environments using automation and Infrastructure-as-Code
  • Build tooling that increases predictability, stability, and delivery speed
  • Tune systems for scale, performance, and cost efficiency
  • Maintain reproducible and auditable infrastructure using Terraform
  • Monitor cloud spend and usage to support service-level objectives

Observability & Reliability

  • Support uptime, reliability, and performance monitoring across critical services
  • Participate in incident management and bridge calls during major events
  • Enhance telemetry tools (New Relic, CloudWatch, Datadog) for operational visibility
  • Apply data-driven insights to improve system stability and user experience
  • Help ensure architecture and deployment patterns meet reliability expectations

DevSecOps & Automation

  • Strengthen CI/CD pipelines and code-quality practices
  • Collaborate with Cybersecurity to remediate vulnerabilities via automation
  • Support secure, consistent, and scalable delivery workflows

Resiliency Engineering

  • Identify potential failure points and architectural risks
  • Support chaos testing and failure-injection exercises
  • Assist in capacity planning for seasonal or unexpected load spikes
  • Recommend improvements to infrastructure and services to increase resiliency

Leadership & Collaboration

  • Contribute to knowledge sharing and best practices across engineering
  • Collaborate closely with product, engineering, and security teams
  • Write clear documentation to improve operational readiness

Qualifications

  • Experience as a software or systems engineer with strong debugging skills
  • Hands-on experience with AWS and Terraform (required)
  • Experience with ECS; Kubernetes/EKS experience preferred
  • Proficiency in Python, Golang, Bash, or similar automation languages
  • CI/CD experience with GitHub Enterprise, Jenkins, CircleCI, or similar
  • Ability to troubleshoot across servers, networks, storage, and databases
  • Experience supporting production systems in cloud environments
  • Strong communication, analytical thinking, and incident response skills
  • BS in Computer Science or equivalent experience

About Us

We build reliable, scalable, and secure digital platforms that support global user experiences. Through automation, observability, cloud engineering, and resilient systems design, we help organizations operate confidently, innovate quickly, and maintain high-quality service delivery.

Why Join Us

Join an engineering organization where reliability, automation, and performance sit at the core of everything we do. You’ll work with modern cloud technologies, collaborate with high-performing teams, and contribute directly to systems that support millions of users. This is an opportunity to influence platform maturity, strengthen engineering practices, and grow as a reliability-focused engineer.
