Talent.com
Senior Site Reliability Engineer
Senior Site Reliability EngineerConfidential • Hyderabad / Secunderabad, Telangana
Senior Site Reliability Engineer

Senior Site Reliability Engineer

Confidential • Hyderabad / Secunderabad, Telangana
30+ days ago
Job description

Key Responsibilities :

  • Lead incident management , monitoring, and alerting processes to ensure timely detection and resolution of production issues.
  • Ensure reliability, availability, and performance of systems by defining and maintaining SLIs, SLOs, and SLAs.
  • Design and implement fault-tolerant, scalable architectures to minimize downtime and improve resiliency.
  • Develop automation and tooling for monitoring, incident remediation, and infrastructure management.
  • Participate in a 24x7 on-call rotation to manage production incidents and maintain system uptime.
  • Create and maintain SOPs and technical documentation for processes, tools, and incident management protocols.
  • Implement and manage Infrastructure as Code (IaC) using tools such as Terraform and Ansible to automate provisioning and deployments.
  • Work with cloud platforms —primarily AWS (EC2, S3, VPC, RDS, EKS, ECS, CloudWatch, CloudFormation)—to support scalable system operations.
  • Integrate and manage CI / CD pipelines using tools like Jenkins to enable seamless deployments.
  • Utilize monitoring and alerting tools (Datadog, Site24x7, Grafana, CloudWatch) to proactively identify issues.
  • Conduct performance tuning and optimization , addressing bottlenecks and improving efficiency.
  • Drive cost optimization strategies while maintaining performance and reliability standards.
  • Adhere to security best practices and ensure infrastructure compliance with organizational standards.
  • Collaborate with development, product, and security teams to enhance system reliability and service delivery.
  • Mentor junior engineers and promote a culture of reliability engineering across the organization.

Qualifications :

  • 5–8 years of experience in Site Reliability Engineering, DevOps, or Cloud Infrastructure roles.
  • Strong hands-on expertise with AWS (experience with GCP or Azure is a plus).
  • Proficiency in Infrastructure as Code (IaC) tools such as Terraform and Ansible .
  • Experience with monitoring and alerting tools including Datadog, Site24x7, Grafana, and CloudWatch.
  • Solid understanding of CI / CD tools such as Jenkins.
  • Proven ability in incident management, root cause analysis , and implementing long-term reliability improvements.
  • Familiarity with automation scripting (Python, Bash, or Shell scripting preferred).
  • Knowledge of security best practices , networking , and cloud cost management .
  • Excellent problem-solving, analytical, and collaboration skills.
  • AWS certification or equivalent cloud certification is an advantage.
  • Skills Required

    Aws, Rds, ECS, Vpc, Cloud, Ci

    Create a job alert for this search

    Senior Site Reliability Engineer • Hyderabad / Secunderabad, Telangana

    Related jobs
    Engineer, Site Reliability [T500-20517]

    Engineer, Site Reliability [T500-20517]

    TMUS Global Solutions • Hyderabad, Telangana, India
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show more
    Last updated: 30+ days ago • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Blue Spire Inc • Hyderabad, Republic Of India, IN
    We are seeking a highly skilled Senior L2 Ops Engineer to join our dynamic team.You will play a critical role in maintaining the stability, performance, and reliability of our systems through robus...Show more
    Last updated: 4 days ago • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Zyoin Group • Hyderabad
    Description : As the most senior technical individual contributor within an entire division of Engine...Show more
    Last updated: 16 days ago • Promoted
    Sr Engineer, Site Reliability Engineer [T500-20464]

    Sr Engineer, Site Reliability Engineer [T500-20464]

    TMUS Global Solutions • Hyderabad, Telangana, India
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show more
    Last updated: 30+ days ago • Promoted
    Lead Site Reliability Engineer

    Lead Site Reliability Engineer

    AutoRABIT • Hyderabad, Republic Of India, IN
    AutoRABIT is the leader in DevSecOps for SaaS platforms such as Salesforce.Its unique metadata-aware capability makes Release Management, Version Control, and Backup & Recovery complete, reliable, ...Show more
    Last updated: 30+ days ago • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    IntraEdge • Hyderabad, IN
    Strong leadership and people management skills.Exceptional technical proficiency in Pearson's technology stack.Strategic thinking with a focus on long-term operational excellence.Champion operation...Show more
    Last updated: 24 days ago • Promoted
    SRE (Site Reliability Engineer)

    SRE (Site Reliability Engineer)

    Tata Consultancy Services • Hyderabad, Republic Of India, IN
    Kubernetes (Any cloud) + PostgresSQL, SQL(Must).Linux (Optional), Java (Optional), Kubernetes (CLI), Prior Production support experience, Release Management, Prior Deployment experience,.Show more
    Last updated: 1 day ago • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    AutoRABIT • Hyderabad, Telangana, India
    AutoRABIT is the leader in DevSecOps for SaaS platforms such as Salesforce.Its unique metadata-aware capability makes Release Management, Version Control, and Backup & Recovery complete, reliable, ...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Tata Consultancy Services • Hyderabad, Telangana, India
    GKE(Preferable); Kubernetes (Any cloud) + PostgresSQL, SQL(Must).Linux (Optional), Java (Optional) , Kubernetes (CLI), Prior Production support experience, Release Management, Prior Deployment expe...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Senior Engineer

    Site Reliability Senior Engineer

    Confidential • Hyderabad / Secunderabad, Telangana, India
    Veeva Systems is a mission-driven organization and pioneer in industry cloud, helping life sciences companies bring therapies to patients faster. As one of the fastest-growing SaaS companies in hist...Show more
    Last updated: 15 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    HRhelpdesk • secunderabad, telangana, in
    Company is a rapidly growing, private equity backed SaaS product company and provides cloud-based solutions.As a Site Reliability Engineer (SRE), you will be responsible for building and maintainin...Show more
    Last updated: 1 day ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Capgemini • Hyderabad, IN
    Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues...Show more
    Last updated: 21 days ago • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Nebula Tech Solutions • secunderabad, telangana, in
    SRE team supporting mission-critical applications for our.We’re now looking for engineers who can go beyond operations — those who can. Enhance application reliability through code.Add or modify cod...Show more
    Last updated: 11 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Infosys • Hyderabad, Republic Of India, IN
    We are seeking a skilled and motivated Site Reliability Engineer with hands-on expertise.DevOps tools, and SRE principles. Provide production support for Production applications, ensuring the stabil...Show more
    Last updated: 25 days ago • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    TMUS Global Solutions • Hyderabad, Republic Of India, IN
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Foodsmart • Hyderabad, Republic Of India, IN
    Foodsmart is the leading telenutrition and foodcare solution, backed by a robust network of Registered Dietitians.Our platform is designed to foster healthier food choices, drive lasting behavior c...Show more
    Last updated: 30+ days ago • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    o9 Solutions, Inc. • secunderabad, India
    Be part of something revolutionary.At o9 Solutions, our mission is clear : be the Most Valuable Platform (MVP) for enterprises. With our AI-driven platform — the o9 Digital Brain — we integrate globa...Show more
    Last updated: 6 hours ago • Promoted • New!
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Confidential • Hyderabad / Secunderabad, Telangana, India
    As a senior site reliability engineer will work in our global organization to provide operational support for all Thomson Reuters products, including development tools and infrastructure used by en...Show more
    Last updated: 30+ days ago • Promoted