Talent.com
Site Reliability Engineer - Cloud Solutions

SMARTWORK IT SERVICES, Hyderabad
5 days ago
Job description

Role : Site Reliability Engineer (SRE).

Location : Hyderabad.

Experience : 10 to 15 Years.

Job Summary :

The Site Reliability Engineer (SRE) will play a critical role in ensuring the reliability, scalability, and performance of Citizens Bank's enterprise systems and cloud environments.

The ideal candidate brings deep technical expertise across multi-cloud platforms, automation, observability, and incident management, and drives reliability engineering practices and operational excellence in a complex financial services environment.

Key Responsibilities :

  • Manage and support cloud-based solutions across AWS, Azure, GCP, and other IaaS / PaaS / SaaS / CDN environments.
  • Design, implement, and maintain reliable, scalable, and secure infrastructure, ensuring high availability and performance.
  • Collaborate with DevOps and security teams to implement DevSecOps workflows using Git, Jenkins, Docker, and Kubernetes (EKS / AKS).
  • Automate infrastructure and configuration management using Terraform, Ansible, and scripting languages like Python, Bash, or PowerShell.
  • Analyze traffic flows, system logs, and application events to troubleshoot issues and identify interdependencies across systems.
  • Utilize monitoring and observability tools such as DataDog, Splunk, and CloudWatch for proactive system health management.
  • Implement on-call support processes, develop and maintain runbook documentation, and work toward full automation of repetitive tasks.
  • Collaborate with other SREs to build resilient systems and promote Site Reliability Engineering best practices across the enterprise.
  • Handle critical application outages, perform root cause analysis, and drive incident resolution and preventive measures.
  • Work within an Agile environment, partnering with cross-functional teams to continuously improve performance and reliability.

Technical Skills Required :

  • Cloud Platforms : AWS, Azure, GCP.
  • DevOps / DevSecOps Tools : Jenkins, Git, Docker, Kubernetes (EKS, AKS).
  • Infrastructure as Code (IaC) : Terraform, Ansible.
  • Monitoring & Logging : DataDog, Splunk, CloudWatch.
  • Scripting : Python, Bash, PowerShell.
  • Networking : TCP / IP, DNS, HTTP, Load Balancing, Routing.
  • OS Environments : Linux, Windows Server.
  • Familiarity with AMI builds, patching, and rehydration processes.

Core Competencies :

  • Strong analytical and troubleshooting skills.
  • Proven ability to drive incident response and post-incident reviews.
  • Excellent communication and stakeholder management.
  • Ability to collaborate in global, distributed teams.
  • Focus on automation, resilience, and continuous improvement.

(ref : hirist.tech)
