Talent.com
Site Reliability Engineer
Site Reliability EngineerHRhelpdesk • Hyderabad, Republic Of India, IN
Site Reliability Engineer

Site Reliability Engineer

HRhelpdesk • Hyderabad, Republic Of India, IN
10 days ago
Job description

About the company : Company is a rapidly growing, private equity backed SaaS product company and provides cloud-based solutions.

Job Summary : As a Site Reliability Engineer (SRE), you will be responsible for building and maintaining the infrastructure, tools, and pipelines that keep our systems running smoothly. You will collaborate closely with DevOps, engineering, and product teams to design and deploy reliable, scalable, and automated systems. You will also improve the application code for user-facing bugs, ensuring enhanced performance and resilience.

RESPONSIBILITIES : Comfortable with work shift aligned with U.S. time zone (7 pm to 3 am IST)

1. CI / CD Pipeline Management :

  • Design, implement, and maintain robust CI / CD pipelines for automated software deployment.
  • Collaborate with DevOps and engineering teams to integrate testing, monitoring, and security checks into pipelines.
  • Continuously improve deployment processes to ensure smooth and error-free production releases.

2. Monitoring and Observability :

  • Create and manage comprehensive logging dashboards in Datadog to monitor system health, performance, and logs.
  • Set up alerting mechanisms to proactively identify and respond to system issues.
  • Analyze and visualize key performance metrics to drive improvements.
  • 3. Collaborate on Architectural Solutions :

  • Work closely with DevOps and engineering teams to design scalable, resilient, and secure infrastructure.
  • Ensure solutions adhere to best practices for performance, security, and maintainability.
  • 4. Code Optimization and Bug Fixing :

  • Improve application code to resolve user-facing bugs and enhance system resilience.
  • Troubleshoot and fix issues that impact the performance or availability of production systems.
  • Contribute to the continuous improvement of the codebase, focusing on optimizing performance and reliability.
  • 5. Automation and Continuous Improvement :

  • Automate repetitive tasks related to infrastructure management, monitoring, and troubleshooting.
  • Identify and propose innovative solutions to improve system efficiency and performance. 6. Custom Node.Js CLI Tool Development :
  • Develop and automate custom Node.Js CLI tools to enhance operational workflows and streamline repetitive tasks.
  • Implement automated solutions to optimize system processe
  • Requirements

    MUST HAVES :

  • Experience Level : 6-8 years
  • Comfortable with work shift aligned with U.S. time zone (7 pm to 3 am IST)
  • Prior experience working in cross-functional teams
  • Systems architecture and design skills
  • Proficiency in scripting languages such as Bash, Python, or PowerShell.
  • Experience with CI / CD tools such as Github Actions or similar platforms.
  • Build and deployment automation experience especially in a containerized world
  • Proficiency with common ops tools (ECS, Logstash, Datadog + Kibana, EKS etc)
  • Experience with AWS or Azure
  • Comfort maintaining live production systems
  • Strong communication and collaboration skills, with the ability to work effectively in a fast-paced team environment.
  • Create a job alert for this search

    Site Reliability Engineer • Hyderabad, Republic Of India, IN

    Related jobs
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Zyoin Group • Hyderabad
    Description : As the most senior technical individual contributor within an entire division of Engine...Show more
    Last updated: 26 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Infosys Finacle • Secunderabad, Telangana, India
    Role : DevSecOps Developer – Secure Coding & Automation Required Skills : 4 to 12 years of experience in building secure applications using any popular programming language like Java / Node.Strong...Show more
    Last updated: 1 day ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Prometheus consulting • Hyderabad
    WHAT YOU'LL DO : - Support, maintain, and enhance the reliability, scalability, and performance of our Azure-based Data Analytics Platform. Collaborate closely with Data En...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer (SRE)

    Site Reliability Engineer (SRE)

    HRhelpdesk • Secunderabad, Telangana, India
    About the company : Company is a rapidly growing, private equity backed SaaS product company and provides cloud-based solutions. Job Summary : As a Site Reliability Engineer (SRE), you will be re...Show more
    Last updated: 11 hours ago • Promoted • New!
    Site Reliability Engineer T500-21132

    Site Reliability Engineer T500-21132

    Inspire • Hyderabad, Republic Of India, IN
    Inspire Brands is disrupting the restaurant industry through digital transformation and operational efficiencies.The company’s technology hub, Inspire Brands Hyderabad Support Center, India, will l...Show more
    Last updated: 21 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Tata Consultancy Services • Hyderabad, Telangana, India
    GKE(Preferable); Kubernetes (Any cloud) + PostgresSQL, SQL(Must).Linux (Optional), Java (Optional) , Kubernetes (CLI), Prior Production support experience, Release Management, Prior Deployment expe...Show more
    Last updated: 30+ days ago • Promoted
    GCP Site Reliability Engineer

    GCP Site Reliability Engineer

    inTune Systems Inc • Hyderabad, Telangana, India
    We are looking for a Senior Site Reliability Engineer (SRE) to join our growing Engineering team.As an SRE, you will play a key role in ensuring the reliability, scalability, and performance of our...Show more
    Last updated: 20 hours ago • Promoted • New!
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    AutoRABIT • Hyderabad, Telangana, India
    AutoRABIT is the leader in DevSecOps for SaaS platforms such as Salesforce.Its unique metadata-aware capability makes Release Management, Version Control, and Backup & Recovery complete, reliable, ...Show more
    Last updated: 30+ days ago • Promoted
    Gcp Site Reliability Engineer

    Gcp Site Reliability Engineer

    inTune Systems Inc • Hyderabad, Republic Of India, IN
    We are looking for a Senior Site Reliability Engineer (SRE) to join our growing Engineering team.As an SRE, you will play a key role in ensuring the reliability, scalability, and performance of our...Show more
    Last updated: 17 hours ago • Promoted • New!
    Site Reliability Engineer [T500-21132]

    Site Reliability Engineer [T500-21132]

    Inspire • Hyderabad, Telangana, India
    Inspire Brands is disrupting the restaurant industry through digital transformation and operational efficiencies.The company’s technology hub, Inspire Brands Hyderabad Support Center, India, will l...Show more
    Last updated: 20 days ago • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Elios Talent • Hyderabad, Telangana, India
    Senior Site Reliability Engineer.Build, scale, and optimize cloud-native infrastructure powering global, high-availability platforms. Drive automation-first engineering across AWS, Terraform, CI / CD,...Show more
    Last updated: 5 days ago • Promoted
    Engineer, Site Reliability [T500-20266]

    Engineer, Site Reliability [T500-20266]

    TMUS Global Solutions • Hyderabad, Telangana, India
    About T-Mobile : T-Mobile US, Inc.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship b...Show more
    Last updated: 30+ days ago • Promoted
    Sr Engineer, Site Reliability T500-20425

    Sr Engineer, Site Reliability T500-20425

    TMUS Global Solutions • Hyderabad, Republic Of India, IN
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Elios Talent • Hyderabad, Telangana, India
    Build, automate, and support cloud-native infrastructure powering high-availability platforms.Contribute to automation-first engineering across AWS, Terraform, CI / CD, and observability tooling.Impr...Show more
    Last updated: 5 days ago • Promoted
    Engineer, Site Reliability T500-20266

    Engineer, Site Reliability T500-20266

    TMUS Global Solutions • Hyderabad, Republic Of India, IN
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    WhiteLotus Talent Partners • secunderabad, India
    L0 and L1 Site Reliability Engineer (SRE) Support.Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by. In this role, you will focu...Show more
    Last updated: 11 hours ago • Promoted • New!
    Site Reliability Engineer

    Site Reliability Engineer

    VXI Global Solutions • Hyderabad, Telangana, India
    We are looking for a Site Reliability Engineer with 3+ years for Experience into design, implement, and manage robust observability solutions across our cloud infrastructure and applications.The id...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    NationsBenefits India • Hyderabad, Telangana, India
    Site Reliability Engineer (SRE) | Fintech | Kubernetes | Datadog |.SRE team focused on maintaining the performance, reliability, and availability of our fintech platforms.Triage and resolve product...Show more
    Last updated: 30+ days ago • Promoted