Talent.com
Site Reliability Engineer (SRE)

Site Reliability Engineer (SRE)

ConfidentialHyderabad / Secunderabad, Telangana, Chennai, Pune
30+ days ago
Job description

We are seeking a highly skilled Site Reliability Engineer (SRE) to maintain the stability of our software product throughout its entire development lifecycle. You will be responsible for measuring and monitoring the system's general state, analyzing incident data, automating monitoring processes, and developing frameworks and scripts to enhance product stability and reliability.

Roles & Responsibilities :

  • Maintain the stability of the software product throughout the entire software development process.
  • Measure and monitor the general state of the system on all mediums using tools like DataDog, GCP Matrix / Platform, and Grafana .
  • Run the collection and analysis of data from incidents and post-mortems to identify root causes and preventive measures.
  • Utilize various tools to automate the monitoring of the software system.
  • Identify new instruments and technologies to develop and streamline the stability of the product.
  • Develop monitoring and testing frameworks, solutions, or scripts in various programming languages.
  • Ensure to maintain and run tests in order to ensure the reliability and stability of the product.
  • Apply knowledge of Kubernetes for managing containerized applications and deployments.
  • Contribute to defining and achieving SLO / SLAs (Service Level Objectives / Agreements) and managing Error Budgeting .
  • Optimize deployment processes for efficiency and reliability.

Skills Required :

  • Proficiency with monitoring tools such as DataDog, GCP Matrix / Platform, and Grafana .
  • Experience with Kubernetes .
  • Strong understanding of deployment processes .
  • Knowledge of SLO / SLAs, Error Budgeting , and related tools.
  • Ability to measure and monitor system state effectively.
  • Experience in collecting and analyzing data from incidents and post-mortems.
  • Proficiency in automating software system monitoring.
  • Capability to identify and implement new instruments for product stability.
  • Experience in developing monitoring and testing frameworks, solutions, or scripts in various programming languages.
  • Strong commitment to maintaining and running tests for product reliability and stability.
  • QUALIFICATION :

  • Bachelor's degree in Computer Science, Information Technology, or a related engineering field, or equivalent practical experience.
  • Skills Required

    Datadog, Grafana, Kubernetes, System Monitoring

    Create a job alert for this search

    Site Reliability Engineer • Hyderabad / Secunderabad, Telangana, Chennai, Pune

    Related jobs
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    AutoRABITHyderabad, Telangana, India
    AutoRABIT is the leader in DevSecOps for SaaS platforms such as Salesforce.Its unique metadata-aware capability makes Release Management, Version Control, and Backup & Recovery complete, reliable, ...Show moreLast updated: 30+ days ago
    SRE(Site Reliability Engineer)

    SRE(Site Reliability Engineer)

    Talent WorxHyderabad, TS, IN
    Quick Apply
    SRE (Site Reliability Engineer).Talent Worx is seeking a talented SRE (Site Reliability Engineer) to enhance our technology team. In this role, you will be pivotal in ensuring the reliability, perfo...Show moreLast updated: 30+ days ago
    • Promoted
    Sr Engineer, Site Reliability Engineer [T500-20464]

    Sr Engineer, Site Reliability Engineer [T500-20464]

    TMUS Global SolutionsHyderabad, Telangana, India
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 29 days ago
    • Promoted
    Sr Engineer, Site Reliability [T500-21295]

    Sr Engineer, Site Reliability [T500-21295]

    TMUS Global SolutionsHyderabad, Telangana, India
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 3 days ago
    • Promoted
    Sr Engineer, Site Reliability [T500-20463]

    Sr Engineer, Site Reliability [T500-20463]

    TMUS Global SolutionsHyderabad, India
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 29 days ago
    • Promoted
    Lead Site Reliability Engineer

    Lead Site Reliability Engineer

    AutoRABITHyderabad, Republic Of India, IN
    AutoRABIT is the leader in DevSecOps for SaaS platforms such as Salesforce.Its unique metadata-aware capability makes Release Management, Version Control, and Backup & Recovery complete, reliable, ...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    IntraEdgeHyderabad, IN
    Strong leadership and people management skills.Exceptional technical proficiency in Pearson's technology stack.Strategic thinking with a focus on long-term operational excellence.Champion operation...Show moreLast updated: 17 days ago
    • Promoted
    Site Reliability Engineer (SRE) – Infrastructure & Automation

    Site Reliability Engineer (SRE) – Infrastructure & Automation

    InstaServiceHyderabad, IN
    InstaService is revolutionizing the home services industry through AI-driven technology, connecting customers with trusted professionals instantly. We’re growing fast across 23+ states and expanding...Show moreLast updated: 2 days ago
    • Promoted
    Sr Engineer, Site Reliability [T500-20425]

    Sr Engineer, Site Reliability [T500-20425]

    TMUS Global SolutionsHyderabad, India
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 29 days ago
    • Promoted
    Sr Engineer, Site Reliability Engineer T500-20464

    Sr Engineer, Site Reliability Engineer T500-20464

    TMUS Global SolutionsHyderabad, Republic Of India, IN
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 29 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    ConfidentialHyderabad / Secunderabad, Telangana, India
    Must be able to join within 30 days or less!.An employer is looking for an SRE to join their enterprise level SRE team.They are building a specialized team of Senior Site Reliability Engineers to a...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CapgeminiHyderabad, IN
    Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues...Show moreLast updated: 14 days ago
    • Promoted
    Sr Engineer, Site Reliability [T500-20279]

    Sr Engineer, Site Reliability [T500-20279]

    TMUS Global Solutionshyderabad, telangana, in
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 30+ days ago
    • Promoted
    Sr Engineer, Site Reliability [T500-20439]

    Sr Engineer, Site Reliability [T500-20439]

    TMUS Global SolutionsHyderabad, India
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America's supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 29 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    InfosysHyderabad, Republic Of India, IN
    We are seeking a skilled and motivated Site Reliability Engineer with hands-on expertise.DevOps tools, and SRE principles. Provide production support for Production applications, ensuring the stabil...Show moreLast updated: 18 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CitNOW GroupHyderabad, IN
    Founded in 2008, CitNOW is an innovative, enterprise-level software product suite that allows automotive dealerships globally to sell more vehicles and parts more profitably.CitNOW’s app-based plat...Show moreLast updated: 3 days ago
    • Promoted
    Site Reliability Engineer [T500-21132]

    Site Reliability Engineer [T500-21132]

    InspireHyderabad, Telangana, India
    About Inspire Brands : Inspire Brands is disrupting the restaurant industry through digital transformation and operational efficiencies. The company’s technology hub, Inspire Brands Hyderabad Suppor...Show moreLast updated: 4 days ago
    • Promoted
    Engineer, Site Reliability [T500-20515]

    Engineer, Site Reliability [T500-20515]

    TMUS Global SolutionsHyderabad, Telangana, India
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 29 days ago