Talent.com
Site Reliability Engineer (SRE)

Site Reliability Engineer (SRE)

ConfidentialMumbai, Kolkata, Delhi
30+ days ago
Job description

We are seeking a highly skilled Site Reliability Engineer (SRE) to maintain the stability of our software product throughout its entire development lifecycle. You will be responsible for measuring and monitoring the system's general state, analyzing incident data, automating monitoring processes, and developing frameworks and scripts to enhance product stability and reliability.

Roles & Responsibilities :

  • Maintain the stability of the software product throughout the entire software development process.
  • Measure and monitor the general state of the system on all mediums using tools like DataDog, GCP Matrix / Platform, and Grafana .
  • Run the collection and analysis of data from incidents and post-mortems to identify root causes and preventive measures.
  • Utilize various tools to automate the monitoring of the software system.
  • Identify new instruments and technologies to develop and streamline the stability of the product.
  • Develop monitoring and testing frameworks, solutions, or scripts in various programming languages.
  • Ensure to maintain and run tests in order to ensure the reliability and stability of the product.
  • Apply knowledge of Kubernetes for managing containerized applications and deployments.
  • Contribute to defining and achieving SLO / SLAs (Service Level Objectives / Agreements) and managing Error Budgeting .
  • Optimize deployment processes for efficiency and reliability.

Skills Required :

  • Proficiency with monitoring tools such as DataDog, GCP Matrix / Platform, and Grafana .
  • Experience with Kubernetes .
  • Strong understanding of deployment processes .
  • Knowledge of SLO / SLAs, Error Budgeting , and related tools.
  • Ability to measure and monitor system state effectively.
  • Experience in collecting and analyzing data from incidents and post-mortems.
  • Proficiency in automating software system monitoring.
  • Capability to identify and implement new instruments for product stability.
  • Experience in developing monitoring and testing frameworks, solutions, or scripts in various programming languages.
  • Strong commitment to maintaining and running tests for product reliability and stability.
  • QUALIFICATION :

  • Bachelor's degree in Computer Science, Information Technology, or a related engineering field, or equivalent practical experience.
  • Skills Required

    Datadog, Grafana, Kubernetes, System Monitoring

    Create a job alert for this search

    Site Reliability Engineer • Mumbai, Kolkata, Delhi

    Related jobs
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CapgeminiKolkata, IN
    Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues...Show moreLast updated: 11 days ago
    • Promoted
    Senior Site Reliability Engineer (SRE) – Datadog Observability

    Senior Site Reliability Engineer (SRE) – Datadog Observability

    Jade Globalkolkata, west bengal, in
    Senior Site Reliability Engineer (SRE) – Datadog Observability.SRE and Infrastructure Operations with minimum 3.Hyderabad preferable but open for Pune and remote. Site Reliability Engineer (SRE).SRE...Show moreLast updated: 2 days ago
    • Promoted
    Senior Site Reliability Engineer / Senior Cloud Engineer

    Senior Site Reliability Engineer / Senior Cloud Engineer

    CloudHirekolkata, west bengal, in
    The Technical Manager for Site Reliability Engineering (SRE) will lead a remote team of Site Reliability Engineers, ensuring operational excellence and fostering a high-performing team culture.Repo...Show moreLast updated: 1 day ago
    • Promoted
    Senior Site Reliability Engineer- ELK Expert

    Senior Site Reliability Engineer- ELK Expert

    iVedha Inc.Kolkata, IN
    Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone. Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Nebula Tech Solutionskolkata, west bengal, in
    SRE team supporting mission-critical applications for our.We’re now looking for engineers who can go beyond operations — those who can. Enhance application reliability through code.Add or modify cod...Show moreLast updated: 2 days ago
    • Promoted
    • New!
    Consultant – Site Reliability Engineer I

    Consultant – Site Reliability Engineer I

    ConfidentialKolkata
    Ready to build the future with AI.At Genpact, we don't just keep up with technology-we set the pace.AI and digital innovation are redefining industries, and we're leading the charge.Genpact's AI Gi...Show moreLast updated: 22 hours ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    IntraEdgeKolkata, IN
    Strong leadership and people management skills.Exceptional technical proficiency in Pearson's technology stack.Strategic thinking with a focus on long-term operational excellence.Champion operation...Show moreLast updated: 14 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CitNOW GroupKolkata, IN
    Founded in 2008, CitNOW is an innovative, enterprise-level software product suite that allows automotive dealerships globally to sell more vehicles and parts more profitably.CitNOW’s app-based plat...Show moreLast updated: 1 day ago
    • Promoted
    • New!
    Sr Engineer, Site Reliability [T500-21295]

    Sr Engineer, Site Reliability [T500-21295]

    TMUS Global Solutionskolkata, west bengal, in
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 20 hours ago
    • Promoted
    Site Reliability Engineering Lead

    Site Reliability Engineering Lead

    CloudHireKolkata, Republic Of India, IN
    The Technical Manager for Site Reliability Engineering (SRE) will lead a remote team of Site Reliability Engineers, ensuring operational excellence and fostering a high-performing team culture.Repo...Show moreLast updated: 1 day ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    ConfidentialBengaluru / Bangalore, Chennai, Kolkata
    We are excited to present a unique opportunity at Cognizant, a leading IT firm renowned for.We are seeking talented professionals with . Site Reliability Engineer (SRE), AWS Devops, Python, Java.You...Show moreLast updated: 5 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CodeKarmakolkata, west bengal, in
    Site Reliability Engineer (Multi-Cloud Deployments).CodeKarma is redefining how engineering teams understand and evolve complex systems — bringing production context directly into the developer’s w...Show moreLast updated: 22 days ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    iSoftStoneKolkata, Republic Of India, IN
    Greetings from ISoftStone Inc!.This is Rajlaxmi from the HR department of ISoftStone Inc.We are looking for a SRE / Devops. Location- Bangalore / Hybrid (2-3 days WFO).Bachelors degree in computer scie...Show moreLast updated: 18 hours ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    ConfidentialMumbai, Kolkata, Delhi
    Build products with MVRs and reliability standards , ensuring system resilience and scalability.Set up and operate observability tools across multiple cloud providers, incorporating AI-powered anom...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Sr Engineer, Site Reliability T500-21295

    Sr Engineer, Site Reliability T500-21295

    TMUS Global SolutionsKolkata, Republic Of India, IN
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 14 hours ago
    • Promoted
    • New!
    Site Reliability Engineer - Azure

    Site Reliability Engineer - Azure

    PhonePeKolkata, Republic Of India, IN
    We are looking for engineers who are passionate about reliability, performance, and efficiency, and with experience in building tools, services, and automation to manage and improve production serv...Show moreLast updated: 14 hours ago
    • Promoted
    • New!
    Lead Consultant – Site Reliability Engineer

    Lead Consultant – Site Reliability Engineer

    ConfidentialKolkata
    Ready to build the future with AI.At Genpact, we don't just keep up with technology-we set the pace.AI and digital innovation are redefining industries, and we're leading the charge.Genpact's AI Gi...Show moreLast updated: 22 hours ago
    • Promoted
    Site Reliability Engineering Manager

    Site Reliability Engineering Manager

    CloudHireKolkata
    Description : - Provide leadership and management to a remote team of Site Reliability Engineers, ensuring alignment with organizational priorities and goals.Oversee...Show moreLast updated: 8 days ago