Talent.com
[Apply in 3 Minutes] Senior Site Reliability Engineer (SRE) – Datadog Observability

[Apply in 3 Minutes] Senior Site Reliability Engineer (SRE) – Datadog Observability

Jade GlobalPune, Maharashtra, India
4 days ago
Job description

Job Description

Job Description

Job Title : Senior Site Reliability Engineer (SRE) – Datadog Observability

Experience Required : 8+ years overall in SRE and Infrastructure Operations with minimum 3+ years hands-on experience in Datadog

Location : Hyderabad preferable but open for Pune and remote

Job Summary :

We are seeking an experienced Site Reliability Engineer (SRE) to lead end-to-end SRE implementation initiatives with a strong focus on Datadog Observability. The ideal candidate will bring deep technical expertise in building reliable, scalable, and observable systems, with hands-on experience in integrating enterprise applications and middleware

Key Responsibilities :

  • Drive end-to-end SRE implementation, ensuring system reliability, scalability, and performance.
  • Design, configure, and manage Datadog dashboards, monitors, alerts, and APM for proactive issue detection and resolution.
  • Utilize the Datadog Roles API to create and manage user roles, global permissions, and access controls for various teams.
  • Collaborate with product managers, engineering teams, and business stakeholders to identify observability gaps and design solutions using Datadog.
  • Implement automation for alerting, incident response, and ticket creation to improve operational efficiency.
  • Work closely with business and IT teams to support critical Financial Month-End, Quarter-End, and Year-End closures.
  • Leverage Datadog AI
  • Provide technical leadership in observability, reliability, and performance engineering practices

Required Skills and Experience :

  • 8+ years of experience in Site Reliability Engineering, Observability
  • Minimum 3+ years of hands-on experience with Datadog (dashboards, APM, alerting, log management, Roles API, and monitoring setup).
  • Proven experience implementing SRE best practices—incident management, postmortems, automation, and reliability metrics
  • Excellent stakeholder management and communication skills; experience collaborating with business and IT teams.
  • Strong problem-solving mindset and ability to work in high-pressure production support environments.
  • Preferred Qualifications :

  • Certification in Datadog or related observability platforms.
  • Knowledge of CI / CD tools and automation frameworks.
  • Experience in cloud platforms (AWS, Azure, or OCI).
  • Exposure to ITIL-based production support processes.
  • Create a job alert for this search

    Senior Reliability • Pune, Maharashtra, India

    Related jobs
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CapgeminiPune, IN
    Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues...Show moreLast updated: 14 days ago
    • Promoted
    Lead Site Reliability Engineer

    Lead Site Reliability Engineer

    Futurism Technologies, INC.Pune, Maharashtra, India
    Site Reliability Engineering (SRE) Lead.We are seeking a highly skilled and experienced.You will lead a team responsible for building and maintaining automated deployment pipelines, infrastructure ...Show moreLast updated: 3 days ago
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    OnitPune, Maharashtra, IN
    Quick Apply
    Site Reliability Engineer Onit, Inc.Site Reliability Engineer L2 to join our Core Infrastructure team.This role will help to ensure the reliability of a diverse set of applications across our AWS i...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    SFS Group India Pvt. Ltd.pune, maharashtra, in
    Act as the Site Reliability Engineer for global operations, ensuring system stability, scalability, and efficiency through advanced automation, observability, and proactive infrastructure managemen...Show moreLast updated: 21 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    PRI GlobalPune, Maharashtra, India
    Experience in Linux , Azure cloud certification and candidate must have good knowledge on Bash / jenkins / Chef / chef-habitat technologies.Show moreLast updated: 4 days ago
    • Promoted
    Senior Site Reliability Engineer (SRE) – Datadog Observability

    Senior Site Reliability Engineer (SRE) – Datadog Observability

    Jade GlobalPune, Maharashtra, India
    Senior Site Reliability Engineer (SRE) – Datadog Observability.SRE and Infrastructure Operations with minimum 3.Hyderabad preferable but open for Pune and remote. Site Reliability Engineer (SRE).SRE...Show moreLast updated: 4 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    PhonePePune, Maharashtra, India
    Troubleshoot issues across the entire stack - hardware, software, application, and network.Work to improve the reliability and performance of the next generation of distributed systems.Work to impr...Show moreLast updated: 3 days ago
    • Promoted
    Reveille Technologies - Site Reliability Engineer - DevOps

    Reveille Technologies - Site Reliability Engineer - DevOps

    Reveille TechnologiesPune
    Job Summary : We are seeking a proactive and skilled Site Reliability Engineer (SRE) to join our team on a Contract-to-Hire (C2H) basis.The ideal c...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer, Platform Engineering

    Site Reliability Engineer, Platform Engineering

    ConfidentialPune, India
    Tesla's Platform Engineering is looking for a Site Reliability Engineer to join our team.As a member of the team, you will be building and maintaining Kubernetes clusters using infrastructure-as-co...Show moreLast updated: 8 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CitNOW GroupPune, IN
    Founded in 2008, CitNOW is an innovative, enterprise-level software product suite that allows automotive dealerships globally to sell more vehicles and parts more profitably.CitNOW’s app-based plat...Show moreLast updated: 3 days ago
    • Promoted
    Senior Site Reliability Engineer- ELK Expert

    Senior Site Reliability Engineer- ELK Expert

    iVedha Inc.Pune, IN
    Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone. Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer - DevOps

    Site Reliability Engineer - DevOps

    QualysPune, Maharashtra, India
    Job Description As a Junior DevOps Engineer at Qualys, you will play a supporting yet impactful role in maintaining and enhancing our DevOps ecosystem. This position is ideal for someone with a str...Show moreLast updated: 4 days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    ConfidentialPune, India
    Maintain and troubleshoot production cloud infrastructure to ensure optimal performance and uptime.Apply patches, updates, and security configurations across cloud environments.Execute scheduled re...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    IntraEdgePune, IN
    Strong leadership and people management skills.Exceptional technical proficiency in Pearson's technology stack.Strategic thinking with a focus on long-term operational excellence.Champion operation...Show moreLast updated: 17 days ago
    • Promoted
    Site Reliability Engineer (SRE) – Infrastructure & Automation

    Site Reliability Engineer (SRE) – Infrastructure & Automation

    InstaServicePune, IN
    InstaService is revolutionizing the home services industry through AI-driven technology, connecting customers with trusted professionals instantly. We’re growing fast across 23+ states and expanding...Show moreLast updated: 2 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CodeKarmapune, maharashtra, in
    Site Reliability Engineer (Multi-Cloud Deployments).CodeKarma is redefining how engineering teams understand and evolve complex systems — bringing production context directly into the developer’s w...Show moreLast updated: 25 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    ConfidentialPune
    As an SRE Engineer, you will be responsible for ensuring the seamless operation and optimal performance of large-scale distributed software applications. Your role revolves around maintaining a robu...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer - Elastic Kubernetes Service

    Site Reliability Engineer - Elastic Kubernetes Service

    MNR SolutionsPune
    Description : Site Reliability Engineer (SRE) Kubernetes & Cloud Position Summary : We are seeking a...Show moreLast updated: 14 days ago