Talent.com
Senior Site Reliability Engineer (SRE) – Datadog Observability

Senior Site Reliability Engineer (SRE) – Datadog Observability

Jade Globalindore, madhya pradesh, in
5 days ago
Job description

Job Description

Job Description

Job Title : Senior Site Reliability Engineer (SRE) – Datadog Observability

Experience Required : 8+ years overall in SRE and Infrastructure Operations with minimum 3 + years hands-on experience in Datadog

Location : Hyderabad preferable but open for Pune and remote

Job Summary :

We are seeking an experienced Site Reliability Engineer (SRE) to lead end-to-end SRE implementation initiatives with a strong focus on Datadog Observability . The ideal candidate will bring deep technical expertise in building reliable, scalable, and observable systems, with hands-on experience in integrating enterprise applications and middleware

Key Responsibilities :

  • Drive end-to-end SRE implementation , ensuring system reliability, scalability, and performance.
  • Design, configure, and manage Datadog dashboards , monitors, alerts, and APM for proactive issue detection and resolution.
  • Utilize the Datadog Roles API to create and manage user roles, global permissions, and access controls for various teams.
  • Collaborate with product managers, engineering teams, and business stakeholders to identify observability gaps and design solutions using Datadog.
  • Implement automation for alerting, incident response, and ticket creation to improve operational efficiency.
  • Work closely with business and IT teams to support critical Financial Month-End, Quarter-End, and Year-End closures .
  • Leverage Datadog AI
  • Provide technical leadership in observability, reliability, and performance engineering practices

Required Skills and Experience :

  • 8+ years of experience in Site Reliability Engineering, Observability
  • Minimum 3+ years of hands-on experience with Datadog (dashboards, APM, alerting, log management, Roles API, and monitoring setup).
  • Proven experience implementing SRE best practices —incident management, postmortems, automation, and reliability metrics
  • Excellent stakeholder management and communication skills ; experience collaborating with business and IT teams .
  • Strong problem-solving mindset and ability to work in high-pressure production support environments.
  • Preferred Qualifications :

  • Certification in Datadog or related observability platforms.
  • Knowledge of CI / CD tools and automation frameworks.
  • Experience in cloud platforms (AWS, Azure, or OCI).
  • Exposure to ITIL-based production support processes.
  • Create a job alert for this search

    Senior Site Reliability Engineer • indore, madhya pradesh, in

    Related jobs
    • Promoted
    Deployment Engineer

    Deployment Engineer

    Avocaindore, madhya pradesh, in
    Build, launch & optimize AI agents that power the next generation of home-service customer experiences.Avoca is the all-in-one AI lead-conversion platform. Our technology boosts booking rates, slash...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CitNOW Groupindore, madhya pradesh, in
    Founded in 2008, CitNOW is an innovative, enterprise-level software product suite that allows automotive dealerships globally to sell more vehicles and parts more profitably.CitNOW’s app-based plat...Show moreLast updated: 4 days ago
    • Promoted
    Senior MLOps Engineer

    Senior MLOps Engineer

    Mitchell Martin Inc.indore, India
    Include, but are not limited to, the following : .Own productionizing models—from tracked experiments to governed releases—ensuring resilient services with clear SLOs, runbooks, and fast, safe rollba...Show moreLast updated: 19 days ago
    • Promoted
    • New!
    Lead Engineer

    Lead Engineer

    Hyqooindore, madhya pradesh, in
    Design, deploy, and manage AWS cloud infrastructure, including EC2 instances, S3 buckets, VPCs, RDS databases, and Lambda functions. Assist in the design, implementation, and maintenance of backup, ...Show moreLast updated: 1 hour ago
    • Promoted
    Sr Engineer, Site Reliability [T500-21295]

    Sr Engineer, Site Reliability [T500-21295]

    TMUS Global Solutionsindore, madhya pradesh, in
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 4 days ago
    • Promoted
    Remote GenAI Engineer

    Remote GenAI Engineer

    EazyMLindore, madhya pradesh, in
    Remote
    Founded by Bell Labs research veterans, and associated with breakthrough startups like Amelia, EazyML, specializes in Transparent Machine Learning. Early on EazyML founders saw the need for Transpa...Show moreLast updated: 19 days ago
    • Promoted
    Senior ML Engineer

    Senior ML Engineer

    Piramal Financeindore, madhya pradesh, in
    Build and operate end-to-end ML / AI pipelines (data → training → deployment → monitoring).Automate CI / CD for ML / AI with Jenkins, integrate MLflow for tracking and registry.Deploy scalable batch and ...Show moreLast updated: 4 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Capgeminiindore, madhya pradesh, in
    Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues...Show moreLast updated: 15 days ago
    • Promoted
    Site Reliability Engineer (SRE) – Infrastructure & Automation

    Site Reliability Engineer (SRE) – Infrastructure & Automation

    InstaServiceindore, madhya pradesh, in
    InstaService is revolutionizing the home services industry through AI-driven technology, connecting customers with trusted professionals instantly. We’re growing fast across 23+ states and expanding...Show moreLast updated: 3 days ago
    • Promoted
    Senior AppDynamics Observability SME

    Senior AppDynamics Observability SME

    Dexian Indiaindore, madhya pradesh, in
    Position Title : Senior AppDynamics Observability SME.IT operations, system administration, or engineering.Ansible, Jenkins, Terraform, Python to develop configuration, deployment, and orchestration...Show moreLast updated: 15 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CodeKarmaindore, madhya pradesh, in
    Site Reliability Engineer (Multi-Cloud Deployments).CodeKarma is redefining how engineering teams understand and evolve complex systems — bringing production context directly into the developer’s w...Show moreLast updated: 25 days ago
    • Promoted
    Sr Data Engineer - DBT Tech Lead

    Sr Data Engineer - DBT Tech Lead

    Ascendionindore, madhya pradesh, in
    We are seeking an experienced Senior DBT Technical lead to be part of our upcoming data 2025-2026 data initiatives.This role requires deep expertise in dbt, cloud data platforms, and complex ETL / EL...Show moreLast updated: 3 days ago
    • Promoted
    Engineer : Senior LLM Optimization (LLMO) / GEO Expert – Google Vertex

    Engineer : Senior LLM Optimization (LLMO) / GEO Expert – Google Vertex

    Proso.aiIndore, Madhya Pradesh, India
    About the Role : Proso AI is seeking a Senior Expert with hands-on experience in Google Vertex AI to lead short-term projects focused on LLM Optimization (LLMO) and Generative Engine Optimiz...Show moreLast updated: 3 days ago
    • Promoted
    Senior Implementation Engineer

    Senior Implementation Engineer

    Accelyaindore, madhya pradesh, in
    This is a backend-focused development role responsible for building APIs, integrations, and services.We are looking for a Senior Development Engineer to lead complex software development initiative...Show moreLast updated: 15 days ago
    • Promoted
    Senior / Lead Engineer - DevOps (AWS / Azure / GCP)

    Senior / Lead Engineer - DevOps (AWS / Azure / GCP)

    QBurstindore, madhya pradesh, in
    We are seeking an experienced and versatile DevOps Engineer.The ideal candidate will have hands-on experience with CI / CD pipelines, Kubernetes, Linux systems, monitoring / logging tools, and Infrastr...Show moreLast updated: 19 days ago
    • Promoted
    Senior Site Reliability Engineer- ELK Expert

    Senior Site Reliability Engineer- ELK Expert

    iVedha Inc.indore, madhya pradesh, in
    Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone. Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 30+ days ago
    • Promoted
    Lead - Cloud Reliability Engineer

    Lead - Cloud Reliability Engineer

    Searce Incindore, madhya pradesh, in
    The ‘process-first’ AI-native modern tech consultancy that's rewriting the rules.As an engineering-led consultancy, we are dedicated to relentlessly improving the real business outcomes.Our solvers...Show moreLast updated: 30+ days ago
    • Promoted
    Resident Engineer – Kubernetes & Portworx

    Resident Engineer – Kubernetes & Portworx

    CMK Resources, Inc.indore, madhya pradesh, in
    CMK Resources Resident Engineer – Kubernetes & Portworx.Remote - based in India working U.EST standard time business hours. compensation expectation of up to 30 lakhs per annum depending on experie...Show moreLast updated: 30+ days ago