Talent.com
Senior Site Reliability Engineer (Sre) – Datadog Observability

Senior Site Reliability Engineer (Sre) – Datadog Observability

Jade GlobalThiruvananthapuram, Republic Of India, IN
3 days ago
Job description

Job Description

Job Description

Job Title : Senior Site Reliability Engineer (SRE) – Datadog Observability

Experience Required : 8+ years overall in SRE and Infrastructure Operations with minimum 3 + years hands-on experience in Datadog

Location : Hyderabad preferable but open for Pune and remote

Job Summary :

We are seeking an experienced Site Reliability Engineer (SRE) to lead end-to-end SRE implementation initiatives with a strong focus on Datadog Observability . The ideal candidate will bring deep technical expertise in building reliable, scalable, and observable systems, with hands-on experience in integrating enterprise applications and middleware

Key Responsibilities :

  • Drive end-to-end SRE implementation , ensuring system reliability, scalability, and performance.
  • Design, configure, and manage Datadog dashboards , monitors, alerts, and APM for proactive issue detection and resolution.
  • Utilize the Datadog Roles API to create and manage user roles, global permissions, and access controls for various teams.
  • Collaborate with product managers, engineering teams, and business stakeholders to identify observability gaps and design solutions using Datadog.
  • Implement automation for alerting, incident response, and ticket creation to improve operational efficiency.
  • Work closely with business and IT teams to support critical Financial Month-End, Quarter-End, and Year-End closures .
  • Leverage Datadog AI
  • Provide technical leadership in observability, reliability, and performance engineering practices

Required Skills and Experience :

  • 8+ years of experience in Site Reliability Engineering, Observability
  • Minimum 3+ years of hands-on experience with Datadog (dashboards, APM, alerting, log management, Roles API, and monitoring setup).
  • Proven experience implementing SRE best practices —incident management, postmortems, automation, and reliability metrics
  • Excellent stakeholder management and communication skills ;
  • experience collaborating with business and IT teams .

  • Strong problem-solving mindset and ability to work in high-pressure production support environments.
  • Preferred Qualifications :

  • Certification in Datadog or related observability platforms.
  • Knowledge of CI / CD tools and automation frameworks.
  • Experience in cloud platforms (AWS, Azure, or OCI).
  • Exposure to ITIL-based production support processes.
  • Create a job alert for this search

    Senior Site Reliability Engineer • Thiruvananthapuram, Republic Of India, IN

    Related jobs
    • Promoted
    • New!
    Reliability Test Engineer

    Reliability Test Engineer

    L&T Technology ServicesThiruvananthapuram, IN
    Job Title : Reliability Test Engineer.Develop and execute reliability test plans including : .Accelerated Life Testing (ALT). Highly Accelerated Life Testing (HALT).Highly Accelerated Stress Screening ...Show moreLast updated: 13 hours ago
    • Promoted
    Equifax - Senior Site Reliability Engineer - IAC Terraform

    Equifax - Senior Site Reliability Engineer - IAC Terraform

    EquifaxTrivandrum
    About the job Site Reliability Engineering (SRE) at Equifax is a discipline that combines software and systems engineering for building and running large-scale, distr...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CapgeminiKollam, IN
    Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues...Show moreLast updated: 13 days ago
    • Promoted
    Lead - Cloud Reliability Engineer

    Lead - Cloud Reliability Engineer

    Searce Incthiruvananthapuram, kerala, in
    The ‘process-first’ AI-native modern tech consultancy that's rewriting the rules.As an engineering-led consultancy, we are dedicated to relentlessly improving the real business outcomes.Our solvers...Show moreLast updated: 30+ days ago
    • Promoted
    Equifax - Site Reliability Engineer

    Equifax - Site Reliability Engineer

    EquifaxThiruvananthapuram
    Site Reliability Engineering (SRE) at Equifax SRE is a discipline that combines software and systems engineering for building and running large-scale, distrib...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer II

    Site Reliability Engineer II

    ConfidentialThiruvananthapuram, Thiruvananthapuram / Trivandrum, India
    The world's top banks use Zafin's integrated platform to drive transformative customer value.Powered by an innovative AI-powered architecture, Zafin's platform seamlessly unifies data from across t...Show moreLast updated: 7 days ago
    • Promoted
    Senior Site Reliability Engineer- ELK Expert

    Senior Site Reliability Engineer- ELK Expert

    iVedha Inc.Kollam, IN
    Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone. Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 30+ days ago
    • Promoted
    Senior MLOps Engineer

    Senior MLOps Engineer

    Mitchell Martin Inc.Thiruvananthapuram, IN
    Include, but are not limited to, the following : .Own productionizing models—from tracked experiments to governed releases—ensuring resilient services with clear SLOs, runbooks, and fast, safe rollba...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer (SRE)

    Site Reliability Engineer (SRE)

    ConfidentialThiruvananthapuram / Trivandrum
    As a Site Reliability Engineer (SRE) you will be responsible for improving the overall reliability of applications by ensuring its availability, performance, and scalability.Should be able to gathe...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    ConfidentialThiruvananthapuram / Trivandrum, India
    Site Reliability Engineering (SRE).Equifax is a discipline that combines software and systems engineering for building and running large-scale, distributed, fault-tolerant systems.SRE ensures that ...Show moreLast updated: 7 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CitNOW GroupThiruvananthapuram, IN
    Founded in 2008, CitNOW is an innovative, enterprise-level software product suite that allows automotive dealerships globally to sell more vehicles and parts more profitably.CitNOW’s app-based plat...Show moreLast updated: 2 days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Nebula Tech SolutionsThiruvananthapuram, Republic Of India, IN
    SRE team supporting mission-critical applications for our.We’re now looking for engineers who can go beyond operations — those who can. Enhance application reliability through code.Add or modify cod...Show moreLast updated: 3 days ago
    • Promoted
    Sr Engineer, Site Reliability [T500-21295]

    Sr Engineer, Site Reliability [T500-21295]

    TMUS Global Solutionskollam, kerala, in
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 2 days ago
    • Promoted
    • New!
    Site reliability engineer

    Site reliability engineer

    CitNOW GroupThiruvananthapuram, Kerala, India
    Founded in 2008, Cit NOW is an innovative, enterprise-level software product suite that allows automotive dealerships globally to sell more vehicles and parts more profitably.Cit NOW’s app-based pl...Show moreLast updated: 5 hours ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    IntraEdgeKollam, IN
    Strong leadership and people management skills.Exceptional technical proficiency in Pearson's technology stack.Strategic thinking with a focus on long-term operational excellence.Champion operation...Show moreLast updated: 16 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CodeKarmathiruvananthapuram, kerala, in
    Site Reliability Engineer (Multi-Cloud Deployments).CodeKarma is redefining how engineering teams understand and evolve complex systems — bringing production context directly into the developer’s w...Show moreLast updated: 24 days ago
    • Promoted
    Senior Site Reliability Engineer (Sre) – Datadog Observability

    Senior Site Reliability Engineer (Sre) – Datadog Observability

    Jade GlobalKollam, Republic Of India, IN
    Senior Site Reliability Engineer (SRE) – Datadog Observability.SRE and Infrastructure Operations with minimum 3.Hyderabad preferable but open for Pune and remote. Site Reliability Engineer (SRE).SRE...Show moreLast updated: 3 days ago
    • Promoted
    Senior Site Reliability Engineer- Elk Expert

    Senior Site Reliability Engineer- Elk Expert

    iVedha Inc.Thiruvananthapuram, Republic Of India, IN
    Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone. Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 18 days ago