Talent.com
Senior Site Reliability Engineer (SRE) – Datadog Observability

Jade Global, Kollam, Kerala, IN
1 day ago
Job description

Job Title: Senior Site Reliability Engineer (SRE) – Datadog Observability

Experience Required: 8+ years overall in SRE and Infrastructure Operations, with a minimum of 3 years of hands-on experience with Datadog

Location: Hyderabad preferred; open to Pune and remote

Job Summary:

We are seeking an experienced Site Reliability Engineer (SRE) to lead end-to-end SRE implementation initiatives with a strong focus on Datadog Observability. The ideal candidate will bring deep technical expertise in building reliable, scalable, and observable systems, along with hands-on experience integrating enterprise applications and middleware.

Key Responsibilities:

  • Drive end-to-end SRE implementation, ensuring system reliability, scalability, and performance.
  • Design, configure, and manage Datadog dashboards, monitors, alerts, and APM for proactive issue detection and resolution.
  • Use the Datadog Roles API to create and manage user roles, global permissions, and access controls for various teams.
  • Collaborate with product managers, engineering teams, and business stakeholders to identify observability gaps and design solutions using Datadog.
  • Implement automation for alerting, incident response, and ticket creation to improve operational efficiency.
  • Work closely with business and IT teams to support critical financial month-end, quarter-end, and year-end closures.
  • Leverage Datadog AI
  • Provide technical leadership in observability, reliability, and performance engineering practices.

Required Skills and Experience:

  • 8+ years of experience in Site Reliability Engineering and Observability.
  • Minimum of 3 years of hands-on experience with Datadog (dashboards, APM, alerting, log management, Roles API, and monitoring setup).
  • Proven experience implementing SRE best practices: incident management, postmortems, automation, and reliability metrics.
  • Excellent stakeholder management and communication skills; experience collaborating with business and IT teams.
  • Strong problem-solving mindset and ability to work in high-pressure production support environments.
Preferred Qualifications:

  • Certification in Datadog or related observability platforms.
  • Knowledge of CI/CD tools and automation frameworks.
  • Experience in cloud platforms (AWS, Azure, or OCI).
  • Exposure to ITIL-based production support processes.