Talent.com
Senior Site Reliability Engineer

Senior Site Reliability Engineer

Nebula Tech SolutionsKollam, Republic Of India, IN
2 days ago
Job description

At Nebula Tech Solutions , we’re building a high-performing SRE team supporting mission-critical applications for our US-based enterprise clients .

We’re now looking for engineers who can go beyond operations — those who can work directly with application code to improve observability, reliability, and performance at scale. 🌎🌙

🔧 What You’ll Do

✅ Enhance application reliability through code

  • Add or modify code to improve telemetry and resilience in existing applications.
  • Implement and validate retries, timeouts, and failover logic to improve system reliability.
  • Contribute to and review application code changes with a focus on SRE and production-readiness.

✅ Advance observability and telemetry

  • Embed new telemetry data (e.G., counters, histograms, traces, structured logs ) into existing services.
  • Add or upgrade OpenTelemetry and related libraries;
  • test for compatibility and regression before rollout.

  • Integrate observability enhancements with Prometheus, Grafana, ELK, and OpenTelemetry pipelines.
  • ✅ Collaborate & support global reliability efforts

  • Partner with developers to ensure metric coverage, tracing, and alerting meet production standards.
  • Participate in incident response, root cause analysis (RCA) , and postmortems.
  • Automate recurring operational tasks using Python, Go, or similar scripting .
  • Improve deployment pipelines and infrastructure using Kubernetes, Terraform, Helm , and CI / CD tools.
  • 🧩 What We’re Looking For

    🔹 5+ years of experience in DevOps, SRE, or software development roles.

    🔹 Strong coding proficiency in C# or Java (Python or Go is a plus).

    🔹 Hands-on experience with Kubernetes , containerized workloads , and microservices architecture .

    🔹 Deep understanding of telemetry and observability concepts — metrics, logs, traces, and alerting.

    🔹 Familiarity with OpenTelemetry , Prometheus , Grafana , or similar APM tools.

    🔹 Strong understanding of resilient design patterns (retry, circuit breaker, fail-fast, graceful degradation).

    🔹 Experience collaborating with developers to improve code-level reliability and metric instrumentation.

    📍 Location : Remote – India

    🕐 Shift : US Night Shift (Continuous)

    🌍 Client : US-based Enterprise Applications

    #NebulaTechSolutions #Hiring #SRE #DevOps #CSharp #Java #OpenTelemetry #Prometheus #ReliabilityEngineering #NightShift

    Create a job alert for this search

    Senior Site Reliability Engineer • Kollam, Republic Of India, IN

    Related jobs
    • Promoted
    Senior Site Reliability Engineer / Senior Cloud Engineer

    Senior Site Reliability Engineer / Senior Cloud Engineer

    CloudHirekollam, kerala, in
    The Technical Manager for Site Reliability Engineering (SRE) will lead a remote team of Site Reliability Engineers, ensuring operational excellence and fostering a high-performing team culture.Repo...Show moreLast updated: 2 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CapgeminiKollam, IN
    Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues...Show moreLast updated: 12 days ago
    • Promoted
    Equifax - Site Reliability Engineer

    Equifax - Site Reliability Engineer

    EquifaxThiruvananthapuram
    Site Reliability Engineering (SRE) at Equifax SRE is a discipline that combines software and systems engineering for building and running large-scale, distrib...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer II

    Site Reliability Engineer II

    ConfidentialThiruvananthapuram, Thiruvananthapuram / Trivandrum, India
    The world's top banks use Zafin's integrated platform to drive transformative customer value.Powered by an innovative AI-powered architecture, Zafin's platform seamlessly unifies data from across t...Show moreLast updated: 6 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    ConfidentialThiruvananthapuram, Thiruvananthapuram / Trivandrum, India
    Job Title : Senior Site Reliability Engineer (SRE II).Location : Thiruvananthapuram, KL (Hybrid 3 days Onsite).We're looking for an experienced. Senior Site Reliability Engineer.The ideal candidate ha...Show moreLast updated: 2 days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    ConfidentialThiruvananthapuram / Trivandrum, India
    Site Reliability Engineering (SRE).Equifax is a discipline that combines software and systems engineering for building and running large-scale, distributed, fault-tolerant systems.SRE ensures that ...Show moreLast updated: 6 days ago
    • Promoted
    Site Reliability Engineer (SRE)

    Site Reliability Engineer (SRE)

    ConfidentialThiruvananthapuram / Trivandrum
    As a Site Reliability Engineer (SRE) you will be responsible for improving the overall reliability of applications by ensuring its availability, performance, and scalability.Should be able to gathe...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    IntraEdgeAlappuzha, IN
    Strong leadership and people management skills.Exceptional technical proficiency in Pearson's technology stack.Strategic thinking with a focus on long-term operational excellence.Champion operation...Show moreLast updated: 15 days ago
    • Promoted
    Senior Site Reliability Engineer- ELK Expert

    Senior Site Reliability Engineer- ELK Expert

    iVedha Inc.Thiruvananthapuram, IN
    Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone. Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Nebula Tech Solutionsalappuzha, kerala, in
    SRE team supporting mission-critical applications for our.We’re now looking for engineers who can go beyond operations — those who can. Enhance application reliability through code.Add or modify cod...Show moreLast updated: 2 days ago
    • Promoted
    Senior Site Reliability Engineer (SRE) – Datadog Observability

    Senior Site Reliability Engineer (SRE) – Datadog Observability

    Jade Globalalappuzha, kerala, in
    Senior Site Reliability Engineer (SRE) – Datadog Observability.SRE and Infrastructure Operations with minimum 3.Hyderabad preferable but open for Pune and remote. Site Reliability Engineer (SRE).SRE...Show moreLast updated: 2 days ago
    • Promoted
    Sr Engineer, Site Reliability T500-21295

    Sr Engineer, Site Reliability T500-21295

    TMUS Global SolutionsAlleppey, Republic Of India, IN
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 23 hours ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CodeKarmaalappuzha, kerala, in
    Site Reliability Engineer (Multi-Cloud Deployments).CodeKarma is redefining how engineering teams understand and evolve complex systems — bringing production context directly into the developer’s w...Show moreLast updated: 23 days ago
    • Promoted
    Equifax - Senior Site Reliability Engineer - IAC Terraform

    Equifax - Senior Site Reliability Engineer - IAC Terraform

    EquifaxThiruvananthapuram
    About the job Site Reliability Engineering (SRE) at Equifax is a discipline that combines software and systems engineering for building and running large-scale, distr...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer (Sre) – Datadog Observability

    Senior Site Reliability Engineer (Sre) – Datadog Observability

    Jade GlobalAlleppey, Republic Of India, IN
    Senior Site Reliability Engineer (SRE) – Datadog Observability.SRE and Infrastructure Operations with minimum 3.Hyderabad preferable but open for Pune and remote. Site Reliability Engineer (SRE).SRE...Show moreLast updated: 2 days ago
    • Promoted
    • New!
    Site Reliability Engineer (SRE) – Infrastructure & Automation

    Site Reliability Engineer (SRE) – Infrastructure & Automation

    InstaServiceAlappuzha, IN
    InstaService is revolutionizing the home services industry through AI-driven technology, connecting customers with trusted professionals instantly. We’re growing fast across 23+ states and expanding...Show moreLast updated: 8 hours ago
    • Promoted
    Sr Engineer, Site Reliability [T500-21295]

    Sr Engineer, Site Reliability [T500-21295]

    TMUS Global Solutionsalappuzha, kerala, in
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 1 day ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CitNOW GroupKollam, IN
    Founded in 2008, CitNOW is an innovative, enterprise-level software product suite that allows automotive dealerships globally to sell more vehicles and parts more profitably.CitNOW’s app-based plat...Show moreLast updated: 1 day ago