Talent.com
Senior Site Reliability Engineer

Senior Site Reliability Engineer

Nebula Tech Solutionssecunderabad, telangana, in
22 hours ago
Job description

At Nebula Tech Solutions , we’re building a high-performing SRE team supporting mission-critical applications for our US-based enterprise clients .

We’re now looking for engineers who can go beyond operations — those who can work directly with application code to improve observability, reliability, and performance at scale. 🌎🌙

🔧 What You’ll Do

✅ Enhance application reliability through code

  • Add or modify code to improve telemetry and resilience in existing applications.
  • Implement and validate retries, timeouts, and failover logic to improve system reliability.
  • Contribute to and review application code changes with a focus on SRE and production-readiness.

✅ Advance observability and telemetry

  • Embed new telemetry data (e.g., counters, histograms, traces, structured logs ) into existing services.
  • Add or upgrade OpenTelemetry and related libraries; test for compatibility and regression before rollout.
  • Integrate observability enhancements with Prometheus, Grafana, ELK, and OpenTelemetry pipelines.
  • ✅ Collaborate & support global reliability efforts

  • Partner with developers to ensure metric coverage, tracing, and alerting meet production standards.
  • Participate in incident response, root cause analysis (RCA) , and postmortems.
  • Automate recurring operational tasks using Python, Go, or similar scripting .
  • Improve deployment pipelines and infrastructure using Kubernetes, Terraform, Helm , and CI / CD tools.
  • 🧩 What We’re Looking For

    🔹 5+ years of experience in DevOps, SRE, or software development roles.

    🔹 Strong coding proficiency in C# or Java (Python or Go is a plus).

    🔹 Hands-on experience with Kubernetes , containerized workloads , and microservices architecture .

    🔹 Deep understanding of telemetry and observability concepts — metrics, logs, traces, and alerting.

    🔹 Familiarity with OpenTelemetry , Prometheus , Grafana , or similar APM tools.

    🔹 Strong understanding of resilient design patterns (retry, circuit breaker, fail-fast, graceful degradation).

    🔹 Experience collaborating with developers to improve code-level reliability and metric instrumentation.

    📍 Location : Remote – India

    🕐 Shift : US Night Shift (Continuous)

    🌍 Client : US-based Enterprise Applications

    #NebulaTechSolutions #Hiring #SRE #DevOps #CSharp #Java #OpenTelemetry #Prometheus #ReliabilityEngineering #NightShift

    Create a job alert for this search

    Senior Site Reliability Engineer • secunderabad, telangana, in

    Related jobs
    • Promoted
    Sr Engineer, Site Reliability Engineer

    Sr Engineer, Site Reliability Engineer

    TMUS Global SolutionsHyderabad, India
    The Senior Systems Reliability Engineer (SRE) ensures the stability, performance, and reliability of IT services and infrastructure. This role combines software engineering and operations expertise ...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Zyoin GroupHyderabad
    Description : As the most senior technical individual contributor within an entire division of Engine...Show moreLast updated: 6 days ago
    • Promoted
    Sr Engineer, Site Reliability Engineer [T500-20464]

    Sr Engineer, Site Reliability Engineer [T500-20464]

    TMUS Global SolutionsHyderabad, Telangana, India
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 25 days ago
    • Promoted
    Lead Site Reliability Engineer

    Lead Site Reliability Engineer

    AutoRABITHyderabad, Republic Of India, IN
    AutoRABIT is the leader in DevSecOps for SaaS platforms such as Salesforce.Its unique metadata-aware capability makes Release Management, Version Control, and Backup & Recovery complete, reliable, ...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Lead Site Reliability Engineer

    Senior Lead Site Reliability Engineer

    ConfidentialHyderabad / Secunderabad, Telangana, India
    Elevate your engineering prowess to unprecedented levels by joining a team of exceptionally gifted professionals and position yourself among the top echelon in site reliability.As a Principal Site ...Show moreLast updated: 4 days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    IntraEdgeHyderabad, IN
    Strong leadership and people management skills.Exceptional technical proficiency in Pearson's technology stack.Strategic thinking with a focus on long-term operational excellence.Champion operation...Show moreLast updated: 13 days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    ConfidentialHyderabad / Secunderabad, Telangana
    SOPs and technical documentation.AWS (EC2, S3, VPC, RDS, EKS, ECS, CloudWatch, CloudFormation)—to support scalable system operations. Jenkins to enable seamless deployments.Datadog, Site24x7, Grafan...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    AutoRABITHyderabad, Telangana, India
    AutoRABIT is the leader in DevSecOps for SaaS platforms such as Salesforce.Its unique metadata-aware capability makes Release Management, Version Control, and Backup & Recovery complete, reliable, ...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Senior Engineer

    Site Reliability Senior Engineer

    ConfidentialHyderabad / Secunderabad, Telangana, India
    Veeva Systems is a mission-driven organization and pioneer in industry cloud, helping life sciences companies bring therapies to patients faster. As one of the fastest-growing SaaS companies in hist...Show moreLast updated: 4 days ago
    • Promoted
    Sr Engineer, Site Reliability

    Sr Engineer, Site Reliability

    TMUS Global SolutionsHyderabad, India
    The Senior Systems Reliability Engineer (SRE) ensures the stability, performance, and reliability of IT services and infrastructure. This role combines software engineering and operations expertise ...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    Talent SutraHyderabad, Telangana, India
    The position exists to deploy the products and their updates ensuring smooth infrastructure and configuration management for robust project delivery. Technical Skills Required : • Operating System (...Show moreLast updated: 19 hours ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CapgeminiHyderabad, IN
    Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues...Show moreLast updated: 10 days ago
    • Promoted
    Engineer - Site Relibility - FPT

    Engineer - Site Relibility - FPT

    Talent500 INCHyderabad, India
    Engineer - Site Reliability - FPT.As a Site Reliability Engineer, youll play a crucial role in keeping our digital backbone running seamlessly for millions of customers. Your mission : reduce inciden...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    InfosysHyderabad, Republic Of India, IN
    We are seeking a skilled and motivated Site Reliability Engineer with hands-on expertise.DevOps tools, and SRE principles. Provide production support for Production applications, ensuring the stabil...Show moreLast updated: 14 days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    TMUS Global SolutionsHyderabad, Republic Of India, IN
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 26 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    o9 Solutions, Inc.hyderabad, telangana, in
    Be part of something revolutionary.At o9 Solutions, our mission is clear : be the Most Valuable Platform (MVP) for enterprises. With our AI-driven platform — the o9 Digital Brain — we integrate globa...Show moreLast updated: 22 days ago
    • Promoted
    Engineer, Site Reliability

    Engineer, Site Reliability

    TMUS Global SolutionsHyderabad, India
    Engineer reliability : Identify potential system issues early, implement preventive measures, and boost system resilience. Automate for speed : Build tools, pipelines, and scripts that eliminate manua...Show moreLast updated: 30+ days ago
    • Promoted
    Lead Site Reliability Engineer

    Lead Site Reliability Engineer

    S&P GlobalHyderabad, Telangana, India
    This job is with S&P Global, an inclusive employer and a member of myGwork – the largest global platform for the LGBTQ+ business community. Please do not contact the recruiter directly.About the Rol...Show moreLast updated: 7 days ago