Lead - Site Reliability Engineer

VXI Global Solutions, India
25 days ago
Job description

We are looking for a Lead - Site Reliability Engineer with 8+ years of experience to design, implement, and manage robust observability solutions across our cloud infrastructure and applications. The ideal candidate will have hands-on experience with Prometheus, Grafana, Google Cloud Monitoring, and OpenTelemetry, along with exposure to SolarWinds. You should be comfortable working with metrics, logs, and traces, and be able to correlate telemetry data to proactively detect, diagnose, and resolve performance issues.

Key Responsibilities:

Design and maintain observability pipelines using OpenTelemetry, Prometheus, and Grafana.

Build dashboards and alerts to monitor system health, application performance, and business KPIs.

Integrate observability solutions with Google Cloud Platform services and SolarWinds.

Correlate logs, metrics, and traces to troubleshoot incidents and reduce MTTR.

Collaborate with SREs, DevOps, and development teams to improve end-to-end system observability.

Implement best practices for telemetry data collection, enrichment, storage, and visualization.

Requirements:

Strong experience with Prometheus and Grafana for monitoring and alerting.

Proficiency in OpenTelemetry for instrumenting distributed systems.

Working knowledge of observability tools in Google Cloud (e.g., Cloud Monitoring, Logging, Trace).

Exposure to SolarWinds for network and infrastructure monitoring.

Solid understanding of telemetry data types: metrics, logs, and traces.

Ability to correlate and analyze multi-source observability data.

Scripting skills (Python, Bash); familiarity with Infrastructure-as-Code is a plus.

Preferred Qualifications:

Experience in Site Reliability Engineering or Platform Engineering roles.

Knowledge of SLIs/SLOs and performance benchmarking.

Experience with APM tools (e.g., Datadog, New Relic) is a plus.
