Senior Site Reliability Engineer - AVP

Deutsche Bank • Bengaluru / Bangalore, India
4 days ago
Job description
Position Overview

Job Title: Senior Site Reliability Engineer

Corporate Title: AVP

Location: Bangalore, India

Role Description

  • We are seeking a Site Reliability Engineer for Observability platforms in the Bank to enhance, scale, and modernise our enterprise observability capability.
  • This role focuses on owning and evolving Observability and Monitoring tools across the Bank, driving a shift towards OpenTelemetry (OTel)-based telemetry standardisation.
  • The successful candidate will contribute to automation, AI adoption, and observability-by-design practices to improve reliability, scalability, and developer experience.

What we'll offer you

As part of our flexible scheme, here are just some of the benefits that you'll enjoy:

  • Best-in-class leave policy
  • Gender-neutral parental leave
  • 100% reimbursement under childcare assistance benefit (gender-neutral)
  • Sponsorship for industry-relevant certifications and education
  • Employee Assistance Program for you and your family members
  • Comprehensive hospitalization insurance for you and your dependents
  • Accident and term life insurance
  • Complimentary health screening for employees aged 35 and above

Your key responsibilities

Tools Reliability Governance:

  • Own the availability, performance, and resilience of the Observability tool stack in the Bank
  • Act as admin of the tool stack, ensuring platforms effectively support enterprise monitoring requirements
  • Drive standardisation of telemetry using OpenTelemetry (OTel) across Metrics, Events, Logs, and Traces (MELT)
  • Define and implement telemetry collection, enrichment, and routing strategies using OTel collectors and pipelines
  • Identify and implement automation and self-healing for common issues, and adopt AI practices to improve tool availability and user experience

Own Incident and Problem Management framework (severity, escalation, response and resolution):

  • Ensure quick incident response, containment, and service restoration
  • Perform deep root cause analysis and deliver permanent resolutions
  • Oversee major incidents and proactively identify systemic risks
  • Identify and eliminate audit and control risks

Align with and adhere to SRE best practices:

  • Provide frameworks, playbooks, and automation capabilities
  • Conduct reliability reviews and implement and improve SLO/SLI tracking
  • Maintain and govern error budgets
  • Promote observability-by-design principles across application and platform teams

Strong SRE / production engineering experience

  • Expertise in SLOs, error budgets, incident governance, and modern observability practices
  • Experience with distributed systems, GCP, Kubernetes, and OpenShift
  • Leverage OTel-driven telemetry insights to improve reliability and proactive issue detection
  • Strong understanding of risk, audit, and compliance (financial services preferred)
  • Own and evolve the Observability platform ecosystem: ITRS Geneos, New Relic (SaaS), Netcool, Grafana (KDB), and OTel-based telemetry pipelines

Your skills and experience

  • Strong experience as an admin of at least two of these observability tools: ITRS Geneos, New Relic (SaaS), Netcool, Grafana (KDB)
  • Strong understanding of MELT concepts and modern Observability architectures
  • Hands-on experience with OpenTelemetry (OTel):
      • Application and infrastructure instrumentation (auto and manual)
      • OTel collectors, exporters, and telemetry pipelines
      • Integration of OTel with tools such as Grafana and New Relic
  • Understanding of vendor-agnostic telemetry frameworks
  • Hands-on experience working on Unix servers (Windows Server experience would be an added benefit), Google Cloud, and OpenShift
  • Strong hands-on experience in any scripting language (shell, Bash, Python, etc.); experience with Ansible playbooks and Terraform will be beneficial
  • Experience with Oracle and MSSQL databases; KDB knowledge will be an added advantage

How we'll support you

  • Training and development to help you excel in your career.
  • Coaching and support from experts in your team.
  • A culture of continuous learning to aid progression.
  • A range of flexible benefits that you can tailor to suit your needs.

About us and our teams

Please visit our company website for further information:

We strive for a culture in which we are empowered to excel together every day. This includes acting responsibly, thinking commercially, taking initiative and working collaboratively.
Together we share and celebrate the successes of our people. Together we are Deutsche Bank Group.
We welcome applications from all people and promote a positive, fair and inclusive work environment.


Skills Required
MSSQL, Ansible, Kubernetes, Terraform, GCP, OpenShift, Distributed Systems, Oracle