Site Reliability Engineer 5Confidential • India, Bengaluru / Bangalore

Site Reliability Engineer 5

Confidential • India, Bengaluru / Bangalore

1 hour ago

Job description

Our Company

Changing the world through digital experiences is what Adobe's all about. We give everyone—from emerging artists to global brands—everything they need to design and deliver exceptional digital experiences! We're passionate about empowering people to create beautiful and powerful images, videos, and apps, and transform how companies interact with customers across every screen.

We're on a mission to hire the very best and are committed to creating exceptional employee experiences where everyone is respected and has access to equal opportunity. We realize that new ideas can come from everywhere in the organization, and we know the next big idea could be yours!

Adobe Pass
is a leading authentication and authorization platform that enables seamless access to premium TV and video content across devices. It powers 'TV Everywhere' experiences by allowing users to sign in with their pay-TV credentials to watch subscribed content from broadcasters and streaming services. Trusted by major media companies, Adobe Pass ensures secure, scalable, and frictionless user authentication, while providing insights and analytics that help content providers deliver personalized and compliant viewing experiences.

System Architecture & Technical Strategy

Define and drive the long-term reliability and scalability strategy for the Adobe Pass platform, aligning with product and business goals.

Architect large-scale, distributed, and multi-region systems designed for resiliency, observability, and self-healing.

Anticipate systemic risks and design proactive mitigation strategies — ensuring zero single points of failure across critical services.

Partner with software architecture and infrastructure teams to evolve the platform toward greater reliability, efficiency, and cost optimization.

Automation, Observability & Reliability Engineering

Build and champion advanced automation frameworks that enable zero-touch operations across deployment, recovery, and scaling workflows.

Introduce AI / ML-based predictive monitoring and anomaly detection systems to anticipate failures before they impact users.

Lead organization-wide reliability initiatives — such as chaos engineering, error budgets, and SLO adoption — driving measurable reliability improvements.

Continuously refine observability architecture (metrics, traces, logs) to ensure comprehensive, actionable insights into production health.

Incident Response & Operational Excellence

Serve as a technical authority during high-impact incidents, guiding cross-functional teams through real-time mitigation and long-term prevention.

Establish and enforce best-in-class incident management frameworks, improving MTTR, MTBF, and reducing incident recurrence rates.

Lead blameless postmortems and translate findings into actionable reliability roadmaps.

Drive reliability reviews and operational readiness assessments for all major product launches.

Performance, Scalability & Cost Efficiency

Lead large-scale performance tuning and capacity engineering efforts, ensuring optimal resource utilization and cost efficiency across environments.

Identify architectural bottlenecks, drive performance benchmarking, and influence platform evolution for better scalability and elasticity.

Partner with FinOps and CloudOps to optimize spend while maintaining reliability SLAs and SLOs.

Cross-Team Leadership & Mentorship

Mentor and coach SREs and software engineers, cultivating deep reliability-first thinking across teams.

Serve as a thought leader in reliability engineering — driving best practices, evangelizing automation-first culture, and influencing technical standards across multiple teams.

Collaborate with engineering leaders, PMs, and operations to align priorities, set strategic goals, and deliver on high-impact reliability initiatives.

Lead technical deep dives and design reviews, ensuring all systems are built to scale securely and reliably.

Qualifications

Bachelor's or Master's degree in Computer Science, Engineering, or a related field.

12+ years of experience in site reliability, production engineering, or large-scale distributed system operations.

Proven track record of designing and managing highly available, globally distributed systems in cloud-native environments (AWS, Azure, GCP).

Expert-level proficiency in one or more programming / scripting languages (Python, Go, Java, Bash) for automation and tooling.

Deep understanding of Kubernetes, microservices, and service mesh architectures.

Advanced experience with Infrastructure as Code (Terraform, CloudFormation) and CI / CD automation frameworks.

Mastery in observability and monitoring stacks (Prometheus, Grafana, Datadog, OpenTelemetry).

Strong expertise in networking, storage, and distributed databases (both SQL and NoSQL).

Demonstrated ability to influence architectural decisions and drive reliability strategy across organizations.

Exceptional communication, leadership, and stakeholder management skills.

Preferred Qualifications

Experience designing reliability frameworks or SRE platforms at scale (error budgets, chaos engineering, reliability reviews).

Prior experience in high-traffic or latency-sensitive systems (media streaming, advertising, or real-time platforms).

Familiarity with big data ecosystems (Kafka, Spark, Hadoop) and large-scale data ingestion pipelines.

Hands-on experience with security, compliance, and governance in production environments (SOC2, GDPR, ISO27001).

Cloud or Kubernetes certifications (AWS Solutions Architect Professional, CKA / CKAD, GCP Professional Cloud Architect).

Published contributions or conference talks on reliability, automation, or distributed systems.

Adobe is proud to be an Equal Employment Opportunity employer. We do not discriminate based on gender, race or color, ethnicity or national origin, age, disability, religion, sexual orientation, gender identity or expression, veteran status, or any other applicable characteristics protected by law. Learn more.

Adobe aims to make Adobe.com accessible to any and all users. If you have a disability or special need that requires accommodation to navigate our website or complete the application process, email [HIDDEN TEXT] or call (408) 536-3015.

Skills Required

Java, Cloudformation, Prometheus, Go, Bash, Grafana, Datadog, Sql, Nosql, Gcp, Terraform, Azure, Kubernetes, Python, Aws

Create a job alert for this search

Site Reliability Engineer • India, Bengaluru / Bangalore

Related jobs

Site Reliability Engineer

Reyika • Bengaluru, Karnataka, India

Senior Site Reliability Engineer / Reliability Architect.Pune,Bengalore,Chennai,Pune,Noida.Reliability Architect with over 9 years of experience in proactive monitoring, automation, and observabili...Show more

Last updated: 9 days ago • Promoted

Site Reliability Engineer

Synamedia • Bengaluru, Karnataka, India

At Synamedia, the world’s most talented innovators and trailblazers are shaping the way the world is entertained and informed. We are backed by the Permira funds and Sky.This is the age of infinite ...Show more

Last updated: 17 days ago • Promoted

Site Reliability Engineer

WhiteLotus Talent Partners • Bengaluru, India

We are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure power...Show more

Last updated: 30+ days ago • Promoted

Site Reliability Engineer

Glocomms • Bengaluru, Karnataka, India

We are currently looking for an SRE Lead - to join our customer - an IT consultancy with urgent projects on board.This will be a 6 month contract initially with an option to extend further.Responsi...Show more

Last updated: 3 days ago • Promoted

Site Reliability Engineer

Delta Electronics India • Bengaluru, Karnataka, India

Define and monitor Service Level Objectives (SLOs), Service Level Indicators (SLIs), and error budgets to balance reliability with feature velocity and ensure optimal system availability.Respond to...Show more

Last updated: 6 days ago • Promoted

Lead Site Reliability Engineer

Tata Consultancy Services • Bengaluru, Republic Of India, IN

Senior Site Reliability Engineer (SRE).Senior Site Reliability Engineer (SRE).Desired Experience Range : 7 - 10 yrs.Notice Period : Immediate to 90Days only. We are currently planning to do a Virtual....Show more

Last updated: 30+ days ago • Promoted

Site Reliability Engineer

Pagos Consultants • Bengaluru, IN

This team will play a pivotal role in spearheading innovation.As such, you will have the opportunity to shape the early architecture and design of the system and set the trajectory for its future d...Show more

Last updated: 1 day ago • Promoted

Site Reliability Engineer

Synechron • Bengaluru, Karnataka, India

We have immediate opportunity for Senior Site Reliability Engineer.Senior Site Reliability Engineer.At Synechron, we believe in the power of digital to transform businesses for the better.Our globa...Show more

Last updated: 30+ days ago • Promoted

Site Reliability Engineer

GREYTIP SOFTWARE PRIVATE LIMITED • Bengaluru, India

We are looking for a skilled Site Reliability Engineer II to join our SRE team.The ideal candidate will have hands-on experience in production monitoring, alert handling, and L1 production support....Show more

Last updated: 10 days ago • Promoted

Site Reliability Engineer

ACL Digital • Bengaluru, Republic Of India, IN

ACL Digital is Hiring for the Below position.ACL Digital, part of the ALTEN Group, is a trusted AI-led, Digital & Systems Engineering Partner driving innovation by designing and building intelligen...Show more

Last updated: 30+ days ago • Promoted

Site Reliability Engineer IC3

Oracle • Bengaluru, Republic Of India, IN

Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence.Design, write, and deploy software to improve the availability, scalability, and e...Show more

Last updated: 17 days ago • Promoted

Senior Site Reliability Engineer

o9 Solutions, Inc. • Bengaluru, Karnataka, India

Be part of something revolutionary.At o9 Solutions, our mission is clear : be the Most Valuable Platform (MVP) for enterprises. With our AI-driven platform — the o9 Digital Brain — we integrate globa...Show more

Last updated: 14 days ago • Promoted

Lead Site Reliability Engineer

o9 Solutions, Inc. • Bengaluru, Republic Of India, IN

Last updated: 14 days ago • Promoted

Site Reliability Engineer

super.money • Bengaluru, Karnataka, India

Site Reliability Engineer (SRE) Level 3.A Site Reliability Engineer (SRE) Level 3 is a senior technical leadership role focused on designing, implementing, and maintaining large-scale, complex, and...Show more

Last updated: 24 days ago • Promoted

Site Reliability Engineer

Capgemini • Bangalore, IN

Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues...Show more

Last updated: 30+ days ago • Promoted

Site Reliability Engineer

HireAlpha • Bengaluru, Karnataka, India

Role-Site Reliability Engineer.We are looking for an engineer to focus on Developer Experience and who can help us design, build, and maintain high-performance, scalable, and reliable services.As C...Show more

Last updated: 30+ days ago • Promoted

Site Reliability Engineer

Enterprise Minds, Inc • Bengaluru, Karnataka, India

Senior Site Reliability Engineer (GCP | Terraform | Ansible | SRE | On-Call).Site Reliability Engineer (SRE).If you thrive in fast-paced environments, excel in incident management, and love buildin...Show more

Last updated: 30+ days ago • Promoted

Site Reliability Engineer

Landmark Group • Bengaluru, India

Ensure reliability and high availability of Java and microservices-based applications through proactive monitoring and automation. Define and track SLIs / SLOs to maintain service performance and ...Show more

Last updated: 15 days ago • Promoted