Act as the Site Reliability Engineer for global operations, ensuring system stability, scalability, and efficiency through advanced automation, observability, and proactive infrastructure management.
Provide expertise in Kubernetes, Linux, networking, and automation practices to support reliable deployments and resilient services.
Maintain a strong sense of reliability, with clear awareness of the risks and impacts that infrastructure and application changes can have.
Principal duties
Has strong knowledge of Kubernetes (including Talos) for deploying, scaling, and maintaining containerized applications.
Provides Linux administration expertise and ensures secure, efficient system operations.
Implements and maintains GitOps workflows using Flux for consistent, automated deployments.
Designs and manages infrastructure automation using Puppet and Terraform.
Ensures reliable operation of databases such as MySQL/MariaDB, Yugabyte, and MongoDB, supporting data integrity and availability.
Operates and integrates streaming platforms (Confluent, Strimzi) for event-driven and real-time processing.
Develops automation scripts and tools using Python to improve operational efficiency.
Supports and integrates solutions with Azure and hybrid/multi-cloud environments.
Builds and operates monitoring and observability systems (Datadog, Prometheus, Grafana) to ensure system health and transparency.
Designs for scalability and high availability, including disaster recovery and failover strategies.
Applies security best practices across infrastructure, applications, and data.
Evaluates risks carefully before changes, ensuring reliable rollout strategies and minimizing downtime or service disruption.
Monitors system reliability, identifies risks, and implements proactive improvements.
Collaborates with global teams to share best practices and ensure consistency across environments.
Defines and standardizes developer tooling (e.g., IDEs, code quality tools, CI/CD integrations) to ensure consistent development environments and maintain high software quality.
Manages developer workstations and operating system standards (currently Ubuntu-based), ensuring performance, security, and compatibility across the engineering organization, with a focus on the Asia team.
Promotes a documentation culture, ensuring clear processes, runbooks, and troubleshooting guides.
Reports to the offshore Digital Manufacturing team based in Switzerland.
Site Reliability Engineer • Pune, Maharashtra, India