Act as the Site Reliability Engineer for global operations, ensuring system stability, scalability, and efficiency through advanced automation, observability, and proactive infrastructure management.
Provide expertise in Kubernetes, Linux, networking, and automation practices to support reliable deployments and resilient services.
Maintain a strong sense of reliability, with clear awareness of the risks and impacts that infrastructure and application changes can have.
Principal duties
Has strong knowledge of Kubernetes (including Talos) for deployment, scaling, and maintaining containerized applications.
Provides Linux administration expertise and ensures secure, efficient system operations.
Implements and maintains GitOps workflows using Flux for consistent, automated deployments.
Designs and manages infrastructure automation using Puppet and Terraform.
Ensures reliable operation of databases such as MySQL / MariaDB, Yugabyte, and MongoDB, supporting data integrity and availability.
Operates and integrates streaming platforms (Confluent, Strimzi) for event-driven and real-time processing.
Develops automation scripts and tools using Python to improve operational efficiency.
Supports and integrates solutions with Azure and hybrid / multi-cloud environments.
Builds and operates monitoring and observability systems (Datadog, Prometheus, Grafana) to ensure system health and transparency.
Designs for scalability and high availability, including disaster recovery and failover strategies.
Applies security best practices across infrastructure, applications, and data.
Evaluates risks carefully before changes, ensuring reliable rollout strategies and minimizing downtime or service disruption.
Monitors system reliability, identifies risks, and implements proactive improvements.
Collaborates with global teams to share best practices and ensure consistency across environments.
Defines and standardizes developer tooling (e.g., IDEs, code quality tools, CI / CD integrations) to ensure consistent development environments and maintain high software quality.
Manages developer workstations and operating system standards (currently Ubuntu-based), ensuring performance, security, and compatibility across the engineering organization with focus on the Asia team.
Promotes a documentation culture, ensuring clear processes, runbooks, and troubleshooting guides.
Report to the offshore Digital Manufacturing team based in Switzerland.
Create a job alert for this search
Site Reliability Engineer • Pune, Maharashtra, India
Related jobs
Promoted
Site Reliability Engineer
CapgeminiPune, IN
Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues...Show moreLast updated: 3 days ago
Promoted
Site Reliability Engineer
o9 Solutions, Inc.pune, India
Be part of something revolutionary.At o9 Solutions, our mission is clear : be the Most Valuable Platform (MVP) for enterprises.
With our AI-driven platform — the o9 Digital Brain — we integrate globa...Show moreLast updated: 8 days ago
Promoted
Site Reliability Engineer
AllianzPune
Site Reliability Engineer (SRE) - One Identity Access Management The primary objective of the Site Reliability Engineer (SRE) specializing in One Identity Access Mana...Show moreLast updated: 30+ days ago
Promoted
Site Reliability Engineer
ConfidentialBengaluru / Bangalore, Chennai, Pune
As a Senior Site Reliability Engineer, you will play a critical role in supporting application developers by providing expert guidance on Application and infrastructure best practices from reliabil...Show moreLast updated: 30+ days ago
Site Reliability Engineer
Talent WorxPune, MH, IN
Quick Apply
Site Reliability Engineer (SRE).At Talent Worx, we are looking for a dedicated Site Reliability Engineer (SRE) to join our team.
This role involves maintaining high availability and reliability of o...Show moreLast updated: 30+ days ago
Promoted
Senior Site Reliability Engineer
ConfidentialPune
The Software Engineering team delivers next-generation application enhancements and new products for a changing world.Working at the cutting edge, we design and develop software for platforms, peri...Show moreLast updated: 30+ days ago
Promoted
Reveille Technologies - Site Reliability Engineer - DevOps
Reveille TechnologiesPune
Job Summary : We are seeking a proactive and skilled Site Reliability Engineer (SRE) to join our team on a Contract-to-Hire (C2H) basis.The ideal c...Show moreLast updated: 30+ days ago
Promoted
Rosemallow Technologies - Site Reliability Engineer
ROSEMALLOW TECHNOLOGIES PRIVATE LIMITEDPune
Job Title : Site Reliability Engineer (SRE).Department : Technology / Infrastructure / DevOps.Employment Type : Full-time.Job Summary : Show moreLast updated: 30+ days ago
Promoted
Site Reliability Engineer
SFS Group India Pvt. Ltd.Pune, Maharashtra, India
Act as the Site Reliability Engineer for global operations, ensuring system stability, scalability, and efficiency through advanced automation, observability, and proactive infrastructure managemen...Show moreLast updated: 10 days ago
Promoted
Senior Site Reliability Engineer- ELK Expert
iVedha Inc.Pune, IN
Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone.
Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 30+ days ago
Promoted
Principal Site Reliability Enginee
ConfidentialPune
As a Principal Site Reliability Engineer, you will be responsible for developing sophisticated systems and software based on the customer s business goals, needs and general business environment.Yo...Show moreLast updated: 30+ days ago
Promoted
Member of Technical Staff, Site Reliability Engineer
ConfidentialPune
Build and maintain Kubernetes infrastructure and Helm charts for AKS deployments.Implement IaC solutions using Terraform and GitOps practices.
Improve observability, monitoring, and reliability of m...Show moreLast updated: 30+ days ago
Promoted
Site Reliability Engineer - Linux
Persistent SystemsPune, Maharashtra, India
We are looking for a versatile and experienced Linux & Cloud Infrastructure Engineer to join our technology team.This role involves managing and optimizing cloud infrastructure, automating system c...Show moreLast updated: 15 days ago
Promoted
Site Reliability Engineer - OpenShift
ConfidentialPune
Applies software engineering principles to the operations domain.Contributes to a service's codebase, writes automation that aids in the management of a service, and performs operational engineerin...Show moreLast updated: 30+ days ago
Promoted
Site Reliability Engineer / Architect - CI / CD Pipeline
Cling Multi SolutionsPune
Job Description : Role : Site Reliability Engineer (SRE) Location : Bangalore / Chennai / Pune (Hybrid) Experience : 5+ y...Show moreLast updated: 16 days ago
Promoted
Senior Site Reliability Engineer
IntraEdgePune, IN
Strong leadership and people management skills.Exceptional technical proficiency in Pearson's technology stack.Strategic thinking with a focus on long-term operational excellence.Champion operation...Show moreLast updated: 6 days ago
Promoted
Lead - Cloud Reliability Engineer
Searce IncPune, Maharashtra, India
The ‘process-first’ AI-native modern tech consultancy that's rewriting the rules.As an engineering-led consultancy, we are dedicated to relentlessly improving the real business outcomes.Our solvers...Show moreLast updated: 6 days ago
Promoted
TCS Is Hiring For Site Reliability Engineering (SRE)
Tata Consultancy ServicesPune, Maharashtra, India
Exp Range- 8-10 years Location- Pune / Kochi / Indore (Must have) - To Detect the Incidents and act proactively escalate using the built in dashboards.
Hands on using Dynatrace dashboards and creatio...Show moreLast updated: 8 days ago