SFS Group India Pvt. Ltd.Pune, Republic Of India, IN
12 days ago
Job description
Objectives
Act as the Site Reliability Engineer for global operations, ensuring system stability, scalability, and efficiency through advanced automation, observability, and proactive infrastructure management.
Provide expertise in Kubernetes, Linux, networking, and automation practices to support reliable deployments and resilient services.
Maintain a strong sense of reliability, with clear awareness of the risks and impacts that infrastructure and application changes can have.
Principal duties
Has strong knowledge of Kubernetes (including Talos) for deployment, scaling, and maintaining containerized applications.
Provides Linux administration expertise and ensures secure, efficient system operations.
Implements and maintains GitOps workflows using Flux for consistent, automated deployments.
Designs and manages infrastructure automation using Puppet and Terraform.
Ensures reliable operation of databases such as MySQL / MariaDB, Yugabyte, and MongoDB, supporting data integrity and availability.
Operates and integrates streaming platforms (Confluent, Strimzi) for event-driven and real-time processing.
Develops automation scripts and tools using Python to improve operational efficiency.
Supports and integrates solutions with Azure and hybrid / multi-cloud environments.
Builds and operates monitoring and observability systems (Datadog, Prometheus, Grafana) to ensure system health and transparency.
Designs for scalability and high availability, including disaster recovery and failover strategies.
Applies security best practices across infrastructure, applications, and data.
Evaluates risks carefully before changes, ensuring reliable rollout strategies and minimizing downtime or service disruption.
Monitors system reliability, identifies risks, and implements proactive improvements.
Collaborates with global teams to share best practices and ensure consistency across environments.
Defines and standardizes developer tooling (e.G., IDEs, code quality tools, CI / CD integrations) to ensure consistent development environments and maintain high software quality.
Manages developer workstations and operating system standards (currently Ubuntu-based), ensuring performance, security, and compatibility across the engineering organization with focus on the Asia team.
Promotes a documentation culture, ensuring clear processes, runbooks, and troubleshooting guides.
Report to the offshore Digital Manufacturing team based in Switzerland.
Create a job alert for this search
Site Reliability Engineer • Pune, Republic Of India, IN
Related jobs
Promoted
Senior Site Reliability Engineer- ELK Expert
iVedha Inc.Nagpur, IN
Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone.
Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 30+ days ago
Promoted
MLOps Engineer
X4 TechnologyNagpur, IN
MLOps Engineer - Role & Responsibilities.Design, deploy and manage scalable & secure cloud infrastructure.Apply least privilege across cloud platforms (Azure, RBAC, AWS IAM).Enable audit logging co...Show moreLast updated: 17 days ago
Promoted
QA Engineer
SupplyHouseNagpur, India
Remote
At G enerosity, R espect, I nnovation, Show moreLast updated: 30+ days ago
Promoted
MLOps Lead Engineer
RecroNagpur, IN
Experience with Azure services such as Azure AI services, Azure Search, Azure ML, Databricks, Azure Kubernetes Service, and AWS services like AWS SageMaker, AWS Bedrock and AWS Lambda.Exposure to G...Show moreLast updated: 16 days ago
Promoted
Technical Lead
ThumoNagpur, IN
Founding Engineer @ Thumo (Africa’s first super-app).We’re building Africa’s super-app, starting with food delivery.M funding round led by Soma Capital with top Silicon Valley angels, we’re hiring ...Show moreLast updated: 30+ days ago
Promoted
Site Reliability Engineer
CapgeminiNagpur, IN
Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues...Show moreLast updated: 5 days ago
EC-Energy Events is looking for an experienced Rotating Equipment Reliability Consultant / Trainer to join our growing pool of experts supporting technical conferences, training programs, and worksho...Show moreLast updated: 27 days ago
Promoted
Resident Engineer – Kubernetes & Portworx
CMK Resources, Inc.Nagpur, IN
CMK Resources Resident Engineer – Kubernetes & Portworx (3 openings).Help Shape the Future of Kubernetes Storage.Our client's largest and most strategic customer is moving VMware-based workloads to...Show moreLast updated: 30+ days ago
Promoted
DevOps / Platform Engineer
iVedha Inc.Nagpur, IN
Hiring a seasoned DevOps / Platform Engineer to drive automation, platform reliability, and robust.Design, deploy, and manage CI / CD pipelines and infrastructure automation, leveraging AI for.Implemen...Show moreLast updated: 30+ days ago
Promoted
Emulation Engineer / Lead
eInfochips (An Arrow Company)Nagpur, IN
Role : Emulation Engineer / Lead.Job Location : Noida, Chennai, Bangalore, Hyderabad, Ahmedabad.You must be having BS or MS in Electrical OR Electronics engineering.
Minimum 4+ Years of Emulation Expe...Show moreLast updated: 30+ days ago
Promoted
Senior MLOps Engineer
Mitchell Martin Inc.Nagpur, IN
Include, but are not limited to, the following : .Own productionizing models—from tracked experiments to governed releases—ensuring resilient services with clear SLOs, runbooks, and fast, safe rollba...Show moreLast updated: 30+ days ago
Promoted
Delinea Implementation Engineer
K&K Talents - IndiaNagpur, IN
This position is with one of our.Title : Delinea Implementation Engineer.Employment Type : Full-time Permanent.Delinea Implementation Engineer.
Delinea (formerly Thycotic & Centrify) Privileged Access...Show moreLast updated: 8 days ago
Promoted
Site Reliability Engineer - CI / CD Pipeline
Hashone CareersIndia
We are looking for a skilled Site Reliability Engineer (SRE) with a strong DevOps background and deep expertise in Google Cloud Platform (GCP).
The ideal candidate will be responsible for ensuring t...Show moreLast updated: 30+ days ago
Promoted
Deployment Engineer
AvocaNagpur, IN
Build, launch & optimize AI agents that power the next generation of home-service customer experiences.Avoca is the all-in-one AI lead-conversion platform.
Our technology boosts booking rates, slash...Show moreLast updated: 30+ days ago
Promoted
Senior Site Reliability Engineer
IntraEdgeIndia
Strong leadership and people management skills.Exceptional technical proficiency in Pearson's technology stack.Strategic thinking with a focus on long-term operational excellence.Champion operation...Show moreLast updated: 8 days ago
Promoted
Sr. Engineer, Site Reliability
Intelindia, India
Do you want to innovate an industry leading developer cloud? Join SATG as a Sr.The cloud development division within Software and Advanced Technology Group (SATG) is developing and shaping the way ...Show moreLast updated: 30+ days ago
Promoted
Site Reliability Engineer
o9 Solutions, Inc.Nagpur, Maharashtra, India
Be part of something revolutionary At o9 Solutions, our mission is clear : be the Most Valuable Platform (MVP) for enterprises.
With our AI-driven platform — the o9 Digital Brain — we integrate globa...Show moreLast updated: 3 days ago
Promoted
Site Reliability Engineer-II
Bloomreachindia, India
Improve and manage infrastructure to drive efficiency and scalability.Write and review code, develop documentation, capacity plans, and optimize service costs.
Set up Service Level Indicators (SLIs)...Show moreLast updated: 30+ days ago