Talent.com
Lead Site Reliability Engineer
Trimble • Chennai, Tamil Nadu, India
3 days ago
Job description

Site Reliability Engineer

Reporting to : Sr Manager Availability Management

Office Location : Chennai, India

Flexible Working : Hybrid (Part Office / Part Home)

Cloud Site Reliability Engineer Responsibilities

AI in Observability : Make heavy use of migration tooling and AI to eliminate key tasks and to optimise the collection, analysis, pre-configuration, and implementation of alerts and dashboard configurations.

Create and maintain the AI agents required to perform onboardings, migrations, and cost prevention tasks.

Platform Management & Scalability : Maintain the entire observability platform that our customers rely on. This includes managing the OTel gateway and ensuring the observability platform is available and meets our customers' needs.

Performance & Cost Optimization : Proactively monitor and optimize the platform's performance and efficiency so that it remains performant and economically viable as our observability grows.

Operational Excellence & Automation : Develop and maintain automation scripts and tools to streamline core SRE tasks such as managing customer accounts, provisioning new data sources, and deploying updates. You'll use Infrastructure as Code (IaC) tools like Terraform to ensure consistent and reproducible delivery.

Feature Enablement & Support : Work closely with engineering teams to ensure new features are built with reliability and scalability in mind. You'll also provide technical expertise to assist the business in resolving complex issues and understanding platform capabilities.

Problem Management : You will log problems that impact customer experience and carry out thorough root cause analysis to prevent future incidents.

Observability Centre of Excellence : Play a key role in the Observability Centre of Excellence, working closely with engineering teams to advise on best practices and standards for products and platforms to maximise product availability, reliability, resiliency, and security.

Availability Management : Onboard internal customers to our 24x7 Applications Support and Enterprise Status Page services.

Skills & Experience

Observability : Proficiency with monitoring, logging, and tracing tools such as New Relic (preferred), Datadog, or Splunk.

OpenTelemetry : Hands-on experience with the OpenTelemetry framework, including the APIs, SDKs, and the OpenTelemetry Collector, would be beneficial.

Coding and Scripting : Proficiency in at least one high-level language like Python, Go, or Java is crucial for building automation tools and monitoring scripts. Shell scripting (e.g. Bash) is also desired for system administration tasks. Experience with coding assistants like Copilot.

Infrastructure as Code (IaC) : Skills in tools like Terraform, Ansible, or Puppet are vital for automating the provisioning and management of infrastructure, ensuring consistency and repeatability.

Cloud Platforms : Solid expertise with a major cloud provider such as AWS, Google Cloud Platform (GCP), or Azure is expected.

Containerization and Orchestration : A command of Docker for containerizing applications and Kubernetes for managing those containers at scale.

No travel required

How to Apply : Please submit an online application for this position by clicking on the Apply Now button located in this posting.

Application Deadline : Applications could be accepted until at least 30 days from the posting date.

Join a Values-Driven Team : Belong, Grow, Innovate.

At Trimble, our core values of Belong, Grow, and Innovate aren't just words; they're the foundation of our culture. We foster an environment where you are seen, heard, and valued (Belong); where you have an opportunity to build a career and drive our collective growth (Grow); and where your innovative ideas shape the future (Innovate). We believe in empowering local teams to create impactful strategies, ensuring our global vision resonates with every individual. Become part of a team where your contributions truly matter.

Trimble's Privacy Policy

If you need assistance or would like to request an accommodation in connection with the application process, please contact

Key Skills

Kubernetes, FMEA, Continuous Improvement, Elasticsearch, Go, Root Cause Analysis, Maximo, CMMS, Maintenance, Mechanical Engineering, Manufacturing, Troubleshooting

Employment Type : Full-Time

Experience : years

Vacancy : 1
