We're looking for a self-motivated , enthusiastic , and hands-on engineer to set up solid DevOps and SRE foundations. If you thrive in a small, high-energy team and want to play a key role in shaping infrastructure and reliability at scale, this is the place for you.
We’re looking for a hands-on engineer with 3–6 years of experience who has a solid grasp of cloud infrastructure , a strong foundation in Infrastructure as Code (IaC) , and a keen eye for choosing the right tools for the job. You’ll help design, build, and scale resilient infrastructure for a fast-growing, product-driven team.
Design, build, and manage cloud infrastructure using Infrastructure as Code (IaC) tools like Terraform, Ansible, Chef, or CloudFormation .
Champion observability by defining SLIs, SLOs , and building robust monitoring, logging, and alerting systems using tools like Prometheus, Grafana , and custom telemetry.
Ensure availability, scalability, and resilience of our SaaS platform and platform services in production.
Proven ability to improve system observability through the design and instrumentation of system-level metrics , enhancing visibility into system health, performance, and bottlenecks.
Dive deep into complex system architectures to solve critical performance and reliability challenges.
Work with developers and product teams to embed NFR (Non-functional Requirements) into every product and feature release.
Conduct root cause analysis and system-level debugging (primarily on Linux ).
Build and maintain CI / CD pipelines , automating deployments and infrastructure operations across environments.
Scale infrastructure to meet growth needs while optimizing cost and performance.
Take ownership of incident response, on-call rotations , and blameless postmortems.
Collaborate cross-functionally to drive technical and architectural decision
Highly self-driven , accountable , and eager to own initiatives end-to-end. Comfortable working in startups or small teams , where flexibility, speed, and autonomy are key. Strong communication and cross-team collaboration skills.
You should apply if
Proficient in at least one programming language — Python, Java, or similar.
Demonstrated experience with performance optimization , latency reduction , and scaling services .
Strong analytical skills for incident debugging , log analysis , and system troubleshooting .
Understanding of service-level metrics (SLIs, SLOs, error budgets) and how to operationalize them.
Experience building large-scale, distributed, resilient systems .
Strong understanding of core infrastructure components such as load balancers, firewalls, and databases — including their internal workings and operational fundamentals.
Solid understanding of infrastructure cost management — proactively identifies cost drivers, implements optimization strategies, and contributes to cost reduction initiatives without compromising reliability or performance.
Familiarity with on-call responsibilities , incident management , and root cause analysis .
Strong experience with Infrastructure as Code (IaC) : Terraform, Ansible, Chef, or CloudFormation and other orchestration tools
Ability to deep-dive into third-party or internal library codebases to understand internal behavior, debug complex issues, and contribute insights or fixes when needed.
Solid understanding of cloud platforms — preferably AWS , but Azure or GCP is also acceptable.
Create a job alert for this search
Site Reliability Engineer • Chennai, Tamil Nadu, India
Related jobs
Promoted
New!
Senior Site Reliability Engineer I
RELXChennai, Tamil Nadu, India
LexisNexis Risk Solutions is looking for a Senior SRE / DevSecOps Engineer to join our collaborative and innovative SRE team.
In this role, you’ll help design, build, and maintain secure, scalable s...Show moreLast updated: 4 hours ago
Promoted
New!
Site Reliability Engineers (SREs) - Robust background in Google Cloud Platform (GCP) | RedHat OpenShift administration
UPS IndiaChennai, Tamil Nadu, India
Explore your next opportunity at a Fortune Global 500 organization.Envision innovative possibilities, experience our rewarding culture, and work with talented teams that help you become better ever...Show moreLast updated: 4 hours ago
Promoted
Senior Site Reliability Engineer
PoshmarkChennai, Tamil Nadu, India
We’re looking for an experienced Site Reliability Engineer to fill the mission-critical role of ensuring that our complex, web-scale systems are healthy, monitored, automated, and designed to scale...Show moreLast updated: 6 days ago
Promoted
Site Reliability Engineer
ExasoftChennai, IN
Responsibilities and Requirements : .Experience must be at least 10+ years in SRE.Multi Cloud, Hybrid Cloud – on Data center sites.
Experience with multiple operating systems (.Operating Systems, Kern...Show moreLast updated: 3 days ago
Promoted
Site Reliability Engineer
XebiaChennai, IN
AWS Engineer with strong Python development and Chaos Engineering expertise.The ideal candidate will combine cloud engineering, DevOps, and chaos experimentation to improve reliability, fault toler...Show moreLast updated: 29 days ago
Promoted
New!
Site Reliability Engineer III
RELXChennai, Tamil Nadu, India
We are seeking a Site Reliability Engineer (SRE) with experience in Azure and a track record of success in cloud migration project initiatives.
The successful candidate will help design and coordina...Show moreLast updated: 4 hours ago
Promoted
New!
Middle Site Reliability Engineer
MiratechChennai, Tamil Nadu, India
Our client is a global technology company with a complex microservices environment and a strong focus on system observability and reliability.
Build, automate, and maintain dashboards to monitor 30+...Show moreLast updated: 4 hours ago
Promoted
MLOps Site Reliability Engineer
KLAChennai, Tamil Nadu, India
We are seeking a highly skilled and motivated MLOps Site Reliability Engineer (SRE) to join our team.In this role, you will be responsible for ensuring the reliability, scalability, and performance...Show moreLast updated: 30+ days ago
Promoted
Site Reliability Engineer
UplersChennai, IN
Uplers is hiring for one of the clients.SRE (Oracle Cloud Infrastructure).Remote | Mon–Fri | 10 : 30 AM – 7 : 30 PM IST.Use of personal device required.
OCI cloud infrastructure using Terraform and GitL...Show moreLast updated: 27 days ago
Promoted
Site Reliability Engineer
ConcordChennai, IN
Engineers (Individual Contributors).Strong SRE (Site Reliability Engineering).CI / CD, monitoring, automation, infrastructure as code, etc.Show moreLast updated: 21 days ago
Promoted
New!
Site Reliability Engineer
TrimbleChennai, Tamil Nadu, India
We are seeking a motivated Site Reliability Engineer (SRE) Level 1 to enhance the infrastructure and operational reliability of our ERP product, specifically within Azure and Windows environments.T...Show moreLast updated: 4 hours ago
Promoted
New!
Site Reliability Engineers - Google Cloud Platform (GCP) | RedHat OpenShift administration
UPS IndiaChennai, Tamil Nadu, India
Explore your next opportunity at a Fortune Global 500 organization.Envision innovative possibilities, experience our rewarding culture, and work with talented teams that help you become better ever...Show moreLast updated: 4 hours ago
Promoted
Senior Site Reliability Engineer
Tata Consultancy ServicesChennai, Tamil Nadu, India
TCS is looking for Senior Site Reliability Engineer – AWS.Design, implement, and maintain scalable, secure, and highly available infrastructure on AWS.
Develop and improve CI / CD pipelines, Infrastru...Show moreLast updated: 8 days ago
Promoted
New!
Senior Site reliability Engineer
RELXChennai, Tamil Nadu, India
We’re looking for an experienced Site Reliability Engineer (SRE) to join our team.In this role, you’ll work on meaningful projects that improve the reliability, performance, and efficiency of our s...Show moreLast updated: 4 hours ago
Promoted
New!
Senior Site Reliability Engineer II
RELXChennai, Tamil Nadu, India
DevOps / Site Reliability Engineer (SRE).Whether your background is software engineering or SRE-focused, what matters most is your ability to automate, optimize, and improve systems through smart scr...Show moreLast updated: 4 hours ago
Promoted
New!
Site Reliability Operations Engineer - India
FinalsiteChennai, Tamil Nadu, India
Finalsite is the preferred website, communications, enrollment, and marketing platform of more than 7,000 schools and school districts in 119 countries around the world.
The company's people, produc...Show moreLast updated: 4 hours ago
Promoted
New!
Senior Site Reliability Engineer
AthenahealthChennai, Tamil Nadu, India
Join us as we work to create a thriving ecosystem that delivers accessible, high-quality, and sustainable healthcare for all.
We are looking for a Senior Site Reliability Engineer to join our Servic...Show moreLast updated: 4 hours ago
Promoted
New!
Senior Site Reliability Engineer
SaamaChennai, Tamil Nadu, India
Job Title : Senior Site Reliability Engineer.We are seeking a highly motivated and experienced Site Reliability Engineer to join our team.
As a Site Reliability Engineer, you will be responsible for ...Show moreLast updated: 4 hours ago