A proactive Site Reliability Engineer (SRE) to ensure 99.99% uptime for our scalable, multi-tier microservices platform.
You will troubleshoot both networking and application uptime issues, supporting seamless service delivery.
Key Responsibilities :
Maintain strict SLOs (99.99% uptime) across distributed systems including Redis, Golang services, and DocDB.
Diagnose and resolve complex application and network issues, including DNS troubleshooting and network latency.
Use monitoring and observability tools such as Kibana, Grafana, Instana, and Dynatrace for proactive incident detection.
Automate infrastructure and workflows with Python, Bash, Terraform, and Ansible.
Manage container orchestration on AWS Elastic Kubernetes Service (EKS) and Red Hat OpenShift, ensuring high availability and scalability.
Collaborate with development and QA teams to embed reliability best practices and improve system observability.
Participate in on-call rotations, incident response, and blameless postmortems.
Document runbooks and mentor junior engineers on SRE and networking fundamentals.
(ref : hirist.tech)
Create a job alert for this search
Site Reliability Engineer • Thane
Related jobs
Promoted
Site Reliability Engineer
Haysmumbai, maharashtra, in
Required skills and qualifications.Experience : Proven experience in technical support or engineering, preferably in AI / ML / GenAI environments.
Technical Proficiency : Expertise in GenAI models (e.GPT,...Show moreLast updated: 24 days ago
Promoted
RELX - Site Reliability Engineer - IAC Terraform
REED ELSEVIER INDIA (a part of RELX India Pvt Ltd)Mumbai
Job Description : - Lead initiatives to identify and eliminate manual, repetitive tasks through automation and tooling.Develop s...Show moreLast updated: 18 days ago
Promoted
Akasa Air - Site Reliability Engineer
SNV AVIATION PRIVATE LIMITED / Akasa AirMumbai
As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our systems and infrastructure.
This includes troubleshooting issues, developing and maintaini...Show moreLast updated: 19 days ago
Promoted
Site Reliability Engineer
ConcordKalyan-Dombivli, IN
Engineers (Individual Contributors).Strong SRE (Site Reliability Engineering).CI / CD, monitoring, automation, infrastructure as code, etc.Show moreLast updated: 17 days ago
Promoted
Senior Site Reliability Engineer
WSO2mumbai city, maharashtra, in
Founded in 2005, WSO2 is the largest independent software vendor providing open-source API management, integration, and identity and access management (IAM) to thousands of enterprises in over 90 c...Show moreLast updated: 7 days ago
Promoted
Site Reliability Engineer
XebiaMumbai, IN
AWS Engineer with strong Python development and Chaos Engineering expertise.The ideal candidate will combine cloud engineering, DevOps, and chaos experimentation to improve reliability, fault toler...Show moreLast updated: 25 days ago
Promoted
Site Reliability Engineer / Lead - CI / CD Pipeline
SolutionTech HRMumbai
Key Responsibilities : - Lead and mentor a team of SREs / DevOps Engineers, fostering a culture of ownership, reliability,...Show moreLast updated: 6 days ago
Promoted
Site Reliability Engineer - Docker / Kubernetes
hirezy.aiMumbai
Technical Skills : - Programming : Proficiency in languages like Python, Bash, or Java is essential.Operating Systems : ...Show moreLast updated: 27 days ago
Promoted
Site Reliability Engineer
Amicon Hub Servicesmumbai, maharashtra, in
Manage and scale production systems hosted on.Automate operational tasks using.Improve system reliability and reduce manual interventions through automation.
Collaborate with development teams to en...Show moreLast updated: 5 days ago
Promoted
Site Reliability Engineer - Chaos Management
Xebiamumbai, maharashtra, in
AWS Engineer with strong Python development and Chaos Engineering expertise.The ideal candidate will combine cloud engineering, DevOps, and chaos experimentation to improve reliability, fault toler...Show moreLast updated: 7 days ago
Promoted
DevOps Engineer / SRE
SuprSendmumbai city, maharashtra, in
SuprSend is redefining notification infrastructure for businesses, enabling seamless communication at scale.Our platform ensures reliability, scalability, and efficiency in delivering notifications...Show moreLast updated: 6 days ago
Promoted
Site Reliability Engineer
UplersThane, IN
Uplers is hiring for one of the clients.SRE (Oracle Cloud Infrastructure).Remote | Mon–Fri | 10 : 30 AM – 7 : 30 PM IST.Use of personal device required.
OCI cloud infrastructure using Terraform and GitL...Show moreLast updated: 23 days ago
Promoted
Site Reliability Engineer - Observability Services
TeamWare SolutionsMumbai
Role Summary : We are seeking a highly skilled Site Reliability Engineer (SRE) with a strong focus on observability.The ideal candidate will have 5-8 years of experie...Show moreLast updated: 30+ days ago
Promoted
Docsumo - Senior DevOps / Site Reliability Engineer - Python
DocsumoMumbai
About Docsumo : Docsumo is a Document Workflow platform that converts unstructured documents (like bank statements, financials, policies) into structured, actionable ...Show moreLast updated: 9 days ago
Promoted
Lead Site Reliability Engineer - Cloud Computing
NeemtreeMumbai
Responsibilities : - Team Leadership : Manage and mentor a team of SREs, assigning tasks, providing technical guidance, and fostering a culture of collaboration and ...Show moreLast updated: 4 days ago
Promoted
Azilen Technologies - Site Reliability Engineer - Cloud Technologies
Azilen Technologies Pvt LtdMumbai
About the job : Who you are : - Deployment of large distributed application in Production / Staging environment Show moreLast updated: 30+ days ago
Promoted
Senior Site Reliability Engineer- ELK Expert
iVedha Inc.Thane, IN
Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone.
Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 30+ days ago
Promoted
DevOps / Platform Engineer
iVedha Inc.Kalyan-Dombivli, IN
Hiring a seasoned DevOps / Platform Engineer to drive automation, platform reliability, and robust.Design, deploy, and manage CI / CD pipelines and infrastructure automation, leveraging AI for.Implemen...Show moreLast updated: 30+ days ago