Talent.com
No longer accepting applications
Lead Site Reliability Engineer

Futurism Technologies, INC., Mumbai, India
1 day ago
Job description

Job Title : Site Reliability Engineering (SRE) Lead

Location : Hinjewadi Phase-1 (WFO)

Experience : 7+ years of experience

Shift Time : 11:00 AM to 8:00 PM

Working Days : Monday to Friday

About the Role

We are seeking a highly skilled and experienced SRE Lead to drive the reliability, scalability, and performance of our multi-cloud infrastructure spanning AWS and Azure. You will lead a team responsible for building and maintaining automated deployment pipelines, infrastructure as code, and observability systems using GitHub Actions, Terraform, and Datadog.

As the SRE Lead, you will collaborate closely with development, operations, and security teams to ensure our services are highly available, secure, and performant, while fostering a culture of automation, monitoring, and continuous improvement.

Key Responsibilities

  • Lead and mentor a team of SREs to design, build, and maintain reliable, scalable, and secure cloud infrastructure across AWS and Azure.
  • Architect and implement Infrastructure as Code (IaC) solutions primarily using Terraform to manage multi-cloud environments efficiently.
  • Develop, maintain, and optimize CI/CD pipelines leveraging GitHub Actions to enable fast and reliable software delivery.
  • Establish and drive best practices in site reliability, monitoring, alerting, and incident response using Datadog and other observability tools.
  • Collaborate with software engineering teams to improve system reliability through automation, load testing, and performance tuning.
  • Define and track SLOs, SLIs, and error budgets; lead incident retrospectives and continuous improvement initiatives.
  • Manage cloud resource costs and optimize usage across multiple cloud providers.
  • Promote a DevOps culture emphasizing automation, continuous deployment, and proactive incident management.
  • Stay current with the latest industry trends and technologies in cloud, automation, and SRE practices.

Required Skills

  • 7+ years of experience in Site Reliability Engineering, DevOps, or cloud infrastructure roles.
  • Experience implementing dashboards to monitor and track SLOs, SLIs, and error budgets, and leading incident retrospectives and continuous improvement initiatives.
  • Proven experience leading and mentoring engineering teams.
  • Strong hands-on experience with AWS and Azure cloud platforms.
  • Expert in Infrastructure as Code using Terraform with multi-cloud deployments.
  • Proficient in building and managing CI/CD pipelines using GitHub Actions.
  • Deep knowledge of monitoring and observability tools, especially Datadog.
  • Solid understanding of networking, security, container orchestration (Kubernetes is a plus), and cloud-native architectures.
  • Strong scripting and automation skills (Python, Bash, or similar).
  • Experience with incident management, root cause analysis, and capacity planning.
  • Excellent communication, leadership, and collaboration skills.

Technical Skills

  • IaC : Terraform
  • CI/CD : GitHub Actions, Git workflow, and Argo CD
  • Observability : Datadog, Prometheus, and Fluent Bit
  • Pod Orchestration : EKS and EKS Fargate
  • Cloud : AWS and Azure

Preferred

  • Certifications such as AWS Certified DevOps Engineer, Azure DevOps Engineer, or HashiCorp Terraform Associate.
  • Experience with Kubernetes and service mesh technologies.
  • Familiarity with chaos engineering and resilience testing.
  • Knowledge of security best practices in cloud environments.