Talent.com
Senior Site Reliability Engineer

Sapaad, Mumbai, Maharashtra, India
2 days ago
Job description

WHO WE ARE

Sapaad is a global leader in unified commerce platforms, delivering world-class software solutions for the food and beverage industry. Our flagship product, also named Sapaad, has achieved remarkable success over the past decade, empowering thousands of F&B businesses across 40+ countries, with many more coming on board each day.

Driven by a passionate team of developers, designers, and product experts, Sapaad is constantly evolving—introducing innovative, industry-defining features that set the benchmark for F&B tech. Headquartered in Singapore, with offices across five countries, Sapaad is backed by seasoned technology veterans with deep expertise in web, mobility, and e-commerce.

JOB OVERVIEW

Sapaad Software Private Limited is seeking a Senior Site Reliability Engineer (SRE) to lead our infrastructure reliability efforts and mentor a growing SRE team.

This is a strategic, hands-on leadership position responsible for ensuring the reliability, scalability, and performance of our global cloud-based restaurant management platform serving thousands of customers worldwide.

As a senior member of our engineering organization, you will take ownership of system availability, drive automation initiatives, and establish SRE best practices across the company. You’ll work at the intersection of development and operations—embedding reliability into every layer of our technology stack while building and leading a team focused on operational excellence.

This role is ideal for an experienced SRE professional who is passionate about building resilient systems at scale, mentoring engineering talent, and shaping the reliability culture of a fast-growing SaaS organization.

WHAT YOU’LL DO

  • Own the reliability, availability, and performance of all production systems supporting our multi-tenant SaaS platform.
  • Define and manage SLIs, SLOs, and error budgets across critical services.
  • Architect and implement highly available, fault-tolerant systems on AWS and Heroku.
  • Proactively monitor and analyze performance to predict capacity needs and prevent issues.
  • Lead incident management and postmortem processes, driving root cause analysis and preventive actions.
  • Develop incident response playbooks, implement chaos engineering, and reduce MTTD and MTTR.
  • Design and implement comprehensive observability solutions, including monitoring, logging, and alerting, for microservices and distributed systems.
  • Enforce security and compliance standards, including access controls, vulnerability management, and patching.
  • Mentor and lead SRE and infrastructure engineers, driving team growth, knowledge sharing, and operational maturity.
  • Collaborate with development, DevOps, and product teams to embed reliability practices into every stage of the software lifecycle.

YOU’RE A STRONG FIT IF YOU HAVE

  • 5–8 years of experience in SRE, DevOps, or Systems Engineering roles within SaaS or cloud-based environments.
  • 2+ years in a technical leadership or mentoring capacity.
  • Proven experience maintaining large-scale, high-availability systems (99.9%+ uptime).
  • Expertise with AWS (EC2, RDS, S3, ECS/EKS, Lambda) and Heroku.
  • Proficiency in Infrastructure as Code (Terraform, CloudFormation) and containerization (Docker, Kubernetes).
  • Strong scripting and automation skills in Python, Bash, or PowerShell.
  • Experience with CI/CD pipelines (Jenkins, GitLab CI, GitHub Actions) and configuration management tools (Chef, Ansible, Puppet).
  • Deep understanding of SRE principles: SLIs, SLOs, toil reduction, blameless postmortems, and incident management frameworks.
  • Hands-on experience with monitoring tools (Prometheus, Grafana, Datadog, New Relic, CloudWatch, ELK).
  • Excellent leadership, analytical, and communication skills with a customer-first mindset.

PREFERRED QUALIFICATIONS

  • AWS Certified Solutions Architect – Associate or Professional certification.
  • Experience with SOC 2, ISO 27001, GDPR, or PCI DSS compliance frameworks.
  • Background in microservices architectures, disaster recovery planning, or cost optimization.
  • Experience in the restaurant, hospitality, or retail technology sectors.