Talent.com
Lead Reliability and Infrastructure Engineer
Lead Reliability and Infrastructure EngineerGroww • Bengaluru, Republic Of India, IN
Lead Reliability and Infrastructure Engineer

Lead Reliability and Infrastructure Engineer

Groww • Bengaluru, Republic Of India, IN
30+ days ago
Job description

About Groww

We are a passionate group of people focused on making financial services accessible to every Indian through a multi-product platform. Each day, we help millions of customers take charge of their financial journey. Customer obsession is in our DNA. Every product, every design, every algorithm down to the tiniest detail is executed keeping the customers’ needs and convenience in mind. Our people are our greatest strength. Everyone at Groww is driven by ownership, customer-centricity, integrity and the passion to constantly challenge the status quo.

Are you as passionate about defying conventions and creating something extraordinary as we are? Let’s chat.

Our Vision

Every individual deserves the knowledge, tools, and confidence to make informed financial decisions. At Groww, we are making sure every Indian feels empowered to do so through a cutting-edge multi-product platform offering a variety of financial services.

Our long-term vision is to become the trusted financial partner for millions of Indians.

Our Values

Our culture enables us to be what we are — India’s fastest-growing financial services company. It fosters an environment where collaboration, transparency, and open communication take center-stage and hierarchies fade away. There is space for every individual to be themselves and feel motivated to bring their best to the table, as well as craft a promising career for themselves.

The values that form our foundation are :

  • Radical customer centricity
  • Ownership-driven culture
  • Keeping everything simple
  • Long-term thinking
  • Complete transparency

Expertise and Qualifications

We are seeking a highly motivated and experienced Senior Site Reliability Engineer to join our engineering team. As an SRE, you will be responsible for ensuring the reliability, availability, scalability, and performance of our applications and infrastructure. You will collaborate closely with software developers, platform engineers, and other team members to design, provision, build, and maintain systems that are scalable, secure, and highly available.

What will make you a great fit for the role :

  • 6–9 years of experience in SRE, DevOps, or system architecture roles with large-scale production systems.
  • Extensive experience managing and scaling high-traffic, low-latency fintech systems, ensuring reliability, compliance, and secure transaction processing.
  • Proven expertise in the networking stack, with hands-on experience in BGP, OSPF, DNS, HTTP(S), TCP / IP, MPLS, and VPN protocols.
  • Advanced knowledge of GCP networking (VPC design, Shared VPC, Private Service Connect, Global Load Balancers, Cloud DNS, Cloud NAT, Network Intelligence Center, and Service Mesh).
  • Strong background in managing complex multi-cloud environments (AWS, GCP, Azure) with a focus on secure and compliant architectures in regulated industries.
  • Hands-on expertise in Terraform and Infrastructure-as-Code (IaC) for repeatable, automated deployments.
  • Expertise in Kubernetes, container orchestration, and microservices, with production experience in regulated fintech environments.
  • Advanced programming and scripting skills in Python, Go, or Java, applied to automation, risk reduction, and financial system resilience.
  • Proficiency with monitoring and logging tools (Prometheus, Mimir, Grafana, Loki) to ensure real-time visibility into trading, payments, and transaction flows.
  • Strong understanding of networking, load balancing, and DNS management across multi-cloud and hybrid infrastructures.
  • Implemented end-to-end observability solutions (metrics, logs, and traces) to monitor and optimize transaction throughput, adhering to latency SLAs.
  • Leadership skills with experience mentoring teams, fostering a culture of reliability, and partnering with cross-functional stakeholders in product teams.
  • Strong communication, critical thinking, and incident management abilities, especially in high-stakes production incidents involving customer transactions.
  • Bachelor’s or Master’s degree in Computer Science, Engineering, or equivalent experience.
  • What you’ll do :

  • Architect and lead the design of scalable, reliable infrastructure solutions.
  • Implement strategies for high availability, scalability, and low-latency performance.
  • Define service-level objectives (SLOs) and service-level indicators (SLIs) to track performance and reliability.
  • Drive incident management by identifying root causes and providing long-term solutions.
  • Mentor junior engineers and foster a collaborative, learning-focused environment.
  • Design advanced monitoring and alerting systems for proactive system management.
  • Architect and optimize network topologies (hybrid cloud, multi-cloud, and on-prem) to support ultra-low-latency trading and compliance-driven workloads.
  • Configure and manage cloud and on-prem networking components (VPCs, Shared VPCs, Private Service Connect, Cloud NAT, and Global Load Balancers) for secure and compliant transaction flows.
  • Implement secure connectivity solutions (VPNs, Interconnect, Direct Connect, and service meshes) to meet fintech regulatory requirements and standards.
  • Develop and maintain DNS, load-balancing, and traffic-routing strategies to ensure millisecond-level latency for real-time transactions.
  • Evolve Infrastructure as Code (IaC) practices and principles to automate infrastructure provisioning.
  • Collaborate on reliability roadmaps, performance benchmarks, and disaster recovery plans tailored for low-latency and high-throughput workloads.
  • Manage Kubernetes clusters at scale, integrating service meshes like Istio or Linkerd.
  • Implement chaos engineering principles to strengthen system resilience.
  • Influence technical direction, reliability culture, and organizational strategies.
  • Create a job alert for this search

    Reliability Engineer • Bengaluru, Republic Of India, IN

    Related jobs
    Lead Reliability and Qualification Engineer

    Lead Reliability and Qualification Engineer

    L&T Semiconductor Technologies • Bengaluru, Karnataka, India
    Role : Lead - Reliability and Qualification Engineer.Reliability and Qualification Engineering within the Semiconductor IC development field. Lead Reliability and Qualification Engineer will be respo...Show more
    Last updated: 23 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    ACL Digital • Bangalore, IN
    ACL Digital is Hiring for the Below position.ACL Digital, part of the ALTEN Group, is a trusted AI-led, Digital & Systems Engineering Partner driving innovation by designing and building intelligen...Show more
    Last updated: 13 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    WhiteLotus Talent Partners • bangalore, karnataka, in
    L0 and L1 Site Reliability Engineer (SRE) Support.Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by. In this role, you will focu...Show more
    Last updated: 30+ days ago • Promoted
    Lead Infrastructure Reliability Engineer

    Lead Infrastructure Reliability Engineer

    Allegion India • Bengaluru, Republic Of India, IN
    Allegion is a global leader in security products and solutions, dedicated to creating safer environments for homes and businesses. With a focus on innovation and technology, Allegion develops and ma...Show more
    Last updated: 5 days ago • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Synechron • hosur, tamil nadu, in
    We have immediate opportunity for.SRE (Senior Site Reliability Engineer) 5+ years.SRE (Senior Site Reliability Engineer). We began life in 2001 as a small, self-funded team of technology specialists...Show more
    Last updated: 15 hours ago • Promoted • New!
    Site Reliability Engineer

    Site Reliability Engineer

    JRD Systems • Bengaluru, Karnataka, India
    Site Reliability Engineer (Windows / Cloud / Automation).We are seeking an experienced Site Reliability Engineer with a strong background in managing Windows infrastructure and cloud environments.T...Show more
    Last updated: 30+ days ago • Promoted
    Cloud Infrastructure Reliability Engineer

    Cloud Infrastructure Reliability Engineer

    Oracle • Bengaluru, Republic Of India, IN
    Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence.Design, write, and deploy software to improve the availability, scalability, and e...Show more
    Last updated: 7 days ago • Promoted
    Lead Site Reliability Engineer

    Lead Site Reliability Engineer

    Media.net • Bengaluru, Karnataka, India
    Our proprietary contextual technology is at the forefront of enhancing Programmatic buying, the latest industry standard in ad buying for digital platforms. HQ is based in New York, and the Global H...Show more
    Last updated: 2 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Flipkart • Bengaluru, Karnataka, India
    Hiring Site Reliability Engineers.The engineer will work in the Reliability and Productivity Engineering team and is responsible for building industry standard large scale platforms to be utilised ...Show more
    Last updated: 4 days ago • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Peoplefy • hosur, tamil nadu, in
    We’re looking for an SRE who can.Define SLIs / SLOs for Tier-0 / Tier-1 services & review quarterly.Change gating via CI / CD based on error budgets. Azure Monitor / Grafana / Prometheus / App Insights da...Show more
    Last updated: 15 hours ago • Promoted • New!
    Site Reliability Engineer

    Site Reliability Engineer

    VXI Global Solutions • hosur, tamil nadu, in
    We are looking for a Site Reliability Engineer with 3+ years for Experience into design, implement, and manage robust observability solutions across our cloud infrastructure and applications.The id...Show more
    Last updated: 14 hours ago • Promoted • New!
    Infrastructure Reliability Engineer

    Infrastructure Reliability Engineer

    Flipkart • Bengaluru, Republic Of India, IN
    Hiring Site Reliability Engineers.The engineer will work in the Reliability and Productivity Engineering team and is responsible for building industry standard large scale platforms to be utilised ...Show more
    Last updated: 4 days ago • Promoted
    Lead Engineer

    Lead Engineer

    Hyqoo • Bengaluru, IN
    Design, deploy, and manage AWS cloud infrastructure, including EC2 instances, S3 buckets, VPCs, RDS databases, and Lambda functions. Assist in the design, implementation, and maintenance of backup, ...Show more
    Last updated: 9 days ago • Promoted
    System Reliability Engineer

    System Reliability Engineer

    Andromeda Security • Bengaluru, Karnataka, India
    We are seeking an experienced Site Reliability Engineer (SRE) with a strong background in DevOps technologies and cloud infrastructure. The ideal candidate will have hands-on experience with Kuberne...Show more
    Last updated: 30+ days ago • Promoted
    Principal Infrastructure Reliability Engineer

    Principal Infrastructure Reliability Engineer

    Palo Alto Networks • Bengaluru, Republic Of India, IN
    At Palo Alto Networks® everything starts and ends with our mission : .Being the cybersecurity partner of choice, protecting our digital way of life. Our vision is a world where each day is safer and m...Show more
    Last updated: 30+ days ago • Promoted
    Infrastructure Reliability Engineer

    Infrastructure Reliability Engineer

    Andromeda Security • Bengaluru, Republic Of India, IN
    We are seeking an experienced Site Reliability Engineer (SRE) with a strong background in DevOps technologies and cloud infrastructure. The ideal candidate will have hands-on experience with Kuberne...Show more
    Last updated: 30+ days ago • Promoted
    Infrastructure Reliability Engineer

    Infrastructure Reliability Engineer

    Synamedia • Bengaluru, Republic Of India, IN
    At Synamedia, the world’s most talented innovators and trailblazers are shaping the way the world is entertained and informed. We are backed by the Permira funds and Sky.This is the age of infinite ...Show more
    Last updated: 8 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    People Prime Worldwide • hosur, tamil nadu, in
    Our client is a French multinational information technology (IT) services and consulting company, headquartered in Paris, France. Founded in 1967, It has been a leader in business transformation for...Show more
    Last updated: 30+ days ago • Promoted