Talent.com
Site Reliability Engineer 2
Site Reliability Engineer 2PhonePe • Pune, IN
Site Reliability Engineer 2

Site Reliability Engineer 2

PhonePe • Pune, IN
15 hours ago
Job description

About PhonePe Limited :

Headquartered in India, its flagship product, the PhonePe digital payments app, was launched in Aug 2016. As of April 2025, PhonePe has over 60 Crore (600 Million) registered users and a digital payments acceptance network spread across over 4 Crore (40+ million) merchants. PhonePe also processes over 33 Crore (330+ Million) transactions daily with an Annualized Total Payment Value (TPV) of over INR 150 lakh crore.

PhonePe’s portfolio of businesses includes the distribution of financial products (Insurance, Lending, and Wealth) as well as new consumer tech businesses (Pincode - hyperlocal e-commerce and Indus AppStore Localized App Store for the Android ecosystem) in India, which are aligned with the company’s vision to offer every Indian an equal opportunity to accelerate their progress by unlocking the flow of money and access to services.

Culture :

At PhonePe, we go the extra mile to make sure you can bring your best self to work, Everyday!. And that starts with creating the right environment for you. We empower people and trust them to do the right thing. Here, you own your work from start to finish, right from day one. PhonePe-rs solve complex problems and execute quickly; often building frameworks from scratch. If you’re excited by the idea of building platforms that touch millions, ideating with some of the best minds in the country and executing on your dreams with purpose and speed, join us!

Minimum Experience : 3 Years

About the Role :

This role is responsible for managing and maintaining complex, distributed big data ecosystems. It ensures the reliability, scalability, and security of large-scale production infrastructure. Key responsibilities include automating processes, optimizing workflows, troubleshooting production issues, and driving system improvements across multiple business verticals.

Roles and Responsibilities :

  • Manage, maintain, and support incremental changes to Linux / Unix environments.
  • Lead on-call rotations and incident responses, conducting root cause analysis and driving postmortem processes.
  • Design and implement automation systems for managing infrastructure, including provisioning, scaling, upgrades, and patching clusters.
  • Troubleshoot and resolve complex production issues while identifying root causes and implementing mitigating strategies.
  • Design and review scalable and reliable system architectures.
  • Collaborate with teams to optimize overall system / cluster performance.
  • Enforce security standards across systems and infrastructure.
  • Set technical direction, drive standardization, and operate independently.
  • Ensure availability, performance, and scalability of systems and services through proactive monitoring, maintenance, and capacity planning.
  • Resolve, analyze, and respond to system outages and disruptions and implement measures to prevent similar incidents from recurring.
  • Develop tools and scripts to automate operational processes, reducing manual workload, increasing efficiency and improving system resilience.
  • Monitor and optimize system performance and resource usage, identify and address bottlenecks, and implement best practices for performance tuning.
  • Collaborate with development teams to integrate best practices for reliability, scalability, and performance into the software development lifecycle.
  • Stay informed of industry technology trends and innovations, and actively contribute to the organization's technology communities.
  • Develop and enforce SRE best practices and principles.
  • Align across functional teams on priorities and deliverables.
  • Drive automation to enhance operational efficiency.
  • Adapt new technologies as and when the need arises and define architectural recommendations for new tech stacks.

Skills Required :

  • 3 to 7 years of experience managing and maintaining distributed big ecosystems.
  • Strong expertise in Linux, MySQL, Networking, System Setup, Azure
  • Proficiency in scripting / programming in any backend language.
  • Familiarity with open-source configuration management and deployment tools.
  • Solid understanding of networking, open-source technologies, and related tools.
  • Excellent communication and collaboration skills.
  • On-Prem experience mandatory.
  • DevOps tools : Saltstack, Ansible, docker, Git.
  • SRE Logging and monitoring tools : ELK stack, Grafana, Prometheus, opentsdb, Open Telemetry.
  • Good to Have :

  • Experience managing infrastructure on public cloud platforms.
  • Experience in designing and reviewing system architectures for scalability and reliability.
  • Experience with observability tools to visualize and alert on system performance.
  • Experience in massive petabyte scale data migrations, massive upgrades.
  • PhonePe Full Time Employee Benefits (Not applicable for Intern or Contract Roles)

  • Insurance Benefits - Medical Insurance, Critical Illness Insurance, Accidental Insurance, Life Insurance
  • Wellness Program - Employee Assistance Program, Onsite Medical Center, Emergency Support System
  • Parental Support - Maternity Benefit, Paternity Benefit Program, Adoption Assistance Program, Day-care Support Program
  • Mobility Benefits - Relocation benefits, Transfer Support Policy, Travel Policy
  • Retirement Benefits - Employee PF Contribution, Flexible PF Contribution, Gratuity, NPS, Leave Encashment
  • Other Benefits - Higher Education Assistance, Car Lease, Salary Advance Policy
  • Our inclusive culture promotes individual expression, creativity, innovation, and achievement and in turn helps us better understand and serve our customers. We see ourselves as a place for intellectual curiosity, ideas and debates, where diverse perspectives lead to deeper understanding and better quality results. PhonePe is an equal opportunity employer and is committed to treating all its employees and job applicants equally; regardless of gender, sexual preference, religion, race, color or disability. If you have a disability or special need that requires assistance or reasonable accommodation, during the application and hiring process, including support for the interview or onboarding process, please fill out this form.

    Create a job alert for this search

    Site Reliability Engineer • Pune, IN

    Related jobs
    Site Reliability Engineer

    Site Reliability Engineer

    Capgemini • Pune, IN
    Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Infinova Global Corporate Services LLP • Pune, IN
    Infinova is an emerging player in intelligent business transformation, dedicated to helping organizations scale smarter and achieve sustainable success. We are building a foundation that combines st...Show more
    Last updated: 15 hours ago • Promoted • New!
    Site Reliability Engineer

    Site Reliability Engineer

    Synechron • Pune, Maharashtra, India
    We have immediate opportunity for.SRE (Senior Site Reliability Engineer) 5 to 9 years.SRE (Senior Site Reliability Engineer). We began life in 2001 as a small, self-funded team of technology special...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Talent Worx • Pune, MH, IN
    Quick Apply
    Site Reliability Engineer (SRE).At Talent Worx, we are looking for a dedicated Site Reliability Engineer (SRE) to join our team. This role involves maintaining high availability and reliability of o...Show more
    Last updated: 30+ days ago
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    DeepIntent • Pune, Maharashtra, India
    DeepIntent is leading the healthcare advertising industry with data-driven solutions built for the future.From day one our mission has been to improve patient outcomes through the artful use of adv...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Pagos Consultants • Pune, IN
    This team will play a pivotal role in spearheading innovation.As such, you will have the opportunity to shape the early architecture and design of the system and set the trajectory for its future d...Show more
    Last updated: 4 days ago • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    WSO2 • Pune, IN
    Founded in 2005, WSO2 is the largest independent software vendor providing open-source API management, integration, and identity and access management (IAM) to thousands of enterprises in over 90 c...Show more
    Last updated: 15 hours ago • Promoted • New!
    Site Reliability Engineer

    Site Reliability Engineer

    PhonePe • Pune, Maharashtra, India
    Troubleshoot issues across the entire stack - hardware, software, application, and network.Work to improve the reliability and performance of the next generation of distributed systems.Work to impr...Show more
    Last updated: 27 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Alp Consulting Ltd. • Pune, Maharashtra, India
    Cloud AWS Web Application Security Cloudflare Infrastructure Kubernetes / EKS Helm Terraform GitOps / ArgoCD Kyverno CI / CD Concourse Fabric / Routing Istio mTLS Observability Datadog Prometheus Graf...Show more
    Last updated: 4 hours ago • Promoted • New!
    Site Reliability Engineer II

    Site Reliability Engineer II

    Entain • Pune, Maharashtra, India
    As an SRE Engineer II you will play a crucial role in ensuring that Entain delivers the service performance reliability and availability expected by our internal and external customers.You will sup...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Tata Consultancy Services • Pune, Maharashtra, India
    Role : Site Reliability Engineer.Apply only if you are available on Saturday dated 6th Dec for face to face interview in Chennai / Pune.Show more
    Last updated: 1 day ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Boomi • Pune, Maharashtra, India
    About Boomi and What Makes Us Special.Are you ready to work at a fast-growing company where you can make a difference Boomi aims to make the world a better place by connecting everyone to everythin...Show more
    Last updated: 28 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Hydrolix • Pune, IN
    At Hydrolix, we are revolutionizing the world of data management and analytics with our innovative cloud data platform, purpose-built for petabyte-scale datasets. Our mission is to help organization...Show more
    Last updated: 15 hours ago • Promoted • New!
    Site Reliability Engineer

    Site Reliability Engineer

    Worldline • Pune, Maharashtra, India
    Worldline helps businesses of all shapes and sizes to accelerate their growth journey - quickly, simply, and securely.We are the innovators at the heart of the payments technology industry, shaping...Show more
    Last updated: 1 day ago • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    AppDirect • Pune, Maharashtra, India
    Become a digital global citizen and enable the new generation of digital entrepreneurs around the world.AppDirect offers a subscription commerce platform to sell any product through any channel on ...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer Rotation shift

    Site Reliability Engineer Rotation shift

    Synechron • Pune, Maharashtra, India
    We have immediate opportunity for.SRE (Senior Site Reliability Engineer) 5-8 years.SRE (Senior Site Reliability Engineer). We began life in 2001 as a small, self-funded team of technology specialist...Show more
    Last updated: 20 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Insight Global • Pune, IN
    Contract with Insight Global Client.Join our Site Reliability Engineering (SRE) team as a Fullstack Developer, focused on building and maintaining highly reliable, automated, and scalable systems.Y...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer (SRE) – Infrastructure & Automation

    Site Reliability Engineer (SRE) – Infrastructure & Automation

    InstaService • Pune, IN
    InstaService is revolutionizing the home services industry through AI-driven technology, connecting customers with trusted professionals instantly. We’re growing fast across 23+ states and expanding...Show more
    Last updated: 25 days ago • Promoted