Talent.com
This job offer is not available in your country.
Manager, Site Reliability Engineering (Cortex XDR XSIAM)

Manager, Site Reliability Engineering (Cortex XDR XSIAM)

Palo Alto NetworksMeerut, IN
2 days ago
Job description

Our Mission

At Palo Alto Networks® everything starts and ends with our mission :

Being the cybersecurity partner of choice, protecting our digital way of life.

Our vision is a world where each day is safer and more secure than the one before. We are a company built on the foundation of challenging and disrupting the way things are done, and we’re looking for innovators who are as committed to shaping the future of cybersecurity as we are.

Who We Are

We take our mission of protecting the digital way of life seriously. We are relentless in protecting our customers and we believe that the unique ideas of every member of our team contributes to our collective success. Our values were crowdsourced by employees and are brought to life through each of us everyday - from disruptive innovation and collaboration, to execution. From showing up for each other with integrity to creating an environment where we all feel included.

As a member of our team, you will be shaping the future of cybersecurity. We work fast, value ongoing learning, and we respect each employee as a unique individual. Knowing we all have different needs, our development and personal wellbeing programs are designed to give you choice in how you are supported. This includes our FLEXBenefits wellbeing spending account with over 1,000 eligible items selected by employees, our mental and financial health resources, and our personalized learning opportunities - just to name a few!

At Palo Alto Networks, we believe in the power of collaboration and value in-person interactions. This is why our employees generally work full time from our office with flexibility offered where needed. This setup fosters casual conversations, problem-solving, and trusted relationships. Our goal is to create an environment where we all win with precision.

Job Description

Your Career

We’re seeking an experienced hands-on Cloud SRE manager to lead high-severity incident and problem management across our GCP-centric platforms. This role combines deep technical troubleshooting with process ownership, ensuring rapid recovery, root cause elimination, and long-term reliability improvements. You will own L3 OnCall responsibilities, drive post-incident learning, and champion automation and operational excellence.

Implement and lead post-mortem processes within SLAs, identify root causes, and drive corrective actions to reduce repeat incidents.

More information about the Cortex product can be found here

Your Impact

  • In your technical and leadership capacity you will contribute to a seamless production site reliability operations , partnering closely with regional and global SRE counterparts with special attention to the below
  • Incident Analysis & Problem Management : Implement and lead post-mortem processes within SLAs, identify root causes, and drive corrective actions to reduce repeat incidents. Establish and maintain a problem backlog, ensuring timely resolution and continuous process improvement.
  • Troubleshooting : Rapidly diagnose and resolve failures across Kubernetes, Terraform, and GCP using advanced troubleshooting frameworks.
  • Preventative Measures : Implement automation and enhanced monitoring to proactively detect issues and reduce incident frequency.
  • Stakeholder Communication : Work with GCP / AWS TAMs and other vendors to request new features or followups for updates.
  • Mentorship : Coach and elevate SRE and DevOps teams, promoting best practices in reliability and incident / problem management.
  • Documentation : Establish and maintain a problem backlog, ensuring timely resolution and continuous process improvement.
  • Envision the future or SRE with AI / ML : Ability to envision how a modern SRE team should operate leveraging AI / ML

Qualifications

Your Experience

  • 12+ years of experience in SRE / DevOps / Infrastructure roles, with a strong foundation in GCP cloud-based environments.
  • 5+ years of proven experience managing SRE / DevOps teams, preferably with a strong focus on Google Cloud Platform (GCP).
  • Deep hands-on knowledge of Terraform, Kubernetes (GKE), GitLab CI / CD, and modern observability practices (e.g., Prometheus, OpenTelemetry).
  • Strong knowledge in Data Platforms like BIgQuery , Cassandra , Kafka , PostgreSQL and MySQL is mandatory.
  • Strong experience in managing incident response and postmortems, reducing MTTR, and driving proactive reliability improvements.
  • Proficiency with cloud platforms such as GCP & AWS.
  • Solid grasp of Infrastructure as Code, container orchestration, and scalable cloud architectures.
  • Track record of building tools for system reliability, automated remediation, and performance tuning.
  • Experience leveraging AI / ML-based operations tools for automation, anomaly detection, and predictive alerting is a plus.
  • Expertise in SLI / SLO / SLA design and implementation, and driving operational maturity through data.
  • Strong interpersonal and leadership skills, with a demonstrated ability to coach, mentor, and inspire teams.
  • Effective communicator, capable of translating complex technical concepts to non-technical stakeholders.
  • Committed to inclusion, collaboration, and creating a culture where every voice is heard and respected.
  • Additional Information

    The Team

    To stay ahead of the curve, it’s critical to know where the curve is, and how to anticipate the changes we’re facing. For the fastest-growing cybersecurity company, the curve is the evolution of cyberattacks and access technology and the products and services that dedicatedly address them. Our engineering team is at the core of our products – connected directly to the mission of preventing cyberattacks and enabling secure access to all on-prem and cloud applications. They are constantly innovating – challenging the way we, and the industry, think about Access and security. These engineers aren’t shy about building products to solve problems no one has pursued before. They define the industry, instead of waiting for directions. We need individuals who feel comfortable in ambiguity, excited by the prospect of challenge, and empowered by the unknown risks facing our everyday lives that are only enabled by a secure digital environment.

    Our engineering team is provided with an unrivaled chance to create the products and practices that will support our company growth over the next decade, defining the cybersecurity industry as we know it. If you see the potential of how incredible people and products can transform a business, this is the team for you. If the prospect of affecting tens of millions of people, enabling them to work remotely securely and easily in ways never done before, thrill you - you belong with us.

    Our Commitment

    We’re problem solvers that take risks and challenge cybersecurity’s status quo. It’s simple : we can’t accomplish our mission without diverse teams innovating, together.

    We are committed to providing reasonable accommodations for all qualified individuals with a disability. If you require assistance or accommodation due to a disability or special need, please contact us at accommodations@paloaltonetworks.com.

    Palo Alto Networks is an equal opportunity employer. We celebrate diversity in our workplace, and all qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or other legally protected characteristics.

    All your information will be kept confidential according to EEO guidelines.

    Is role eligible for Immigration Sponsorship? No. Please note that we will not sponsor applicants for work visas for this position.

    Create a job alert for this search

    Engineering Manager • Meerut, IN

    Related jobs
    • Promoted
    Engineering Manager

    Engineering Manager

    Pine LabsMeerut, IN
    We are looking for proactive engineering managers with 10+ years of engineering experience and proven leadership skills.Engineering managers are expected to work on many projects at the same time, ...Show moreLast updated: 23 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    ConcordMeerut, IN
    Engineers (Individual Contributors).Strong SRE (Site Reliability Engineering).CI / CD, monitoring, automation, infrastructure as code, etc.Show moreLast updated: 15 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    XebiaGhaziabad, IN
    AWS Engineer with strong Python development and Chaos Engineering expertise.The ideal candidate will combine cloud engineering, DevOps, and chaos experimentation to improve reliability, fault toler...Show moreLast updated: 24 days ago
    • Promoted
    Engineering Manager

    Engineering Manager

    Branch InternationalMeerut, IN
    Branch delivers world-class financial services to the mobile generation.With offices in the United States, Nigeria, Kenya, and India, Branch is a for-profit socially conscious company that uses the...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Manager - Product Engineering [T500-19241]

    Senior Manager - Product Engineering [T500-19241]

    Neighborly®meerut, uttar pradesh, in
    Neighborly is a local network of home service brands that will connect you to very specific vetted local experts.Our family of service professionals work with rigorous quality standards to repair, ...Show moreLast updated: 23 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CorroHealthNoida, Uttar Pradesh, India
    We are seeking a highly skilled Site Reliability Engineer (SRE) to join our team.The ideal candidate will have a deep understanding of both software engineering and systems administration, with a f...Show moreLast updated: 15 days ago
    • Promoted
    Sr Site Reliability Engineer

    Sr Site Reliability Engineer

    ConfidentialNoida, India
    We are seeking a Site Reliability Engineer (SRE) with a proven track record of self-regulation and extensive experience in modern DevOps practices. The ideal candidate will possess deep knowledge an...Show moreLast updated: 6 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Amicon Hub Servicesmeerut, uttar pradesh, in
    Manage and scale production systems hosted on.Automate operational tasks using.Improve system reliability and reduce manual interventions through automation. Collaborate with development teams to en...Show moreLast updated: 3 days ago
    • Promoted
    Engineering Manager

    Engineering Manager

    CareerPartnerMeerut, IN
    We want a Senior Engineering Manager who thrives as a technical leader first, with.You'll spend a significant portion of your time designing and coding, while also driving.Own end-to-end architectu...Show moreLast updated: 3 days ago
    • Promoted
    Engineering Manager

    Engineering Manager

    Petals Careers Private Limitedmeerut, uttar pradesh, in
    Our client is on a mission to help organizations make sense of the world's data.Unstructured dark data contains nuggets of information that, when paired with human context, unlock some of the most ...Show moreLast updated: 17 days ago
    • Promoted
    Engineering Manager

    Engineering Manager

    AlohaABA IndiaMeerut, IN
    AlohaABA, a dynamic technology product organization based in California, USA, with a development center in Hyderabad, India, specializes in providing innovative cloud-based practice management soft...Show moreLast updated: 2 days ago
    • Promoted
    Engineering Manager

    Engineering Manager

    ApeiroNoida, India
    Enabling governments to transform healthcare and ensuring no one is left behind in receiving high quality healthcare services. Apeiro is bringing healthcare into the digital age with state-of-the-ar...Show moreLast updated: 3 days ago
    • Promoted
    Engineering Manager

    Engineering Manager

    AiPriseMeerut, IN
    The ideal candidate will be responsible for managing and inspiring his or her team to achieve their performance metrics.Your role will involve strategizing, project management, part staff managemen...Show moreLast updated: 30+ days ago
    Site Reliability engineering II

    Site Reliability engineering II

    Trigent Software Private LimitedNoida, UP, India
    Quick Apply
    Design customized hosted managed solutions that are performant, cost effective, and delight our customers.Deploy and manage solutions efficiently on private or public cloud, and ensure they meet es...Show moreLast updated: 3 days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    WSO2meerut, uttar pradesh, in
    Founded in 2005, WSO2 is the largest independent software vendor providing open-source API management, integration, and identity and access management (IAM) to thousands of enterprises in over 90 c...Show moreLast updated: 5 days ago
    • Promoted
    Site Reliability Engineer - Chaos Management

    Site Reliability Engineer - Chaos Management

    Xebianoida, delhi, in
    AWS Engineer with strong Python development and Chaos Engineering expertise.The ideal candidate will combine cloud engineering, DevOps, and chaos experimentation to improve reliability, fault toler...Show moreLast updated: 5 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    UplersGhaziabad, IN
    Uplers is hiring for one of the clients.SRE (Oracle Cloud Infrastructure).Remote | Mon–Fri | 10 : 30 AM – 7 : 30 PM IST.Use of personal device required. OCI cloud infrastructure using Terraform and GitL...Show moreLast updated: 22 days ago
    • Promoted
    Senior Site Reliability Engineer- ELK Expert

    Senior Site Reliability Engineer- ELK Expert

    iVedha Inc.Ghaziabad, IN
    Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone. Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 30+ days ago
    • Promoted
    Resource Deployment Manager

    Resource Deployment Manager

    PTR GlobalMeerut, IN
    Pinnacle Group is a nationally recognized leader in workforce solutions, known for delivering high-impact staffing, talent management, and contingent workforce programs. We support some of the most ...Show moreLast updated: 5 days ago
    • Promoted
    Site Reliability Engineer / Lead Site Reliability Engineer

    Site Reliability Engineer / Lead Site Reliability Engineer

    ConfidentialNoida, India
    BOLD is seeking professionals who will be responsible for performing the build and release activities with Microsoft Technology stack. This person will also manage CI / CD pipelines and automate the b...Show moreLast updated: 6 days ago