Talent.com
Principal Site Reliability Engineer (Observability)

Palo Alto Networks • Bengaluru, Republic Of India, IN
7 days ago
Job description

Our Mission

At Palo Alto Networks®, everything starts and ends with our mission:

Being the cybersecurity partner of choice, protecting our digital way of life.

Our vision is a world where each day is safer and more secure than the one before. We are a company built on the foundation of challenging and disrupting the way things are done, and we’re looking for innovators who are as committed to shaping the future of cybersecurity as we are.

Who We Are

We take our mission of protecting the digital way of life seriously. We are relentless in protecting our customers, and we believe that the unique ideas of every member of our team contribute to our collective success. Our values were crowdsourced by employees and are brought to life through each of us every day, from disruptive innovation and collaboration to execution, and from showing up for each other with integrity to creating an environment where we all feel included.

As a member of our team, you will be shaping the future of cybersecurity. We work fast, value ongoing learning, and we respect each employee as a unique individual. Knowing we all have different needs, our development and personal wellbeing programs are designed to give you choice in how you are supported. This includes our FLEXBenefits wellbeing spending account with over 1,000 eligible items selected by employees, our mental and financial health resources, and our personalized learning opportunities - just to name a few!

At Palo Alto Networks, we believe in the power of collaboration and value in-person interactions. This is why our employees generally work full time from our office with flexibility offered where needed. This setup fosters casual conversations, problem-solving, and trusted relationships. Our goal is to create an environment where we all win with precision.

Your Career

Palo Alto Networks is looking for a talented Senior Site Reliability Engineer for our ever-expanding Infrastructure & Cloud Operations. This position will be part of the Infrastructure team. You will partner with our Network, Compute, Security, Database, Applications, and other teams to provide availability, reliability, and observability for our global IT infrastructure environments. You will help build our next-generation IT operations through automation, code, analytics, and continuous improvement. We are looking for analytical, agile, and influential leaders who can quickly deliver meaningful results and solutions with the flexibility to accommodate evolving business needs and shifting priorities. Are you a motivated, intelligent, creative, and hardworking individual who wants to contribute and make a difference? If yes, this job is for you!

The ideal candidate enjoys working in a fast-paced environment with highly innovative technologies. Our team partners closely with IT and Engineering groups and requires individuals to bring a can-do, positive attitude, with a focus on delivering exceptional customer support.

Your Impact

  • Implement and support, as code, the Linux infrastructure on which our globally distributed customer-facing platform runs.
  • Provision, configure, and support a resilient hybrid cloud deployment architecture using the automation framework, and make it more efficient.
  • Manage the Linux infrastructure CI/CD platform, work with other SREs to deploy and maintain the automation framework, plan capacity, and create and review PKI operational runbooks.
  • Manage scalability, capacity planning, redundancy, and resiliency.
  • Maintain service availability and performance SLAs based on business and product requirements.
  • Contribute to documentation related to design, deployment, validation, operations, and DR/BCP.
  • Design proactive service monitoring, alerting and trend analysis of underlying infrastructure, and support the operations team in implementation.
  • Build and operate the compute fabric for thousands of VMs and Kubernetes clusters. Develop scripts, build tools, and write code to automate routine tasks.
  • Provide technical support to platform users.
  • Respond to security implementation and audits of the environment.
  • Plan maintenance windows, write up change requests, present technical updates.
  • Participate in on-call support, including RCA as required.
  • Design and implement network, compute, and application-level monitoring solutions.
  • Implement integrated and automated processes that drive operational excellence.
  • Advise on industry best practices as they relate to new product selection.
  • Drive operational cadences around business planning and performance management to ensure the efficient running of the IT org

Qualifications

Your Experience

  • Bachelor's/Master's degree in Computer Science, Information Technology, or another technical stream, or an equivalent combination of education and experience, with a minimum of 8 years of work experience required.
  • Experience designing, implementing, and maintaining comprehensive monitoring and observability solutions. This includes implementing and managing observability frameworks with a solid understanding of MELT (Metrics, Events, Logs, Traces).
  • Strong working experience with and exposure to containers and orchestration (Docker, Kubernetes).
  • Experience with administration and orchestration of cloud computing (AWS, GCP, etc.) running virtual or container environments.
  • Infrastructure as Code knowledge: Terraform, Ansible, Git, Puppet.
  • Fluent scripting skills, preferably in Python, Shell, or Bash.
  • Proficiency in CI/CD platforms such as Jenkins, CircleCI, etc.
  • Background knowledge of network and security technologies
  • Experience in developing and managing APIs, understanding of API infrastructure optimization and security
  • Ability to work cross-functionally across multiple business units, such as product development and engineering
  • Must be able to collaborate with a global team spread across multiple time zones.
  • Passion, drive, energy, a sense of humour and a great attitude!
  • Knowledge of AIOps and the application of Machine Learning/Artificial Intelligence in cloud infrastructure, observability, or IT operations.
  • Additional experience in one or more of the following areas is a big plus:

  • Development of self-healing infrastructure and applications.
  • Understanding of big data and data analytics theory and application.
  • Exposure to enterprise business applications, ITSM frameworks, and tools.

Additional Information

The Team

We’re problem solvers that take risks and challenge cybersecurity’s status quo. It’s simple: we can’t accomplish our mission without diverse teams innovating, together.

We are committed to providing reasonable accommodations for all qualified individuals with a disability. If you require assistance or accommodation due to a disability or special need, please contact us at accommodations@paloaltonetworks.com.

Palo Alto Networks is an equal opportunity employer. We celebrate diversity in our workplace, and all qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or other legally protected characteristics.

All your information will be kept confidential according to EEO guidelines.