Talent.com
Principal Infrastructure Reliability Engineer

Principal Infrastructure Reliability Engineer

Palo Alto NetworksBengaluru, Republic Of India, IN
30+ days ago
Job description

Our Mission

At Palo Alto Networks® everything starts and ends with our mission :

Being the cybersecurity partner of choice, protecting our digital way of life.

Our vision is a world where each day is safer and more secure than the one before. We are a company built on the foundation of challenging and disrupting the way things are done, and we’re looking for innovators who are as committed to shaping the future of cybersecurity as we are.

Who We Are

We take our mission of protecting the digital way of life seriously. We are relentless in protecting our customers and we believe that the unique ideas of every member of our team contributes to our collective success. Our values were crowdsourced by employees and are brought to life through each of us everyday - from disruptive innovation and collaboration, to execution. From showing up for each other with integrity to creating an environment where we all feel included.

As a member of our team, you will be shaping the future of cybersecurity. We work fast, value ongoing learning, and we respect each employee as a unique individual. Knowing we all have different needs, our development and personal wellbeing programs are designed to give you choice in how you are supported. This includes our FLEXBenefits wellbeing spending account with over 1,000 eligible items selected by employees, our mental and financial health resources, and our personalized learning opportunities - just to name a few!

At Palo Alto Networks, we believe in the power of collaboration and value in-person interactions. This is why our employees generally work full time from our office with flexibility offered where needed. This setup fosters casual conversations, problem-solving, and trusted relationships. Our goal is to create an environment where we all win with precision.

Job Description

Your Career

Palo Alto Networks is looking for a talented Senior Site Reliability Engineer for our ever expanding Infrastructure & Cloud Operations. This position will be a part of the Infrastructure team, you will be working and partnering with our Network, Compute, Security, Database, Applications, and other teams to provide availability, reliability, and observability for our global IT infrastructure environments. You will help with building our next-generation IT operations through Automation, Code, Analytics, and continuous improvement. We are looking for analytical, agile, and influential leaders who can quickly deliver meaningful results and solutions with the flexibility to accommodate evolving business needs and shifting priorities. Are you a motivated, intelligent, creative, and hardworking individual who wants to contribute and make a difference? If yes, this job is for you!

The ideal candidate enjoys working in a fast-paced environment with highly innovative technologies. Our team partners closely with IT and Engineering groups and requires individuals to bring a can-do, positive attitude, with a focus on delivering exceptional customer support.

Your Impact

  • Implementing and supporting the Linux infrastructure as code where our globally distributed customer-facing platform runs.
  • Provision, configure & support resilient hybrid cloud deployment architecture using the automation framework and make it more efficient
  • Manage Linux infrastructure CI / CD platform, work with other SREs in deploying and maintaining automation framework, capacity planning, create and review PKI operational runbooks.
  • Manage scalability, capacity planning, redundancy, and resiliency.
  • Maintain service availability and performance SLAs based on business and product requirements.
  • Contribute to documentation related to design, deployment, validation, operations and DR / BCP.
  • Design proactive service monitoring, alerting and trend analysis of underlying infrastructure, and support the operations team in implementation.
  • Build and operate compute fabric for 1000s of VMs, Kubernetes Clusters. Develop scripts, build tools and write code to automate routine tasks.
  • Provide technical support to platform users
  • Respond to security implementation and audits of the environment.
  • Plan maintenance windows, write up change requests, present technical updates.
  • Participate in On-Call support including participating in RCA as required.
  • Design and implement network, compute and application-level monitoring solutions
  • Implement integrated and automated processes that drive operational excellence
  • Advise on industry best practices as it relates to new product selection
  • Drive operational cadences around business planning and performance management to ensure the efficient running of the IT org

Qualifications

Your Experience

  • First-hand experience with Enterprise infrastructure and application monitoring and reporting tools
  • Strong working experience and exposure to containers and orchestration ( Docker, Kubernetes)
  • Infrastructure as Code knowledge - Terraform, Ansible, Git, Puppet
  • Fluent Scripting skills preferably Python OR Shell OR Bash
  • Exposure to Public Cloud Platforms - GCP (Google cloud) OR AWS
  • Proficient in CI / CD platforms like Jenkins, CircleCI, etc
  • Excellent problem-solving skills;
  • ability to multi-task and prioritize

  • Ability to work independently;
  • works well under pressure

  • Possess solid communication skills, and will be comfortable working in a fast-paced technical environment
  • Background knowledge of network and security technologies
  • Strong hands-on Linux experience in managing and supporting Linux server infrastructure in CentOS / RHEL / Ubuntu.
  • Bachelors / Masters degree in Computer Science, Information Technology or technical stream with the equivalent combination of work experience required.
  • Design and performance tuning for Linux infrastructure and API, in-depth knowledge of multi-tier web applications.
  • Experience in developing and managing APIs, understanding of API infrastructure optimization and security.
  • In-depth knowledge of Certificate Lifecycle Management
  • Fluent in Linux security & system hardening, vulnerability management & patching process. Familiarity with CIS compliance levels.
  • Must be comfortable with Ansible, Chef or similar configuration management tool to manage infrastructure as code and source code control systems such as GIT or SVN.
  • Ability to work cross-functionally across multiple business units, such as product development and engineering
  • Must be able to collaborate with a global team spread across multiple time zones.
  • Passion, drive, energy, a sense of humour and a great attitude!
  • 6+ years of relevant experience, Bachelor or Master’s degree in Computer Science or a related technical field.
  • Experience with administration and orchestration of cloud computing (AWS, GCP, etc.) running virtual or container environments.
  • Good user and admin Linux skills (Ubuntu a plus).Experience with virtual networking.
  • Working experience with IaC tools like Terraform and Ansible. Knowledge of Python and shell scripting.
  • Experience with CI / CD development using platforms like - Jenkins, Harness, Artifactory.
  • Solid problem solving, troubleshooting, critical thinking, communication, and teamwork skills.
  • Passion for automation and monitoring instrumentation in the code.
  • Fluency in coding with one or more - Python, Go, Java, You will have to take coding and design tests as required.
  • Experience in Infrastructure as Code environment - Terraform, Ansible.You will be asked to write and troubleshoot IaC code during interview.
  • Proficient in Kubernetes based deployments, CI / CD platforms like Jenkins, Harness etc..
  • Takes great care in documenting conceptual work, detailed design specifications and can present ideas to engineers and engineering leaders.
  • Knowledge of AIOps, Application of Machine Learning / Artificial Intelligence in Cloud Infrastructure or IT Operations.
  • Additional experience in one or more of the following areas is a big plus

  • Development of self-healing infrastructure and applications.
  • Understanding of Big data, data analytics theory and application.
  • Exposure to Enterprise Business Applications, ITSM frameworks and tools is a big plus.
  • On an everyday basis bring the following traits to succeed :

  • Self-motivated, decisive, with the ability to work through ambiguity, and adapt to change and competing demands.
  • Excellent problem-solving skills;
  • ability to multitask and prioritize

  • Ability to work independently;
  • works well under pressure

  • Possess solid communication skills, and will be comfortable working in a fast-paced technical environment
  • Additional Information

    The Team

    We’re problem solvers that take risks and challenge cybersecurity’s status quo. It’s simple : we can’t accomplish our mission without diverse teams innovating, together.

    We are committed to providing reasonable accommodations for all qualified individuals with a disability. If you require assistance or accommodation due to a disability or special need, please contact us at accommodations@paloaltonetworks.com.

    Palo Alto Networks is an equal opportunity employer. We celebrate diversity in our workplace, and all qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or other legally protected characteristics.

    All your information will be kept confidential according to EEO guidelines.

    Our Commitment

    We’re problem solvers that take risks and challenge cybersecurity’s status quo. It’s simple : we can’t accomplish our mission without diverse teams innovating, together.

    We are committed to providing reasonable accommodations for all qualified individuals with a disability. If you require assistance or accommodation due to a disability or special need, please contact us at accommodations@paloaltonetworks.com.

    Palo Alto Networks is an equal opportunity employer. We celebrate diversity in our workplace, and all qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or other legally protected characteristics.

    All your information will be kept confidential according to EEO guidelines.

    Covid-19 Vaccination Information for Palo Alto Networks Jobs

  • Vaccine requirements and disclosure obligations vary by country.
  • Unless applicable law requires otherwise, you must be vaccinated for COVID or qualify for a reasonable accommodation if :
  • The job requires accessing a company worksite
  • The job requires in-person customer contact and the customer has implemented such requirements
  • You choose to access a Palo Alto Networks worksite
  • If you have questions about the vaccine requirements of this particular position based on your location or job requirements, please inquire with the recruiter.
  • Create a job alert for this search

    Reliability Engineer • Bengaluru, Republic Of India, IN

    Related jobs
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    JRD SystemsBengaluru, Karnataka, India
    Site Reliability Engineer (Windows / Cloud / Automation).We are seeking an experienced Site Reliability Engineer with a strong background in managing Windows infrastructure and cloud environments.T...Show moreLast updated: 24 days ago
    • Promoted
    Regional Cloud Infrastructure Engineer

    Regional Cloud Infrastructure Engineer

    Argyll ScottBangalore, IN
    This position offers an opportunity to lead and support a diverse hybrid IT landscape across the APAC region.The Regional IT and Cloud Specialist will be responsible for managing, optimizing, and s...Show moreLast updated: 5 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CodeKarmahosur, tamil nadu, in
    Site Reliability Engineer (Multi-Cloud Deployments).CodeKarma is redefining how engineering teams understand and evolve complex systems — bringing production context directly into the developer’s w...Show moreLast updated: 25 days ago
    • Promoted
    Principal Engineer

    Principal Engineer

    FV Bankhosur, tamil nadu, in
    FV Bank is a fully licensed and regulated U.With a focus on innovation, security, and compliance, FV Bank is Banking the Future by providing USD banking, digital asset custody services, money marke...Show moreLast updated: 16 days ago
    • Promoted
    Senior Site Reliability Engineer- ELK Expert

    Senior Site Reliability Engineer- ELK Expert

    iVedha Inc.hosur, tamil nadu, in
    Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone. Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 30+ days ago
    • Promoted
    Infrastructure Engineer - Tier3

    Infrastructure Engineer - Tier3

    NEXPLAY SECUREbangalore district, karnataka, in
    The Infrastructure Engineer (Tier III, remote) serves as the senior technical authority within Nexplay Secure's Managed Services division. This role leads the deployment and ongoing support of criti...Show moreLast updated: 30+ days ago
    • Promoted
    Infrastructure Solutions Architect

    Infrastructure Solutions Architect

    BayOne Solutionshosur, tamil nadu, in
    Systems or Solutions Architect.IaaS), and cloud-scale system design.The ideal candidate combines strong fundamentals in.Kubernetes, observability, and automation. You’ll design scalable systems that...Show moreLast updated: 5 days ago
    • Promoted
    Enterprise Engineer

    Enterprise Engineer

    Estarta Solutionshosur, tamil nadu, in
    The role focuses on designing, implementing, and optimizing large-scale enterprise network infrastructures that enable secure, high-performing, and resilient business operations.As a key technical ...Show moreLast updated: 5 days ago
    • Promoted
    Information Technology Infrastructure Engineer

    Information Technology Infrastructure Engineer

    Extended Teams by ExtendedGThosur, tamil nadu, in
    Full-time – working directly with a UK-based company via.UK time, Monday to Friday (flexibility required).We’re looking for an experienced. In this hands-on role, you’ll be responsible for maintaini...Show moreLast updated: 5 days ago
    • Promoted
    System Reliability Engineer

    System Reliability Engineer

    Andromeda SecurityBengaluru, Karnataka, India
    We are seeking an experienced Site Reliability Engineer (SRE) with a strong background in DevOps technologies and cloud infrastructure. The ideal candidate will have hands-on experience with Kuberne...Show moreLast updated: 26 days ago
    • Promoted
    Infrastructure Engineer

    Infrastructure Engineer

    Orbit Core TechBengaluru, Karnataka, India
    AI-driven products and enterprise IT services.Our mission is to simplify complex challenges in healthcare, education, HR, and communication through secure, scalable, and intelligent solutions.Orbit...Show moreLast updated: 4 days ago
    • Promoted
    Lead Site Reliability Engineer

    Lead Site Reliability Engineer

    Delta Air LinesBengaluru, India
    Execute on the Incident, Change Management, Problem Management processes.Building and supporting reliable applications that meet development and maintenance requirements. Provide consultation and di...Show moreLast updated: 30+ days ago
    • Promoted
    Principal Technical Engineer(Configurations)

    Principal Technical Engineer(Configurations)

    Qinecsa Solutionshosur, tamil nadu, in
    We are seeking a Principal Technical Engineer to develop and deploy client configurations for our flagship Qinecsa Vigilance Workbench signal detection platform. The ideal candidate will be dynamic ...Show moreLast updated: 25 days ago
    • Promoted
    Site Reliability Engineer (SRE) – Infrastructure & Automation

    Site Reliability Engineer (SRE) – Infrastructure & Automation

    InstaServicehosur, tamil nadu, in
    InstaService is revolutionizing the home services industry through AI-driven technology, connecting customers with trusted professionals instantly. We’re growing fast across 23+ states and expanding...Show moreLast updated: 3 days ago
    • Promoted
    Sr Systems Engineer Linux – AI Infrastructure

    Sr Systems Engineer Linux – AI Infrastructure

    DC Tech Consultinghosur, tamil nadu, in
    Position : Senior Linux Administrator – AI / ML Infrastructure.We are seeking a highly skilled Senior Linux Administrator to join our team, focusing on the implementation and management of on-premises...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CapgeminiBangalore, IN
    Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues...Show moreLast updated: 15 days ago
    • Promoted
    • New!
    Infrastructure Engineer

    Infrastructure Engineer

    Check Point SoftwareBengaluru, Karnataka, India
    Senior Platform / Infrastructure Engineer.We're seeking a Senior Platform / Infrastructure Engineer to join our Infrastructure team at Check Point Harmony SASE. You'll be instrumental in deploying, main...Show moreLast updated: 6 hours ago
    • Promoted
    Lead - Cloud Reliability Engineer

    Lead - Cloud Reliability Engineer

    Searce Incbangalore district, karnataka, in
    The ‘process-first’ AI-native modern tech consultancy that's rewriting the rules.As an engineering-led consultancy, we are dedicated to relentlessly improving the real business outcomes.Our solvers...Show moreLast updated: 30+ days ago