Talent.com
This job offer is not available in your country.
Principal Site Reliability Engineer

Principal Site Reliability Engineer

ConfidentialBengaluru / Bangalore, India
30+ days ago
Job description

Get to know Okta

Okta is The World's Identity Company. We free everyone to safely use any technology, anywhere, on any device or app. Our flexible and neutral products, Okta Platform and Auth0 Platform, provide secure access, authentication, and automation, placing identity at the core of business security and growth.

At Okta, we celebrate a variety of perspectives and experiences. We are not looking for someone who checks every single box - we're looking for lifelong learners and people who can make us better with their unique experiences.

Join our team! We're building a world where Identity belongs to you.

What You'll Be Doing

  • Demonstrate full-stack fluency and adaptability to work across unfamiliar systems with minimal ramp-up time.
  • Tackle complex infrastructure challenges by delivering scalable, repeatable solutions with measurable outcomes.
  • Maintain deep engagement with evolving industry trends and internal architecture to align practices with leading standards.
  • Align proactively with organizational strategy, identifying gaps and collaborating on pragmatic, forward-thinking solutions.
  • Bring clarity to ambiguous problems, contributing thoughtful insights and suggesting nuanced improvements based on broad system understanding.
  • Influence through thought leadership, mentoring, and technical guidance across teams, mediums, and organizational layers.
  • Command broad visibility and trust, with impact recognized across senior leadership and peer teams.
  • Independently drive complex initiatives end-to-end, from roadmap planning to cross-functional execution.
  • Take ownership and act with urgency, proactively addressing issues and proposing innovative solutions.
  • Cultivate and manage the technical reputation of your team and functions through high-quality delivery and stakeholder engagement.

What You'll Bring To The Role

  • Strong leadership in building and scaling modern cloud-native infrastructure.
  • Expert-level understanding of Kubernetes (K8s) and Amazon EKS, including architecture, scaling, and advanced configurations.
  • Deep expertise in cloud platforms (AWS preferred), infrastructure as code (Terraform, CloudFormation), and GitOps workflows.
  • Strong Coding Experience in Go or Python
  • Expert in CI / CD, Linux, networking, and OS hardening, with deep knowledge of IP protocols.
  • Deploy and manage k8s clusters & proficient knowledge on EKS
  • Proficient in infrastructure as code (Terraform, Ansible, Chef) and operational tooling (Ruby, Python, Go, Shell).
  • Hands on with deploying and managing database and caching technology like Redis
  • Proven experience implementing robust monitoring, logging, and alerting systems using Prometheus, Grafana, ELK, or similar tools.
  • Practical experience with container security, secrets management, and compliance in production environments.
  • Excellent problem-solving and communication skills, with a track record of mentoring engineers and driving complex projects to completion.
  • Experience

  • 10+ years of experience in DevOps, SRE, or Infrastructure Engineering roles.
  • 5+ years working with Kubernetes / EKS, Helm & Karpenter in production-grade environments.
  • 5+ years of experience architecting and running complex AWS or other cloud networking infrastructure resources
  • 5+ years of experience with Ansible, Chef, and Terraform
  • 5+ years of experience of Coding / Scripting in Python or GoLang or Java
  • 3+ Experience with service meshes (e.g., Istio)
  • Demonstrated success in leading reliability or platform initiatives across large-scale distributed systems.
  • Experience participating in or leading incident response and root cause analysis processes.
  • Prior experience in a high-scale SaaS, or cloud-native startup environment is a strong plus.
  • Strong Linux & security understanding and experience.
  • BS In computer science (or equivalent experience)
  • What you can look forward to as a Full-Time Okta employee!

  • Amazing Benefits
  • Making Social Impact
  • Developing Talent and Fostering Connection + Community at Okta
  • Okta cultivates a dynamic work environment, providing the best tools, technology and benefits to empower our employees to work productively in a setting that best and uniquely suits their needs. Each organization is unique in the degree of flexibility and mobility in which they work so that all employees are enabled to be their most creative and successful versions of themselves, regardless of where they live. Find your place at Okta today! https : / / www.okta.com / company / careers / .

    Some roles may require travel to one of our office locations for in-person onboarding.

    Okta is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, marital status, age, physical or mental disability, or status as a protected veteran. We also consider for employment qualified applicants with arrest and convictions records, consistent with applicable laws.

    If reasonable accommodation is needed to complete any part of the job application, interview process, or onboarding please use this Form to request an accommodation.

    Okta is committed to complying with applicable data privacy and security laws and regulations. For more information, please see our Privacy Policy at https : / / www.okta.com / privacy-policy / .

    Show more

    Show less

    Skills Required

    Elk, Networking, Cloudformation, Chef, Prometheus, Go, Grafana, Redis, Shell, Linux, Terraform, Os Hardening, Ansible, Ruby, Kubernetes, Python, Aws

    Create a job alert for this search

    Site Reliability Engineer • Bengaluru / Bangalore, India

    Related jobs
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Delta Air LinesBengaluru, India
    Execute on the Incident, Change Management, Problem Management processes.Building and supporting a reliable application suite for the environment in order to meet the development and maintenance re...Show moreLast updated: 30+ days ago
    • Promoted
    Sr. Site Reliability Engineer [T500-20179]

    Sr. Site Reliability Engineer [T500-20179]

    Delta Air LinesBengaluru, Karnataka, India
    Delta Air Lines (NYSE : DAL) is the U.Powered by our employees around the world, Delta has for a decade led the airline industry in operational excellence while maintaining our reputation for award-...Show moreLast updated: 26 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Vbeyond corporationBangalore
    SRE (Site Reliability Engineer 2) We are looking for engineers who are passionate about reliability, performance, and efficiency, and with experience in building tool...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    ▷ (Urgent Search) Site Reliability Engineer

    ▷ (Urgent Search) Site Reliability Engineer

    Amicon Hub ServicesBengaluru, Karnataka, India
    Key Responsibilities - Manage and scale production systems hosted on Google Cloud Platform (GCP) - Implement SRE best practices : monitoring, alerting, SLAs, SLOs, and error budgets - Automate oper...Show moreLast updated: 3 hours ago
    • Promoted
    Sr. Site Reliability Engineer [T500-20179]

    Sr. Site Reliability Engineer [T500-20179]

    ConfidentialBengaluru / Bangalore, India
    Delta Air Lines (NYSE : DAL) is the U.Powered by our employees around the world, Delta has for a decade led the airline industry in operational excellence while maintaining our reputation for award-...Show moreLast updated: 18 days ago
    • Promoted
    Senior Site Reliability Engineer- ELK Expert

    Senior Site Reliability Engineer- ELK Expert

    iVedha Inc.hosur, tamil nadu, in
    Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone. Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    QualityKiosk Technologies Pvt. Ltd.Bengaluru, Karnataka, India
    QualityKiosk Technologies is one of the world's largest independent Quality Engineering (QE) providers and digital transformation enablers, helping companies build and manage applications for optim...Show moreLast updated: 5 days ago
    • Promoted
    Lead Site Reliability Engineer [T500-20012]

    Lead Site Reliability Engineer [T500-20012]

    ConfidentialBengaluru / Bangalore, India
    Delta Air Lines (NYSE : DAL) is the U.Powered by our employees around the world, Delta has for a decade led the airline industry in operational excellence while maintaining our reputation for award-...Show moreLast updated: 18 days ago
    • Promoted
    Principal Site Reliability Engineer, AI Infrastructure

    Principal Site Reliability Engineer, AI Infrastructure

    ConfidentialBengaluru / Bangalore, India
    NVIDIA is widely considered to be one of the technology world's most desirable employers.We have some of the most forward-thinking and hardworking people in the world working for us.If you're creat...Show moreLast updated: 18 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    SynechronBangalore Urban, Karnataka, India
    We have immediate opportunity for.SRE (Senior Site Reliability Engineer) 5 to 9 years.SRE (Senior Site Reliability Engineer). We began life in 2001 as a small, self-funded team of technology special...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    WSO2Bengaluru, Karnataka, India
    Founded in 2005, WSO2 is the largest independent software vendor providing open-source API management, integration, and identity and access management (IAM) to thousands of enterprises in over 90 c...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    RecRootsBangalore Urban, Karnataka, India
    The core premise for the SRE lies in treating operational issues as a software problem.We code our way out of problems where operations are concerned, addressing availability, scalability, latency,...Show moreLast updated: 5 days ago
    • Promoted
    Lead Site Reliability Engineer

    Lead Site Reliability Engineer

    Delta Air LinesBengaluru, India
    Execute on the Incident, Change Management, Problem Management processes.Building and supporting reliable applications that meet development and maintenance requirements. Provide consultation and di...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Amicon Hub ServicesBengaluru, Karnataka, India
    Manage and scale production systems hosted on.Automate operational tasks using.Improve system reliability and reduce manual interventions through automation. Collaborate with development teams to en...Show moreLast updated: 16 days ago
    • Promoted
    Staff Site Reliability Engineer (Observability)

    Staff Site Reliability Engineer (Observability)

    Palo Alto NetworksBengaluru, Karnataka, India
    At Palo Alto Networks® everything starts and ends with our mission : .Being the cybersecurity partner of choice, protecting our digital way of life. Our vision is a world where each day is safer and m...Show moreLast updated: 14 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    People Realm Recruitment Services Private LimitedBengaluru, Karnataka, India
    Job Title- Site Reliability Engineer.Desired Years of Experience - 5 - 14 Years of Relevant Experience.A Career with a Leading Global Investment Management Firm’s Technology Team.Our client, a lead...Show moreLast updated: 28 days ago
    • Promoted
    Principal Site Reliability Engineer

    Principal Site Reliability Engineer

    Rakuten IndiaBengaluru, Karnataka, India
    Design, develop SLA, SLO, SLI of services within the Business Unit.Involve in whole process of Development, Production System Operation including system maintenance, monitoring, automation, backend...Show moreLast updated: 17 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    ACL DigitalBengaluru, Karnataka, India
    Service Management : Maintain application uptime / performance, manage system enhancements and defects, oversee daily operational activities, and ensure continuous improvement and adherence to ITIL be...Show moreLast updated: 9 days ago