Talent.com
Senior Site Reliability Engineer (SRE)

Senior Site Reliability Engineer (SRE)

ConfidentialBengaluru / Bangalore, India
6 days ago
Job description

About Us

At SentinelOne, we're redefining cybersecurity by pushing the limits of what's possible—leveraging AI-powered, data-driven innovation to stay ahead of tomorrow's threats.

From building industry-leading products to cultivating an exceptional company culture, our core values guide everything we do. We're looking for passionate individuals who thrive in collaborative environments and are eager to drive impact. If you're excited about solving complex challenges in bold, innovative ways, we'd love to connect with you.

What Are We Looking For

SentinelOne is seeking highly experienced SRE Engineers to join the Observo.ai team, our cutting-edge AI-driven data pipeline optimization platform. This role will be responsible for identifying opportunities to simplify and improve the reliability of our deployments, drive such changes to closure and demonstrating real impact in ease of customer onboarding.

We aim to build an SRE team around the leads and they will be responsible for core decisions around telemetry and building automated tools to identify and resolve problems without need for human intervention. The leads would also be expected to own alert management end to end, including identifying novel conditions to alert on, resolving complex performance degradation episodes or other non-functional degradation. Data pipelines are often at the heart of organization's security posture management and extreme levels of reliability are needed to support this. The ideal candidate should bring extensive expertise in both stateless and distributed persistent-data systems, with a proven track record of driving simplification and reliability in deployments and operation at scale. This role is part of the Observo.ai engineering organization.

This role is part of the Observo AI engineering organization and offers excellent opportunities for technical growth while contributing to our global engineering efforts. This is a hybrid role with 3 days in our Bengaluru office.

What Will You Do

  • De-escalate and resolve functional and non-functional problems that surface in production deployments
  • Identify patterns and look for opportunities to build self-healing mechanisms for anti-fragile deployments
  • Make improvements to the stack to make detection / diagnosis of rare conditions easy (preferrably self-served)
  • Simplify deployments, make it easy for customers to perform one-touch deployments across a wide variety of environments with stringent security requirements.
  • Identify gaps in telemetry generated by the product, partner and strategize with engineering to get the necessary telemetry built into the product as new capabilities and we build out entirely new product areas.
  • Provide technical leadership and mentorship to senior and junior SREs, establishing engineering best practices and culture
  • Influence technical decision-making forums and contribute to company-wide engineering standards and practices

What Skills and Knowledge Should You Bring

  • 10+ years of deep experience with focus on distributed systems, data engineering, or ML infrastructure in high-growth SaaS environments
  • Proficiency in Go, Rust, Python, Scala etc to look into the code when necessary to understand complex conditions that manifest in production environments
  • Extensive experience with cloud platforms (AWS, GCP, Azure) and container orchestration technologies (Kubernetes, Docker) at enterprise scale
  • Proven track record in leading and scaling SRE teams, protecting and nurturing emphasis on depth and accuracy in production debugging
  • Strong bias for working closely and leading by example, setting high standards for everyone.
  • Prior exposure or interest in machine learning frameworks (TensorFlow, PyTorch, scikit-learn) and MLOps practices for production ML systems at scale
  • Expert knowledge of observability and monitoring tools and practices, with strong intuition in high-value, low-noise alerting and monitoring.
  • Strong leadership and technical communication skills with experience in managing outages involving multiple teams and stakeholders
  • Track record of mentoring engineers and establishing technical standards and best practices in complex engineering organizations
  • Bachelor's degree in Computer Science, Engineering, or related field; advanced degree preferred
  • Why Us

  • You'll be joining a young, dynamic team—essentially a startup within SentinelOne—where you'll have the opportunity to solve foundational problems as we grow rapidly
  • You will be joining a cutting-edge company where you will tackle extraordinary challenges and work with the very best in the industry
  • You'll work on technology that directly impacts how enterprises understand and optimize their data infrastructure, solving problems at unprecedented scale
  • You'll be part of the Observo AI team that's revolutionizing how organizations handle observability data, with direct impact on customer cost savings and operational efficiency
  • You'll collaborate with world-class engineers, data scientists, and product leaders in a fast-paced, innovation-driven environment across global teams
  • You'll have access to cutting-edge AI / ML tools and platforms, with opportunity to shape the future of intelligent data processing
  • You'll contribute to building SentinelOne's engineering excellence in one of the world's most dynamic technology markets
  • Benefits

  • Competitive salary and equity package aligned with senior-level roles in the Indian market
  • Comprehensive health insurance for you and your family
  • Flexible work arrangements with hybrid office model (3 days in office)
  • Professional development opportunities including training, conferences, and skill development programs
  • Annual performance bonus and stock option participation
  • Paid time off and public holidays
  • Team building and company events including regular team activities and celebrations
  • Modern office facilities in Bengaluru with state-of-the-art technology and amenities
  • Career growth opportunities within a rapidly expanding global technology company
  • Mentorship opportunities with staff and principal engineers for continuous learning
  • SentinelOne is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, gender (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics.

    SentinelOne participates in the E-Verify Program for all U.S. based roles.

    Skills Required

    Rust, Scala, Monitoring Tools, Go, data engineering , Tensorflow, Pytorch, Gcp, MLops, Docker, Distributed Systems, Azure, Python, Kubernetes, Aws

    Create a job alert for this search

    Senior Site Reliability Engineer • Bengaluru / Bangalore, India

    Related jobs
    • Promoted
    Site Reliability Engineer (SRE II)

    Site Reliability Engineer (SRE II)

    greytHRBengaluru, Karnataka, India
    We are looking for a passionate and detail-oriented.Site Reliability Engineer (SRE).As an SRE, you will play a critical role in ensuring the reliability, scalability, and performance of our infrast...Show moreLast updated: 23 days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    ITC InfotechBengaluru, Karnataka, India
    Must-Have Requirements - Experience : 5–8 years in SRE and / or DevOps roles - Programming Skills : Proficiency in at least one coding language — preferably Python or C++ - Platform Support : Experienc...Show moreLast updated: 21 days ago
    • Promoted
    Senior Site Reliability Engineer / Senior Cloud Engineer

    Senior Site Reliability Engineer / Senior Cloud Engineer

    CloudHirehosur, tamil nadu, in
    The Technical Manager for Site Reliability Engineering (SRE) will lead a remote team of Site Reliability Engineers, ensuring operational excellence and fostering a high-performing team culture.Repo...Show moreLast updated: 2 days ago
    • Promoted
    Senior Site Reliability Engineer (SRE)

    Senior Site Reliability Engineer (SRE)

    Tata Consultancy ServicesBengaluru, Karnataka, India
    Senior Site Reliability Engineer (SRE).Senior Site Reliability Engineer (SRE).Desired Experience Range : 7 - 10 yrs.Notice Period : Immediate to 90Days only. We are currently planning to do a Virtual....Show moreLast updated: 13 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CodeKarmahosur, tamil nadu, in
    Site Reliability Engineer (Multi-Cloud Deployments).CodeKarma is redefining how engineering teams understand and evolve complex systems — bringing production context directly into the developer’s w...Show moreLast updated: 23 days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Nebula Tech Solutionsbangalore, karnataka, in
    SRE team supporting mission-critical applications for our.We’re now looking for engineers who can go beyond operations — those who can. Enhance application reliability through code.Add or modify cod...Show moreLast updated: 3 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    SynechronBengaluru, Karnataka, India
    We have immediate opportunity for Senior Site Reliability Engineer.Senior Site Reliability Engineer.At Synechron, we believe in the power of digital to transform businesses for the better.Our globa...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer- ELK Expert

    Senior Site Reliability Engineer- ELK Expert

    iVedha Inc.hosur, tamil nadu, in
    Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone. Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 30+ days ago
    • Promoted
    Sr Site Reliability Engineer

    Sr Site Reliability Engineer

    Media.netBengaluru, Karnataka, India
    Our proprietary contextual technology is at the forefront of enhancing Programmatic buying, the latest industry standard in ad buying for digital platforms. HQ is based in New York, and the Global H...Show moreLast updated: 13 days ago
    • Promoted
    • New!
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    ThalesBengaluru, Republic Of India, IN
    Apply SRE core tenets of measurement (SLI / SLO / SLA), eliminate toil, and reliability modeling.Enable and educate development teams on industry best practice design patterns, ways of working and oper...Show moreLast updated: 9 hours ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    greytHRBengaluru, Republic Of India, IN
    We are looking for a passionate and detail-oriented.Site Reliability Engineer (SRE).As an SRE, you will play a critical role in ensuring the reliability, scalability, and performance of our infrast...Show moreLast updated: 24 days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    RazorpayBengaluru, Republic Of India, IN
    Senior DevOps engineer is critical to the project’s overall success, from planning to supporting primary KPIs such as customer satisfaction and productivity. A DevOps Engineering Expert has an essen...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    IntraEdgeBangalore, IN
    Strong leadership and people management skills.Exceptional technical proficiency in Pearson's technology stack.Strategic thinking with a focus on long-term operational excellence.Champion operation...Show moreLast updated: 15 days ago
    • Promoted
    • New!
    Site Reliability Engineer (SRE) – Infrastructure & Automation

    Site Reliability Engineer (SRE) – Infrastructure & Automation

    InstaServicehosur, tamil nadu, in
    InstaService is revolutionizing the home services industry through AI-driven technology, connecting customers with trusted professionals instantly. We’re growing fast across 23+ states and expanding...Show moreLast updated: 17 hours ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    super.moneyBengaluru, Karnataka, India
    Site Reliability Engineer (SRE) Level 3.A Site Reliability Engineer (SRE) Level 3 is a senior technical leadership role focused on designing, implementing, and maintaining large-scale, complex, and...Show moreLast updated: 3 days ago
    • Promoted
    Sr Engineer, Site Reliability [T500-21295]

    Sr Engineer, Site Reliability [T500-21295]

    TMUS Global Solutionshosur, tamil nadu, in
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 1 day ago
    • Promoted
    Senior Site Reliability Engineer (SRE) – Datadog Observability

    Senior Site Reliability Engineer (SRE) – Datadog Observability

    Jade Globalbangalore district, karnataka, in
    Senior Site Reliability Engineer (SRE) – Datadog Observability.SRE and Infrastructure Operations with minimum 3.Hyderabad preferable but open for Pune and remote. Site Reliability Engineer (SRE).SRE...Show moreLast updated: 3 days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Tata Consultancy ServicesBengaluru, Republic Of India, IN
    Senior Site Reliability Engineer (SRE).Senior Site Reliability Engineer (SRE).Desired Experience Range : 7 - 10 yrs.Notice Period : Immediate to 90Days only. We are currently planning to do a Virtual....Show moreLast updated: 13 days ago