Talent.com
This job offer is not available in your country.
Sr. System Reliability Engineer

Sr. System Reliability Engineer

ConfidentialBengaluru / Bangalore
18 days ago
Job description
  • The Product and Performance Engineering (PPE) Team ensures the availability and performance of Netskope s applications, particularly in the area of end-user experience
  • This team is a post-incident escalation point for issues where the root cause is not immediately clear, or it has been determined that more than one service component (infrastructure or application) contributed to overall impairment
  • This team owns the determination of the root cause in such cases
  • Typically, the individual assigned to a specific issue will build a tiger team of individuals from across the company who have deep knowledge in a particular area and coordinate activities between these individuals to form and execute on a unified plan
  • The PPE team is ultimately responsible for the outcome (resolution) of the issue
  • What s in it for you

    PPE is seeking a production service-oriented, self-driven, and motivated Infrastructure SRE to join the team and help to build out our existing infrastructure and troubleshoot problems as they arise, ensuring the highest levels of systems and infrastructure availability of Netskope s production services. You will also be responsible for integrating services health metrics, identifying / measuring these service health indicators and providing creative tool sets for the frontline operations support teams.

    Required skills and experience

    • A minimum of 5 - 7 years of experience working in a production data center environment with 1000+ servers
    • Experience troubleshooting complex issues and correlating data from multiple sources such as service applications, linux systems and the network.
    • Deep knowledge of metrics platforms such as Prism, Prometheus, Grafana, Graphite, Sumo Logic etc, and expertise in the collection, analysis and correlation of metrics.
    • The ability to deep dive into network troubleshooting areas such as packet analysis, HTTP / HTTPs, tunneling protocol, load balancer issues, etc.
    • A comprehensive understanding of computer internals and architectures, and experience maintaining common Linux / Unix applications and services.
    • Experience with modern cloud and virtualization technologies such as Docker, Kubernetes, AWS, GCP, KVM, OpenNebula, OpenStack or other orchestration platforms.
    • Strong software development skills using Python, C, C++, Go, etc.
    • Deep expertise with operational support systems, automation, and CI / CD tools.
    • A demonstrated ability and willingness to act as subject matter expert, tracking technology / industry trends, and providing data-driven reasoning for technology path recommendations.
    • Education

    • BSCS or equivalent required, MSCS or equivalent strongly preferred
    • Skills Required

      Unix, Kvm, C++, Data Security, Http, Python

    Create a job alert for this search

    Reliability Engineer • Bengaluru / Bangalore

    Related jobs
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Vbeyond corporationBangalore
    SRE (Site Reliability Engineer 2) We are looking for engineers who are passionate about reliability, performance, and efficiency, and with experience in building tool...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    BayOne Solutionshosur, tamil nadu, in
    Role : Site Reliability Engineer.The CXE Site Reliability Engineering (SRE) team manages the CI / CD pipelines and cloud infrastructure, ensuring seamless deployment, monitoring, and maintenance.Howev...Show moreLast updated: 2 hours ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Amicon Hub Serviceshosur, tamil nadu, in
    Manage and scale production systems hosted on.Automate operational tasks using.Improve system reliability and reduce manual interventions through automation. Collaborate with development teams to en...Show moreLast updated: 6 days ago
    • Promoted
    Senior Site Reliability Engineer- ELK Expert

    Senior Site Reliability Engineer- ELK Expert

    iVedha Inc.hosur, tamil nadu, in
    Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone. Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Central Business Solutions Inc.Bengaluru, India
    Linux SRE (Linux SRE L3 with Infra + Operation Support).The Server Operations team is part of the Enterprise Computing organization within Client. The wider team has presence in cities globally and ...Show moreLast updated: 3 days ago
    • Promoted
    Site Reliability Engineer III - System Architecture

    Site Reliability Engineer III - System Architecture

    HyreSnapBangalore
    Responsibilities : - Architect and lead the design of scalable, reliable infrastructure solutions.Implement strategies for high availabili...Show moreLast updated: 9 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Core Minds Tech SOlutionsHosur
    Job Description : - Engage with our product teams to understand requirements, design, and implement resilient and scalable infrastructure solutions&l...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    ElgebraBangalore
    Role Overview : We are seeking a highly experienced and technically proficient Site Reliability Engineer (SRE) to join our team in support of our c...Show moreLast updated: 3 days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    WSO2Bengaluru, Karnataka, India
    Founded in 2005, WSO2 is the largest independent software vendor providing open-source API management, integration, and identity and access management (IAM) to thousands of enterprises in over 90 c...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    EmbarkGCCBengaluru, India
    Senior Site Reliability Engineer (SRE) – Job Description.Implement and tune SLOs / SLIs, build reliability dashboards, and respond to incidents using Grafana IRM, JSM, and escalation workflows.Monito...Show moreLast updated: 5 days ago
    • Promoted
    Principal / Chief Site Reliability Engineer - Observability Services

    Principal / Chief Site Reliability Engineer - Observability Services

    CollaberaBangalore
    Job Description : As a Principal / Chief Site Reliability Engineer, you will play a critical role in designing, developing, and maintaining scalable and highly reliabl...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    WhiteLotus Talent PartnersBengaluru, Karnataka, India
    L0 and L1 Site Reliability Engineer (SRE) Support.Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by. In this role, you will focu...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    ExasoftBangalore, IN
    Responsibilities and Requirements : .Experience must be at least 10+ years in SRE.Multi Cloud, Hybrid Cloud – on Data center sites. Experience with multiple operating systems (.Operating Systems, Kern...Show moreLast updated: 5 hours ago
    • Promoted
    Sr Advanced Systems Engineer

    Sr Advanced Systems Engineer

    HoneywellBengaluru, Karnataka, India
    Nasdaq : HON) invents and commercializes technologies that address some of the world’s most critical challenges around energy, safety, security, air travel, productivity, and global urbanization.We ...Show moreLast updated: 15 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Uplershosur, tamil nadu, in
    Uplers is hiring for one of the clients.SRE (Oracle Cloud Infrastructure).Remote | Mon–Fri | 10 : 30 AM – 7 : 30 PM IST.Use of personal device required. OCI cloud infrastructure using Terraform and GitL...Show moreLast updated: 24 days ago
    • Promoted
    ThoughtSpot - Senior System Reliability Engineer I - Cloud Infrastructure

    ThoughtSpot - Senior System Reliability Engineer I - Cloud Infrastructure

    THOUGHTSPOT INDIA PRIVATE LIMITEDBangalore
    About The Role : ThoughtSpot is an AI-powered analytics platform that enables users to explore and analyze data through natural language queries, making insights acce...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    ViewSonicBengaluru, Karnataka, India
    At ViewSonic Technologies, we’re passionate about building software that solves problems.We count on our site reliability engineers (SREs) to empower users with a rich feature set, high availabilit...Show moreLast updated: 30+ days ago
    • Promoted
    Sr. Site Reliability Engineer [T500-20179]

    Sr. Site Reliability Engineer [T500-20179]

    Delta Air Linesbangalore, karnataka, in
    Delta Air Lines (NYSE : DAL) is the U.Powered by our employees around the world, Delta has for a decade led the airline industry in operational excellence while maintaining our reputation for award-...Show moreLast updated: 18 days ago