Talent.com
This job offer is not available in your country.
Lead Sustenance Engineer - Storage

Lead Sustenance Engineer - Storage

DDNHyderabad, IN
7 days ago
Job description

This is an incredible opportunity to be part of a company that has been at the forefront of AI and high-performance data storage innovation for over two decades. DataDirect Networks (DDN) is a global market leader renowned for powering many of the world's most demanding AI data centers, in industries ranging from life sciences and healthcare to financial services, autonomous cars, Government, academia, research and manufacturing.

"DDN's A3I solutions are transforming the landscape of AI infrastructure." – IDC

“The real differentiator is DDN. I never hesitate to recommend DDN. DDN is the de facto name for AI Storage in high performance environments” - Marc Hamilton, VP, Solutions Architecture & Engineering | NVIDIA

DDN is the global leader in AI and multi-cloud data management at scale. Our cutting-edge data intelligence platform is designed to accelerate AI workloads, enabling organizations to extract maximum value from their data. With a proven track record of performance, reliability, and scalability, DDN empowers businesses to tackle the most challenging AI and data-intensive workloads with confidence.

Our success is driven by our unwavering commitment to innovation, customer-centricity, and a team of passionate professionals who bring their expertise and dedication to every project. This is a chance to make a significant impact at a company that is shaping the future of AI and data management.

Our commitment to innovation, customer success, and market leadership makes this an exciting and rewarding role for a driven professional looking to make a lasting impact in the world of AI and data storage.

We are looking for a Lead Software Engineer - Lustre Sustaining Engineer for our team , which focuses on creating storage solutions for the most data-intensive workloads in the world, both HPC and AI / ML. The ideal candidate will have experience designing, implementing, and shipping software using Linux kernel development tooling and practices.

Responsibilities for this role include but are not limited to :

  • Efficient analysis of bug reports and development of software fixes on multiple platforms.
  • Triage, diagnose, and troubleshoot problems in a professional and timely manner often working in production customer environments.
  • Work with the Engineering managers and a geographically distributed team and customer base to ensure professional delivery and appropriate customer engagement and response.
  • Assist with performance tuning of features for specific environments and use-cases.
  • Involve product engineers when deep technical expertise is needed within a specific area of the product.
  • Develop processes and tools to accelerate problem analysis.
  • Track and coordinate bug fixes and communicate status back to Professional Services, Support, and customers.
  • Provide regular and ad hoc reports in an effective and timely manner.

Qualifications :

  • BS / MS in Computer Science, Computer Engineering or equivalent degree / experience.
  • 7+ years of software development experience with C in Linux environments.
  • 7+ years of experience working with enterprise-class or HPC storage systems and / or distributed systems.
  • Strong team player with good communication skills and should be self-starter.
  • Excellent time management skills, with the ability to prioritize, multitask, and work under deadlines in a fast-paced environment.
  • Knowledge of Parallel File Systems, in particular Lustre, is highly preferred.
  • Familiarity with Linux kernel VFS, IO, and the Ext4 file system is preferred.
  • Experience with Git strongly preferred, and JIRA, Jenkins, Gerrit, and Github are assets.
  • Create a job alert for this search

    Storage Engineer • Hyderabad, IN

    Related jobs
    • Promoted
    Sr Engineer, Site Reliability [T500-20279]

    Sr Engineer, Site Reliability [T500-20279]

    ANSRhyderabad, telangana, in
    ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 7 days ago
    • Promoted
    L3 O365 Engineer

    L3 O365 Engineer

    Nextbridge IT SolutionsHyderabad, IN
    We are seeking a highly skilled .This senior role is a critical escalation point for complex issues, driving the resolution of major incidents and ensuring the seamless operation, security, and pro...Show moreLast updated: 7 days ago
    • Promoted
    Engineer, Site Reliability [T500-20520]

    Engineer, Site Reliability [T500-20520]

    ANSRhyderabad, telangana, in
    ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 7 days ago
    • Promoted
    Assistant Manager - Process - Solar Cell

    Assistant Manager - Process - Solar Cell

    Premier Energies LimitedRangareddy, Telangana, India
    Founded in 1995, Premier Energies is a leading solar cell and module manufacturer based in Telangana, India.We operate advanced facilities with 2 GW cell and 5. GW module capacity, and are expanding...Show moreLast updated: 30+ days ago
    • Promoted
    Storage Development Lead

    Storage Development Lead

    SEMI LEAFHyderabad
    Job Description : Role & Responsibilities : < / ...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer - AWS / Google Cloud Platform

    Site Reliability Engineer - AWS / Google Cloud Platform

    INDIGLOBE IT SOLUTIONS PRIVATE LIMITEDHyderabad
    Job Summary : We are looking for a Senior Site Reliability Engineer (SRE) to join our growing Engineering team.As an SRE, you will play a key role in ensuring the rel...Show moreLast updated: 16 days ago
    • Promoted
    Lead - Site Reliability Engineer

    Lead - Site Reliability Engineer

    VXI Global Solutionshyderabad, telangana, in
    We are looking for a Lead - Site Reliability Engineer with 8+ years for Experience into design, implement, and manage robust observability solutions across our cloud infrastructure and applications...Show moreLast updated: 25 days ago
    • Promoted
    Zonal SHE Manager

    Zonal SHE Manager

    United Breweries Ltd.Sangareddy, Telangana, India
    Full time degree in Engineering & technology from a recognized institute.Diploma In Industrial Safety from DISH approved institution is essential. Compliance with Legal Obligations and Company Requi...Show moreLast updated: 3 days ago
    • Promoted
    Energy Storage Engineer

    Energy Storage Engineer

    PVinsight Inc.hyderabad, telangana, in
    Electrical design of battery-based energy storage systems and know-how on PCS, BMS, EMS and SCADA designs.Familiarity with applicable codes, standards and regulations in USA.Due diligence and owner...Show moreLast updated: 17 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    UplersHyderabad, IN
    Uplers is hiring for one of the clients.SRE (Oracle Cloud Infrastructure).Remote | Mon–Fri | 10 : 30 AM – 7 : 30 PM IST.Use of personal device required. OCI cloud infrastructure using Terraform and GitL...Show moreLast updated: 24 days ago
    • Promoted
    Senior Engineer - Maintenance - Solar Module

    Senior Engineer - Maintenance - Solar Module

    Premier Energies LimitedRangareddy, Telangana, India
    Founded in 1995, Premier Energies is a leading solar cell and module manufacturer based in Telangana, India.We operate advanced facilities with 2 GW cell and 5. GW module capacity, and are expanding...Show moreLast updated: 3 days ago
    • Promoted
    Deputy Manager IT

    Deputy Manager IT

    Premier Energies LimitedRangareddy, Telangana, India
    Founded in 1995, Premier Energies is a leading solar cell and module manufacturer based in Telangana, India.We operate advanced facilities with 2 GW cell and 5. GW module capacity, and are expanding...Show moreLast updated: 3 days ago
    • Promoted
    Senior Storage Solutions Engineer

    Senior Storage Solutions Engineer

    Platform9hyderabad, telangana, in
    Platform9 : A Better Way to Go Cloud Native.Platform9 is a leader in simplifying enterprise private clouds.Our flagship product, Private Cloud Director, turns existing infrastructure into a full-fea...Show moreLast updated: 26 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    ConcordHyderabad, IN
    Engineers (Individual Contributors).Strong SRE (Site Reliability Engineering).CI / CD, monitoring, automation, infrastructure as code, etc.Show moreLast updated: 17 days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    AutoRABITHyderabad, India
    AutoRABIT is the leader in DevSecOps for SaaS platforms such as Salesforce.Its unique metadata-aware capability makes Release Management, Version Control, and Backup & Recovery complete, reliable, ...Show moreLast updated: 5 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Insight Global, LLCHyderabad
    We are seeking SRE / Ansible Developers to join our Enterprise SRE Center of Excellence (COE) team.This team is responsible for defining development standards, ensuring compliance, and building autom...Show moreLast updated: 26 days ago
    • Promoted
    Lead Site Reliability Engineer

    Lead Site Reliability Engineer

    ConfidentialHyderabad / Secunderabad, Telangana
    Collaborate with development, operations, and product teams to define, review, and implement reliability standards and best practices. Design, implement, and maintain highly available and scalable a...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Amicon Hub Serviceshyderabad, telangana, in
    Manage and scale production systems hosted on.Automate operational tasks using.Improve system reliability and reduce manual interventions through automation. Collaborate with development teams to en...Show moreLast updated: 5 days ago