Talent.com
SMTS Systems Design Eng.
Confidential • Hyderabad / Secunderabad, Telangana
30+ days ago
Job description

We are seeking an experienced HPC Systems Engineer with 7+ years of expertise in high-performance computing (HPC) environments. This role requires hands-on experience with Python, Kubernetes (K8s), Slurm, OpenStack, and Ansible, along with the ability to support external clients in live troubleshooting sessions.

THE PERSON:

The ideal candidate will have deep technical knowledge of drivers, troubleshooting methods, and system-level debugging, and will play a key role in managing, optimizing, and troubleshooting HPC clusters and cloud-based HPC environments.

KEY RESPONSIBILITIES:

HPC System Administration & Troubleshooting
  • Manage and optimize HPC clusters, ensuring high availability and performance.
  • Troubleshoot GPU, CPU, network drivers, firmware, and OS-level issues.
  • Debug storage, networking, and job scheduling bottlenecks in Slurm-based environments.

Kubernetes & Cloud HPC Environments
  • Deploy and manage HPC workloads in Kubernetes for AI / ML and parallel computing.
  • Optimize OpenStack-based HPC clusters with Ceph, Cinder, and Neutron for cloud scalability.
  • Implement containerized HPC workflows using Kubernetes and OpenShift.

Automation & Infrastructure as Code (IaC)
  • Develop Ansible and Terraform scripts for provisioning and managing HPC resources.
  • Automate job scheduling, cluster monitoring, and log analysis using Python.
  • Optimize CI / CD pipelines for HPC and AI / ML applications.

Performance Tuning & Benchmarking
  • Benchmark and optimize multi-node HPC workloads (MPI, NCCL, ROCm, CUDA).
  • Tune OS parameters, networking (InfiniBand, RoCE), and Slurm configurations for peak performance.
  • Enhance HPC storage performance (Ceph, Lustre, NFS) and distributed computing efficiency.

Client Support & Collaboration
  • Provide real-time technical support and troubleshooting for HPC users.
  • Engage with developers, DevOps, and system administrators to optimize cluster performance.
  • Document solutions and best practices, and contribute to internal knowledge bases.

PREFERRED QUALIFICATIONS

  • Experience with AMD MI300, MI2X0 GPUs, ROCm, MPI, UCX, or XPMEM.
  • Exposure to containerized workloads using Singularity or Docker in HPC.
  • Familiarity with OpenStack deployment automation (e.g., TripleO, Kolla, or OpenStack-Ansible).
  • Experience in customer-facing technical roles, with a strong ability to troubleshoot live issues.

SKILLS REQUIRED

Python, OpenStack, Ansible
