Senior HPC Engineer

Netweb Technologies India Ltd., India
12 hours ago
Job description

Job Title: Senior Engineer-HPC

Department: Production & Support

Location: Faridabad

Position Summary:

Accomplished HPC Systems Engineer with 8–10 years of enterprise Linux administration and over 5 years of hands-on experience managing large-scale HPC clusters exceeding 500 cores and multi-petabyte storage environments. Proven expertise in designing, implementing, and optimizing HPC infrastructure, including compute, storage, and high-speed networking, to deliver maximum performance for demanding workloads.

Key Responsibilities:

HPC Cluster Management & Optimization

  • Design, implement, and maintain HPC environments, including compute, storage, and network components.
  • Configure and optimize Slurm, PBS Pro, or other workload managers/schedulers for efficient job scheduling and resource allocation.
  • Implement performance tuning for CPU, GPU, memory, I/O, and network subsystems to meet workload demands.
  • Manage HPC filesystem solutions such as Lustre, BeeGFS, or GPFS/Spectrum Scale.

Linux Administration

  • Administer enterprise-grade Linux distributions (RHEL, CentOS, Rocky, Ubuntu) in large-scale compute environments.
  • Manage kernel upgrades, patching, and security hardening.
  • Troubleshoot kernel-level and system-level issues for performance and stability.

Automation & Configuration Management

  • Develop and maintain Ansible playbooks/roles for automated provisioning, configuration, and patching of HPC systems.
  • Integrate Ansible with CI/CD pipelines for infrastructure as code (IaC) practices.
  • Automate cluster deployment and environment consistency across hundreds of nodes.

Monitoring, Troubleshooting & Support

  • Implement and maintain monitoring tools (e.g., Grafana, Prometheus, Nagios, Ganglia).
  • Troubleshoot complex HPC workloads, MPI communication issues, and application performance bottlenecks.
  • Provide Tier-3 escalation support for Linux/HPC-related incidents.

Collaboration & Documentation

  • Work closely with research teams, DevOps engineers, and system architects to deliver high-performance solutions.
  • Document architecture, SOPs, troubleshooting guides, and performance tuning methodologies.

Requirements:

Required Skills & Experience

  • 8–10 years of hands-on Linux system administration experience in production environments.
  • 5+ years managing HPC clusters at scale (500+ cores, multiple petabytes of storage).
  • Strong Ansible automation skills (complex playbooks, roles, variables, templates).
  • Deep understanding of MPI, OpenMP, and GPU/accelerator integration in HPC workloads.
  • Proficient with HPC job schedulers (Slurm, PBS Pro, LSF).
  • Experience with HPC storage (Lustre, BeeGFS, GPFS).
  • Strong knowledge of TCP/IP networking, InfiniBand, and RDMA technologies.
  • Experience with performance tuning and benchmarking tools (perf, HPCToolkit, Intel VTune, iperf, fio).
  • Scripting proficiency in Bash, Python, or Perl for automation and tooling.

Preferred Qualifications

  • Experience with containerized HPC (Singularity, Apptainer, or Podman).
  • Familiarity with cloud-HPC integration (AWS ParallelCluster, Azure CycleCloud, GCP HPC).
  • Knowledge of security compliance standards (CIS benchmarks, STIG).
  • Contribution to HPC community tools or open-source projects.

Soft Skills

  • Strong problem-solving and analytical thinking.
  • Ability to mentor junior engineers and collaborate across teams.
  • Excellent communication skills for technical and non-technical stakeholders.