About BQP
BQP is building the next-generation simulation platform,
BQPhy® , designed to solve the most complex computational challenges in aerospace, space, and defense. The platform integrates advanced solvers with proprietary quantum-inspired algorithms, delivering performance beyond the capabilities of modern GPUs. Running on classical high-performance computing systems, BQPhy® has demonstrated
up to 10X computational advantages
for aerospace and defense clients. The platform is built to transition seamlessly to quantum-native hardware as it matures, enabling sustained technical superiority and reduced development costs across industries.
About this Role
We are seeking a highly motivated HPC Performance Profiling Intern to join our High-Performance Computing (HPC) team. The intern will focus on CPU / MPI / GPU performance profiling and optimization for our advanced HPC simulation and optimization frameworks. This role is critical to addressing current performance bottlenecks and ensuring scalability for production-level workflows.
The selected candidate will gain exposure to cutting-edge HPC applications, profiling tools, and parallelization paradigms (OpenMP, OpenACC, CUDA, SYCL), while working closely with senior HPC engineers on both in-house code and third-party integrations.
Key Responsibilities
Conduct strong and weak scaling analyses for complex HPC workloads.
Perform in-depth profiling of codebases, identifying performance bottlenecks across CPU, GPU, and MPI / OpenMP layers.
Utilize tools such as Nvidia Nsight Compute, Intel vTune, Intel MPI Profiler, mpiP, Scalasca for detailed performance analysis.
Contribute to the development of a performance testing and monitoring framework for HPC workflows.
Prepare roofline plots, occupancy heat maps, and performance visualizations for reporting.
Collaborate with the HPC team to optimize application performance and enhance code efficiency.
Support the creation of technical reports, white papers, conference presentations, and journal publicationsbased on findings.
Required Qualifications
Strong programming skills in C++.
Proficiency with MPI / OpenMP for parallel programming.
Hands-on experience with CUDA for GPU programming.
Strong analytical skills with the ability to interpret and visualize performance metrics.
Preferred / Good-to-Have Skills
Familiarity with performance profiling tools : Nvidia Nsight Compute, Intel vTune, Intel MPI Profiler, mpiP, Scalasca.
Knowledge of scaling concepts (strong / weak scaling), roofline modeling, and HPC performance visualization techniques.
Exposure to technical writing : reports, white papers, or academic articles.
Background in HPC research, computational sciences, or applied mathematics.
Performance Computing • Raipur, Chhattisgarh, India