- Parallel Computing and Optimization Techniques
- Advanced Data Storage Technologies
- Distributed and Parallel Computing Systems
- Gas Dynamics and Kinetic Theory
- Computational Fluid Dynamics and Aerodynamics
- Plasma and Flow Control in Aerodynamics
- Real-time simulation and control systems
Georgia Institute of Technology
2023-2024
University of Illinois Urbana-Champaign
2022
University of Delaware
2022
Institute of Technology of Cambodia
2022
University of Basel
2022
Swisscom (Switzerland)
2022
CSCS - Swiss National Supercomputing Centre
2022
Sandia National Laboratories
2022
Helmholtz-Zentrum Dresden-Rossendorf
2022
Nvidia (United States)
2022
This paper assesses and reports the experience of ten teams working to port, validate, benchmark several High Performance Computing applications on a novel GPU-accelerated Arm testbed system. The consists eight NVIDIA HPC Developer Kit systems, each one equipped with server-class CPU from Ampere two data center GPUs Corp. systems are connected together using InfiniBand interconnect. selected mini-apps written programming languages use multiple accelerator-based models for such as CUDA,...
Multiphase compressible flows are often characterized by a broad range of space and time scales. Thus entailing large grids small steps, simulations these on CPU-based clusters can thus take several wall-clock days. Offloading the compute kernels to GPUs appears attractive but is memory-bound for standard finite-volume -difference methods, damping speed-ups. Even when realized, faster GPU-based lead more intrusive communication I/O times. We present portable strategy GPU acceleration...
This paper assesses and reports the experience of ten teams working to port,validate, benchmark several High Performance Computing applications on a novel GPU-accelerated Arm testbed system. The consists eight NVIDIA HPC Developer Kit systems built by GIGABYTE, each one equipped with server-class CPU from Ampere A100 data center GPU Corp. are connected together using Infiniband high-bandwidth low-latency interconnect. selected mini-apps written programming languages use multiple...
Multiphase compressible flows are often characterized by a broad range of space and time scales, entailing large grids small steps. Simulations these on CPU-based clusters can thus take several wall-clock days. Offloading the compute kernels to GPUs appears attractive but is memory-bound for many finite-volume -difference methods, damping speed-ups. Even when realized, GPU-based lead more intrusive communication I/O times owing lower computation costs. We present strategy GPU acceleration...