1、Accelerating HPC Applications with SmartNICsDonglai DaiChief Engineercontactusx-San Jose,CA April 26-28,2022Outline Motivation Basic Idea for MVAPICH2-DPU Library Design Main Features of MVAPICH2-DPU Library Performance Benefits for Benchmarks and Applications ConclusionSan Jose,CA April 26-28,2022R
2、equirements for Next-Generation Communication Libraries SmartNICs have the potential to take over a wide range of overhead tasks in a variety of applications from the host CPUs in systems Message Passing Interface(MPI)libraries are widely used for parallel and distributed HPC and AI applications in
3、HPC/data centers and clouds Requirements for a high-performance and scalable MPI library:Low latency communication High bandwidth communication Minimum contention for host CPU resources to progress non-blocking collectives High overlap of computation with communication CPU based non-blocking communi
4、cation progress can lead to sub-par performance as the main application has less CPU resources for useful application-level computationSan Jose,CA April 26-28,2022Can MPI Functions be Offloaded?The area of network offloading of MPI primitives is still nascentState-of-the-art BlueField DPUs bring mor
5、e compute power into the networkExploit additional compute capabilities of modern BlueField DPUs into existing MPI middleware to extractPeak pure communication performance Overlap of communication and computationSan Jose,CA April 26-28,2022Outline Motivation Basic Idea for MVAPICH2-DPU Library Desig
6、n Main Features of MVAPICH2-DPU Library Performance Benefits for Benchmarks and Applications ConclusionSan Jose,CA April 26-28,2022Overview of BlueField-2 DPUConnectX-6 network adapter with 200Gbps InfiniBandSystem-on-chip containing eight 64-bit ARMv8 A72 cores with 2.7 GHz each16GB of memory for t