Presentation,e-document

VAVERKA Filip, ©PE«KO Matej, TREEBY Bradley E. and JARO© Jiří. Optimization of Ultrasound Simulations on Multi-GPU Servers. Dallas, 2018. Available from: https://sc18.supercomputing.org/presentation/?id=post255&sess=sess322
Publication language:english
Original title:Optimization of Ultrasound Simulations on Multi-GPU Servers
Pages:1-2
Conference:SuperComputing 2018
Place:Dallas, US
Year:2018
URL:https://sc18.supercomputing.org/presentation/?id=post255&sess=sess322
Files: 
+Type +Name Title Size Last modified
iconabstract.pdfhttps://sc18.supercomputing.org/presentation/?id=post255&sess=sess322223 KB2018-11-23 16:21:29
iconpost255s2-file2.pdfPoster3,34 MB2018-11-23 16:21:29
^ Select all
With selected:
Keywords
Ultrasound simulations, Local Fourier basis decomposition, k-Wave toolbox, Multi-GPU systems, CUDA, MPI
Annotation
Realistic ultrasound simulations have found a broad area of applications in preoperative photoacoustic screening and non-invasive ultrasound treatment planing. However, the domains are typically thousands of wavelengths in size, leading to large-scale numerical models with billions of unknowns. The current trend in accelerated computing is towards the use of fat nodes with multiple GPUs per node. The multi-GPU version of our k-Wave acoustic toolbox is based on the local Fourier basis domain decomposition where 3D simulation domain is partitioned into rectangular cuboid blocks assigned to particular GPUs. This paper investigates the benefits of using the CUDA-Aware MPI and CUDA peer-to-peer transfers on an 8-GPU server equipped with Nvidia P40 GPUs. The server has a total GPU memory of 192 GB and a  single-precision performance of 96 Tflops. These techniques reduces the overall simulation time a factor of 2-3.6.

Your IPv4 address: 18.232.53.231
Switch to https