GPU-acceleration of the Discontinuous Galerkin Shallow Water Equations Solver (DG-SWEM) using CUDA and OpenACC

arXiv:2508.21208v1 Announce Type: new
Abstract: This paper presents a porting of DG-SWEM, a discontinuous Galerkin solver for coastal ocean circulation, and in particular storm surge, to GPU using two separate approaches: CUDA Fortran and OpenACC. Time-explicit discontinuous Galerkin methods have been shown to exhibit a large amount of data parallelism due to the loose coupling between elements, and thus are naturally mapped to the GPU architecture. For each porting approach, we discuss the code design, ease of programming, and performance when running on realistic use cases. Specifically for the OpenACC version, we also aim to preserve maintainability within the same codebase through using Unified Memory. We test the codes on NVIDIA’s Grace Hopper chip and compare the GPU performance on each node to the MPI version on a single CPU node (144 cores).

More From Author

The nox-minima dome: a tangible alternative to flat star maps for astronomy education

Bright yet dark: how strong coupling quenches exciton-polariton radiation

Leave a Reply

Your email address will not be published. Required fields are marked *