May 29, 2024Open Access

Multilevel Interior Penalty Methods on GPUs

Key Points

Key points are not available for this paper at this time.

Abstract

We present a matrix-free multigrid method for high-order discontinuous Galerkin (DG) finite element methods with GPU acceleration. A performance analysis is conducted, comparing various data and compute layouts. Smoother implementations are optimized through localization and fast diagonalization techniques. Leveraging conflict-free access patterns in shared memory, arithmetic throughput of up to 39% of the peak performance on Nvidia A100 GPUs are achieved. Experimental results affirm the effectiveness of mixed-precision approaches and MPI parallelization in accelerating algorithms. Furthermore, an assessment of solver efficiency and robustness is provided across both two and three dimensions, with applications to Poisson problems.

Read Full Paperexternally

اسأل الذكاء الاصطناعي

Bookmark

View Full Paper

Cite This Study

Cui et al. (Wed,) studied this question.

synapsesocial.com/papers/68e67e28b6db64358760817e https://doi.org/https://doi.org/10.48550/arxiv.2405.18982

اسأل الذكاء الاصطناعي

Bookmark

View Full Paper