GPU acceleration of hybrid FETI solver for problems of transient nonlinear dynamics
Loading...
Downloads
0
Date issued
Journal Title
Journal ISSN
Volume Title
Publisher
Elsevier
Location
Signature
Abstract
FETI methods, which build on the Finite Element Method, are utilized for large-scale engineering simulations. They use domain decomposition techniques to divide a large domain into many smaller subdomains, which can be processed in parallel. Current trends in HPC focus on GPU-accelerated clusters. To utilize them efficiently, FETI solvers should be able to use these accelerators. Recent developments have demonstrated that the fundamental component of the FETI methods, the dual operator, can be successfully offloaded to the GPU.In this paper, we focus on GPU acceleration of the Hybrid FETI variant. It reduces the size of the projector by using a two-level decomposition, thus allowing for a significantly higher number of compute nodes to be efficiently utilized. In turn, it allows us to split the problem into a larger number of smaller subdomains, which improves single-process performance. We demonstrate the performance on a real-world problem of transient nonlinear dynamics that requires reassembling of the dual operator, preconditioner, and projector during each call of the solver. On the MareNostrum 5 supercomputer, using Nvidia H100 GPUs, we achieved a speedup of 2.9 for the whole Hybrid FETI solver compared to a CPU-only run.
Description
Delayed publication
Available after
Subject(s)
FETI, hybrid FETI, GPU, CUDA, domain decomposition
Citation
Future Generation Computer Systems. 2026, vol. 179, art. no. 108341.