Evaluation of the Intel Xeon Phi offload runtimes for domain decomposition solvers
Loading...
Downloads
0
Date issued
Journal Title
Journal ISSN
Volume Title
Publisher
Elsevier
Location
Signature
Abstract
In the paper we provide a comparison of several runtimes which can be used for offloading computationally intensive kernels to the Intel Xeon Phi coprocessors. The presented benchmark application is a stripped-down version of an iterative solver used within the Schur complement finite or boundary element tearing and interconnecting (FETI, BETI) domain decomposition methods where the sparse solve with local stiffness matrices is replaced by the multiplication with dense matrices in order to exploit coalesced memory access patterns. We present offload approaches based on the Intel Language Extension for Offload (LEO), Hetero Streams Library (hStreams), and Heterogeneous Active Messages (HAM), and compare their performance and ease of use.
Description
Subject(s)
Intel Xeon Phi, coprocessor, many-core, offload, domain decomposition
Citation
Advances in Engineering Software. 2018, vol. 125, p. 146-154.
Item identifier
Collections
Publikační činnost VŠB-TUO ve Web of Science / Publications of VŠB-TUO in Web of Science
Publikační činnost IT4Innovations / Publications of IT4Innovations (9600)
Publikační činnost Katedry aplikované matematiky / Publications of Department of Applied Mathematics (470)
Články z časopisů s impakt faktorem / Articles from Impact Factor Journals
Publikační činnost IT4Innovations / Publications of IT4Innovations (9600)
Publikační činnost Katedry aplikované matematiky / Publications of Department of Applied Mathematics (470)
Články z časopisů s impakt faktorem / Articles from Impact Factor Journals