Evaluation of the Intel Xeon Phi offload runtimes for domain decomposition solvers

Loading...
Thumbnail Image

Downloads

0

Date issued

Journal Title

Journal ISSN

Volume Title

Publisher

Elsevier

Location

Signature

Abstract

In the paper we provide a comparison of several runtimes which can be used for offloading computationally intensive kernels to the Intel Xeon Phi coprocessors. The presented benchmark application is a stripped-down version of an iterative solver used within the Schur complement finite or boundary element tearing and interconnecting (FETI, BETI) domain decomposition methods where the sparse solve with local stiffness matrices is replaced by the multiplication with dense matrices in order to exploit coalesced memory access patterns. We present offload approaches based on the Intel Language Extension for Offload (LEO), Hetero Streams Library (hStreams), and Heterogeneous Active Messages (HAM), and compare their performance and ease of use.

Description

Subject(s)

Intel Xeon Phi, coprocessor, many-core, offload, domain decomposition

Citation

Advances in Engineering Software. 2018, vol. 125, p. 146-154.