Implementation of the efficient communication layer for the highly parallel total FETI and hybrid total FETI solvers



Publisher

Elsevier

Abstract

This paper describes the implementation, performance, and scalability of our communication layer developed for Total FETI (TFETI) and Hybrid Total FETI (HTFETI) solvers. HTFETI is based on our variant of the Finite Element Tearing and Interconnecting (FETI) type domain decomposition method. In this approach, a small number of neighboring subdomains is aggregated into clusters, which results in a smaller coarse problem. To solve the original problem, the TFETI method is applied twice: to the clusters and then to the subdomains in each cluster. The current implementation of the solver is focused on the performance optimization of the main CG iteration loop, including: implementation of communication hiding and avoiding techniques for global communications; optimization of the nearest neighbor communication (multiplication with a global gluing matrix); and optimization of the parallel CG algorithm to iterate over local Lagrange multipliers only. The performance is demonstrated on a linear elasticity 3D cube and real-world benchmarks.
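
The last optimization listed above, iterating over local Lagrange multipliers only, can be pictured with a small sketch. It is illustrative and not taken from the solver: it assumes each process stores only the multipliers attached to its own subdomains, so a multiplier on an interface is held by more than one process and a global dot product must count it once. The ownership rule (lowest rank wins), the three-process layout, and all names are hypothetical, and the final sum stands in for a global reduction such as an MPI all-reduce.

```python
# Hypothetical layout: for each process (rank), the global indices of the
# Lagrange multipliers that touch its subdomains. Interface multipliers
# appear on more than one rank.
local_ids = {
    0: [0, 1, 2],
    1: [1, 2, 3, 4],
    2: [3, 4, 5],
}


def owner_mask(rank, local_ids):
    """Return 1.0 for multipliers owned by `rank`, 0.0 for shared copies.

    Assumed rule: the lowest rank holding a multiplier owns it.
    """
    mask = []
    for g in local_ids[rank]:
        owner = min(r for r, ids in local_ids.items() if g in ids)
        mask.append(1.0 if owner == rank else 0.0)
    return mask


def local_dot(rank, x_loc, y_loc, local_ids):
    """Partial dot product over the locally stored, locally owned entries."""
    mask = owner_mask(rank, local_ids)
    return sum(m * a * b for m, a, b in zip(mask, x_loc, y_loc))


def distributed_dot(x, y, local_ids):
    """Simulate the distributed dot product on one machine.

    The global vectors are restricted to each rank only for this demo;
    the final sum plays the role of the global reduction.
    """
    partials = []
    for rank, ids in local_ids.items():
        x_loc = [x[g] for g in ids]
        y_loc = [y[g] for g in ids]
        partials.append(local_dot(rank, x_loc, y_loc, local_ids))
    return sum(partials)


x = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
y = [6.0, 5.0, 4.0, 3.0, 2.0, 1.0]
result = distributed_dot(x, y, local_ids)
```

Under these assumptions, `result` matches the dot product of the full vectors, while each process only ever touches the entries it stores.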

Subject(s)

FETI, hybrid total FETI, total FETI, domain decomposition, scalability, HPC

Citation

Parallel Computing. 2016, vol. 57, p. 154-166.