dc.contributor.author | Arbenz, Peter | |
dc.contributor.author | Říha, Lubomír | |
dc.date.accessioned | 2020-02-21T07:30:45Z | |
dc.date.available | 2020-02-21T07:30:45Z | |
dc.date.issued | 2020 | |
dc.identifier.citation | Journal of Parallel and Distributed Computing. 2020, vol. 137, p. 148-159. | cs |
dc.identifier.issn | 0743-7315 | |
dc.identifier.issn | 1096-0848 | |
dc.identifier.uri | http://hdl.handle.net/10084/139177 | |
dc.description.abstract | We investigate the iterative solution of a symmetric positive definite linear system involving the shifted Laplacian as the system matrix on General Purpose Graphics Processing Units (GPGPUs). We consider in particular the Chebyshev iteration for its reduced global communication. The ADI-type preconditioner involves solving multiple (batched) symmetric positive tridiagonal Toeplitz systems along each coordinate direction. We investigate several variants how to solve these tridiagonal systems, the Thomas algorithm, the Thomas combined with the SPIKE algorithm, and a polynomial approximation of the inverse. We test the various implementations numerically by means of two-and three-dimensional examples. It turns out that a combination of the Thomas algorithm and the approximate inverse leads to a solution that does not need either tiling or transpositions. As such none of the kernels uses an extensive amount of shared memory which yields a very high GPU utilization and more importantly optimal coalesced global memory access patterns. | cs |
dc.language.iso | en | cs |
dc.publisher | Elsevier | cs |
dc.relation.ispartofseries | Journal of Parallel and Distributed Computing | cs |
dc.relation.uri | https://doi.org/10.1016/j.jpdc.2019.11.004 | cs |
dc.rights | © 2019 Elsevier Inc. All rights reserved. | cs |
dc.subject | shifted Poisson problem | cs |
dc.subject | ADI preconditioner | cs |
dc.subject | batched triangular systems | cs |
dc.subject | General Purpose Graphical Processing Unit (GPGPU) | cs |
dc.title | Batched transpose-free ADI-type preconditioners for a Poisson solver on GPGPUs | cs |
dc.type | article | cs |
dc.identifier.doi | 10.1016/j.jpdc.2019.11.004 | |
dc.type.status | Peer-reviewed | cs |
dc.description.source | Web of Science | cs |
dc.description.volume | 137 | cs |
dc.description.lastpage | 159 | cs |
dc.description.firstpage | 148 | cs |
dc.identifier.wos | 000510315300012 | |