Batched transpose-free ADI-type preconditioners for a Poisson solver on GPGPUs

dc.contributor.authorArbenz, Peter
dc.contributor.authorŘíha, Lubomír
dc.date.accessioned2020-02-21T07:30:45Z
dc.date.available2020-02-21T07:30:45Z
dc.date.issued2020
dc.description.abstractWe investigate the iterative solution of a symmetric positive definite linear system involving the shifted Laplacian as the system matrix on General Purpose Graphics Processing Units (GPGPUs). We consider in particular the Chebyshev iteration for its reduced global communication. The ADI-type preconditioner involves solving multiple (batched) symmetric positive tridiagonal Toeplitz systems along each coordinate direction. We investigate several variants how to solve these tridiagonal systems, the Thomas algorithm, the Thomas combined with the SPIKE algorithm, and a polynomial approximation of the inverse. We test the various implementations numerically by means of two-and three-dimensional examples. It turns out that a combination of the Thomas algorithm and the approximate inverse leads to a solution that does not need either tiling or transpositions. As such none of the kernels uses an extensive amount of shared memory which yields a very high GPU utilization and more importantly optimal coalesced global memory access patterns.cs
dc.description.firstpage148cs
dc.description.lastpage159cs
dc.description.sourceWeb of Sciencecs
dc.description.volume137cs
dc.identifier.citationJournal of Parallel and Distributed Computing. 2020, vol. 137, p. 148-159.cs
dc.identifier.doi10.1016/j.jpdc.2019.11.004
dc.identifier.issn0743-7315
dc.identifier.issn1096-0848
dc.identifier.urihttp://hdl.handle.net/10084/139177
dc.identifier.wos000510315300012
dc.language.isoencs
dc.publisherElseviercs
dc.relation.ispartofseriesJournal of Parallel and Distributed Computingcs
dc.relation.urihttps://doi.org/10.1016/j.jpdc.2019.11.004cs
dc.rights© 2019 Elsevier Inc. All rights reserved.cs
dc.subjectshifted Poisson problemcs
dc.subjectADI preconditionercs
dc.subjectbatched triangular systemscs
dc.subjectGeneral Purpose Graphical Processing Unit (GPGPU)cs
dc.titleBatched transpose-free ADI-type preconditioners for a Poisson solver on GPGPUscs
dc.typearticlecs
dc.type.statusPeer-reviewedcs

Files

License bundle

Now showing 1 - 1 out of 1 results
Loading...
Thumbnail Image
Name:
license.txt
Size:
718 B
Format:
Item-specific license agreed upon to submission
Description: