Boundary element quadrature schemes for multi- and many-core architectures

Zapletal, Jan; Merta, Michal; Malý, Lukáš

dc.contributor.author	Zapletal, Jan
dc.contributor.author	Merta, Michal
dc.contributor.author	Malý, Lukáš
dc.date.accessioned	2017-07-12T07:11:10Z
dc.date.available	2017-07-12T07:11:10Z
dc.date.issued	2017
dc.identifier.citation	Computers & Mathematics with Applications. 2017, vol. 74, issue 1, p. 157-173.	cs
dc.identifier.issn	0898-1221
dc.identifier.issn	1873-7668
dc.identifier.uri	http://hdl.handle.net/10084/117166
dc.description.abstract	In the paper we study the performance of the regularized boundary element quadrature routines implemented in the BEM4I library developed by the authors. Apart from the results obtained on the classical multi-core architecture represented by the Intel Xeon processors we concentrate on the portability of the code to the many-core family Intel Xeon Phi. Contrary to the GP-GPU programming accelerating many scientific codes, the standard x86 architecture of the Xeon Phi processors allows to reuse the already existing multi-core implementation. Although in many cases a simple recompilation would lead to an inefficient utilization of the Xeon Phi, the effort invested in the optimization usually leads to a better performance on the multi-core Xeon processors as well. This makes the Xeon Phi an interesting platform for scientists developing a software library aimed at both modern portable PCs and high performance computing environments. Here we focus at the manually vectorized assembly of the local element contributions and the parallel assembly of the global matrices on shared memory systems. Due to the quadratic complexity of the standard assembly we also present an assembly sparsified by the adaptive cross approximation based on the same acceleration techniques. The numerical results performed on the Xeon multi-core processor and two generations of the Xeon Phi many-core platform validate the proposed implementation and highlight the importance of vectorization necessary to exploit the features of modern hardware.	cs
dc.language.iso	en	cs
dc.publisher	Elsevier	cs
dc.relation.ispartofseries	Computers & Mathematics with Applications	cs
dc.relation.uri	https://doi.org/10.1016/j.camwa.2017.01.018	cs
dc.rights	© 2017 Elsevier Ltd. All rights reserved.	cs
dc.subject	boundary element method	cs
dc.subject	quadrature	cs
dc.subject	SIMD	cs
dc.subject	vectorization	cs
dc.subject	Intel Xeon Phi	cs
dc.subject	many-core architecture	cs
dc.title	Boundary element quadrature schemes for multi- and many-core architectures	cs
dc.type	article	cs
dc.identifier.doi	10.1016/j.camwa.2017.01.018
dc.type.status	Peer-reviewed	cs
dc.description.source	Web of Science	cs
dc.description.volume	74	cs
dc.description.issue	1	cs
dc.description.lastpage	173	cs
dc.description.firstpage	157	cs
dc.identifier.wos	000403633600013

Soubory tohoto záznamu

Soubory	Velikost	Formát	Zobrazit
K tomuto záznamu nejsou připojeny žádné soubory.

Tento záznam se objevuje v následujících kolekcích

Publikační činnost VŠB-TUO ve Web of Science / Publications of VŠB-TUO in Web of Science [7798]
Kolekce obsahuje bibliografické záznamy článků akademických pracovníků VŠB-TUO publikovaných v časopisech indexovaných ve Web of Science od roku 1990 po současnost.
Články z časopisů s impakt faktorem / Articles from Impact Factor Journals [6377]
Články z časopisů (od roku 2008), které v době vydání článku měly impakt faktor.
Publikační činnost Katedry aplikované matematiky / Publications of Department of Applied Mathematics (470) [318]
Kolekce obsahuje bibliografické záznamy publikační činnosti (článků) akademických pracovníků Katedry aplikované matematiky (470) v časopisech registrovaných ve Web of Science od roku 2003 po současnost.
Publikační činnost IT4Innovations / Publications of IT4Innovations (9600) [841]

Zobrazit minimální záznam