
dc.contributor.author: Zapletal, Jan
dc.contributor.author: Merta, Michal
dc.contributor.author: Malý, Lukáš
dc.date.accessioned: 2017-07-12T07:11:10Z
dc.date.available: 2017-07-12T07:11:10Z
dc.date.issued: 2017
dc.identifier.citation: Computers & Mathematics with Applications. 2017, vol. 74, issue 1, p. 157-173. [cs]
dc.identifier.issn: 0898-1221
dc.identifier.issn: 1873-7668
dc.identifier.uri: http://hdl.handle.net/10084/117166
dc.description.abstract: In the paper we study the performance of the regularized boundary element quadrature routines implemented in the BEM4I library developed by the authors. Apart from the results obtained on the classical multi-core architecture represented by the Intel Xeon processors, we concentrate on the portability of the code to the many-core Intel Xeon Phi family. In contrast to the GP-GPU programming that accelerates many scientific codes, the standard x86 architecture of the Xeon Phi processors allows the existing multi-core implementation to be reused. Although in many cases a simple recompilation would lead to inefficient utilization of the Xeon Phi, the effort invested in the optimization usually leads to better performance on the multi-core Xeon processors as well. This makes the Xeon Phi an interesting platform for scientists developing a software library aimed at both modern portable PCs and high performance computing environments. Here we focus on the manually vectorized assembly of the local element contributions and the parallel assembly of the global matrices on shared memory systems. Due to the quadratic complexity of the standard assembly, we also present an assembly sparsified by the adaptive cross approximation and based on the same acceleration techniques. Numerical experiments performed on the Xeon multi-core processor and two generations of the Xeon Phi many-core platform validate the proposed implementation and highlight the importance of vectorization for exploiting the features of modern hardware. [cs]
dc.language.iso: en [cs]
dc.publisher: Elsevier [cs]
dc.relation.ispartofseries: Computers & Mathematics with Applications [cs]
dc.relation.uri: https://doi.org/10.1016/j.camwa.2017.01.018 [cs]
dc.rights: © 2017 Elsevier Ltd. All rights reserved. [cs]
dc.subject: boundary element method [cs]
dc.subject: quadrature [cs]
dc.subject: SIMD [cs]
dc.subject: vectorization [cs]
dc.subject: Intel Xeon Phi [cs]
dc.subject: many-core architecture [cs]
dc.title: Boundary element quadrature schemes for multi- and many-core architectures [cs]
dc.type: article [cs]
dc.identifier.doi: 10.1016/j.camwa.2017.01.018
dc.type.status: Peer-reviewed [cs]
dc.description.source: Web of Science [cs]
dc.description.volume: 74 [cs]
dc.description.issue: 1 [cs]
dc.description.lastpage: 173 [cs]
dc.description.firstpage: 157 [cs]
dc.identifier.wos: 000403633600013


Files in this item

There are no files associated with this item.
