Zobrazit minimální záznam

dc.contributor.authorSchneible, Joseph
dc.contributor.authorŘíha, Lubomír
dc.contributor.authorMalik, Maria
dc.contributor.authorEl-Ghazawi, Tarek
dc.contributor.authorAlexandru, Andrei
dc.date.accessioned2015-10-02T13:49:57Z
dc.date.available2015-10-02T13:49:57Z
dc.date.issued2015
dc.identifier.citationConcurrency and Computation: Practice and Experience. 2015, vol. 27, issue 13, p. 3262-3280.cs
dc.identifier.issn1532-0626
dc.identifier.issn1532-0634
dc.identifier.urihttp://hdl.handle.net/10084/110499
dc.description.abstractIn recent years, the use of accelerators in conjunction with CPUs, known as heterogeneous computing, hasbrou ght about significant performance increases for scientifi c applications. One of the best examples ofthis is lattice quantum chromodynamics (Q CD), a stencil operation based simulation. These simulationshave a large memory footprint necessitating the use of many graphics processing units (GPUs) in parallel.This requires the use of a heterogeneous cluster with one or more GPUs per node. In order to obtainoptimal performance, it is necessary to determine an efficient commu nication pattern bet ween G PUs onthe same node and between nodes. In this paper, we present a performance model based method for min-imizing the communication time of applications with stencil o perations, s uch a s l attice Q CD, o n hetero-geneous computing systems with a non-blocking InfiniBand interconnection network. The proposedmethod is able to increase the performance of the most computationally intensive kernel of lattice QCDby 25% due to improved overlapping of communication and computation. We also demonstrate that theaforementioned performance model and efficient communication patterns can be used to determine a costefficient heterogeneous system design for stencil operation based applications.cs
dc.language.isoencs
dc.publisherWileycs
dc.relation.ispartofseriesConcurrency and Computation: Practice and Experiencecs
dc.relation.urihttp://dx.doi.org/10.1002/cpe.3210cs
dc.rights.uriCopyright © 2014 John Wiley & Sons, Ltd.cs
dc.titleCommunication efficient work distributions in stencil operation based applicationscs
dc.typearticlecs
dc.identifier.doi10.1002/cpe.3210
dc.type.statusPeer-reviewedcs
dc.description.sourceWeb of Sciencecs
dc.description.volume27cs
dc.description.issue13cs
dc.description.lastpage3280cs
dc.description.firstpage3262cs
dc.identifier.wos000360178400007


Soubory tohoto záznamu

SouboryVelikostFormátZobrazit

K tomuto záznamu nejsou připojeny žádné soubory.

Tento záznam se objevuje v následujících kolekcích

Zobrazit minimální záznam