dc.contributor.author | Schneible, Joseph | |
dc.contributor.author | Říha, Lubomír | |
dc.contributor.author | Malik, Maria | |
dc.contributor.author | El-Ghazawi, Tarek | |
dc.contributor.author | Alexandru, Andrei | |
dc.date.accessioned | 2015-10-02T13:49:57Z | |
dc.date.available | 2015-10-02T13:49:57Z | |
dc.date.issued | 2015 | |
dc.identifier.citation | Concurrency and Computation: Practice and Experience. 2015, vol. 27, issue 13, p. 3262-3280. | cs |
dc.identifier.issn | 1532-0626 | |
dc.identifier.issn | 1532-0634 | |
dc.identifier.uri | http://hdl.handle.net/10084/110499 | |
dc.description.abstract | In recent years, the use of accelerators in conjunction with CPUs, known as heterogeneous computing, has brought about significant performance increases for scientific applications. One of the best examples of this is lattice quantum chromodynamics (QCD), a stencil operation based simulation. These simulations have a large memory footprint necessitating the use of many graphics processing units (GPUs) in parallel. This requires the use of a heterogeneous cluster with one or more GPUs per node. In order to obtain optimal performance, it is necessary to determine an efficient communication pattern between GPUs on the same node and between nodes. In this paper, we present a performance model based method for minimizing the communication time of applications with stencil operations, such as lattice QCD, on heterogeneous computing systems with a non-blocking InfiniBand interconnection network. The proposed method is able to increase the performance of the most computationally intensive kernel of lattice QCD by 25% due to improved overlapping of communication and computation. We also demonstrate that the aforementioned performance model and efficient communication patterns can be used to determine a cost efficient heterogeneous system design for stencil operation based applications. | cs |
dc.language.iso | en | cs |
dc.publisher | Wiley | cs |
dc.relation.ispartofseries | Concurrency and Computation: Practice and Experience | cs |
dc.relation.uri | http://dx.doi.org/10.1002/cpe.3210 | cs |
dc.rights.uri | Copyright © 2014 John Wiley & Sons, Ltd. | cs |
dc.title | Communication efficient work distributions in stencil operation based applications | cs |
dc.type | article | cs |
dc.identifier.doi | 10.1002/cpe.3210 | |
dc.type.status | Peer-reviewed | cs |
dc.description.source | Web of Science | cs |
dc.description.volume | 27 | cs |
dc.description.issue | 13 | cs |
dc.description.lastpage | 3280 | cs |
dc.description.firstpage | 3262 | cs |
dc.identifier.wos | 000360178400007 | |