DGX-A100 face to face DGX-2-performance, power and thermal behavior evaluation
Loading...
Downloads
6
Date issued
Journal Title
Journal ISSN
Volume Title
Publisher
MDPI
Location
Signature
License
Abstract
Nvidia is a leading producer of GPUs for high-performance computing and artificial intelligence, bringing top performance and energy-efficiency. We present performance, power consumption, and thermal behavior analysis of the new Nvidia DGX-A100 server equipped with eight A100 Ampere microarchitecture GPUs. The results are compared against the previous generation of the server, Nvidia DGX-2, based on Tesla V100 GPUs. We developed a synthetic benchmark to measure the raw performance of floating-point computing units including Tensor Cores. Furthermore, thermal stability was investigated. In addition, Dynamic Frequency and Voltage Scaling (DVFS) analysis was performed to determine the best energy-efficient configuration of the GPUs executing workloads of various arithmetical intensities. Under the energy-optimal configuration the A100 GPU reaches efficiency of 51 GFLOPS/W for double-precision workload and 91 GFLOPS/W for tensor core double precision workload, which makes the A100 the most energy-efficient server accelerator for scientific simulations in the market.
Description
Subject(s)
DGX-A100, DGX-2, tensor cores, performance analysis, energy efficient computing, DVFS, power-aware computing, high performance computing
Citation
Energies. 2021, vol. 14, issue 2, art. no. 376.