DGX-A100 face to face DGX-2-performance, power and thermal behavior evaluation

Loading...
Thumbnail Image

Downloads

6

Date issued

Journal Title

Journal ISSN

Volume Title

Publisher

MDPI

Location

Signature

License

Abstract

Nvidia is a leading producer of GPUs for high-performance computing and artificial intelligence, bringing top performance and energy-efficiency. We present performance, power consumption, and thermal behavior analysis of the new Nvidia DGX-A100 server equipped with eight A100 Ampere microarchitecture GPUs. The results are compared against the previous generation of the server, Nvidia DGX-2, based on Tesla V100 GPUs. We developed a synthetic benchmark to measure the raw performance of floating-point computing units including Tensor Cores. Furthermore, thermal stability was investigated. In addition, Dynamic Frequency and Voltage Scaling (DVFS) analysis was performed to determine the best energy-efficient configuration of the GPUs executing workloads of various arithmetical intensities. Under the energy-optimal configuration the A100 GPU reaches efficiency of 51 GFLOPS/W for double-precision workload and 91 GFLOPS/W for tensor core double precision workload, which makes the A100 the most energy-efficient server accelerator for scientific simulations in the market.

Description

Subject(s)

DGX-A100, DGX-2, tensor cores, performance analysis, energy efficient computing, DVFS, power-aware computing, high performance computing

Citation

Energies. 2021, vol. 14, issue 2, art. no. 376.