Pre-exascale accelerated application development: The ORNL Summit experience

L. Luo*, T. P. Straatsma, L. E. Aguilar Suarez, R. Broer, D. Bykov, E. F. D'Azevedo, S. S. Faraji, K. C. Gottiparthi, C. de Graaf, J. A. Harris, R. W. A. Havenith, H. J. Aa Jensen, W. Joubert, R. K. Kathir, J. Larkin, Y. W. Li, D. Lyakh, O. E. B. Messer, M. R. Norman, J. C. OefeleinR. Sankaran, A. F. Tillack, A. L. Barnes, L. Visscher, J. C. Wells, M. Wibowo

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

3 Citations (Scopus)

Abstract

High-performance computing (HPC) increasingly relies on heterogeneous architectures to achieve higher performance. In the Oak Ridge Leadership Facility (OLCF), Oak Ridge, TN, USA, this trend continues as its latest supercomputer, Summit, entered production in early 2019. The combination of IBM POWER9 CPU and NVIDIA V100 GPU, along with a fast NVLink2 interconnect and other latest technologies, pushes system performance to a new height and breaks the exascale barrier by certain measures. Due to Summit's powerful GPUs and much higher GPU-CPU ratio, offloading to accelerators becomes a requirement for any application, which intends to effectively use the system. To facilitate navigating a complex landscape of competing heterogeneous architectures, a collection of applications from a wide spectrum of scientific domains is selected for early adoption on Summit. In this article, the experience and lessons learned are summarized, in the hope of providing useful guidance to address new programming challenges, such as scalability, performance portability, and software maintainability, for future application development efforts on heterogeneous HPC systems.

Original languageEnglish
Article number8960361
Number of pages21
JournalIbm journal of research and development
Volume64
Issue number3-4
DOIs
Publication statusPublished - 1-May-2020

Keywords

  • COUPLED-CLUSTER
  • MASSIVELY-PARALLEL
  • FRAMEWORK
  • CODE

Cite this