December 1, 2016
Journal Article

HPC Node Performance and Energy Modeling with the Co-Location of Applications

Abstract

Multicore processors have become an integral part of modern large-scale and high-performance parallel and distributed computing systems. This advance became necessary as the demand for increased system performance has exceeded the limits that single core processors can provide. Unfortunately, applications co-located on multicore processors can suffer from decreased performance and increased dynamic energy use as a result of access to shared resources, such as memory. Consequently, it is increasingly important to characterize the performance of applications that execute on these architectures. This work investigates some of the disadvantages of co-location, and presents a methodology for building models capable of utilizing varying amounts of information about a target application and its co-located applications to make predictions about the target application’s execution time and the system’s energy use under arbitrary co-locations of a wide range of application types. The proposed methodology is validated on three different server class Intel Xeon multicore processors using eleven applications from two scientific benchmark suites. The model’s utility for scheduling is also demonstrated in a simulated large-scale high-performance computing environment through the creation of a co-location aware scheduling heuristic. This heuristic demonstrates that scheduling using information generated with the proposed modeling methodology is capable of making significant improvements over a scheduling heuristic that is naive to co-location interference.

Revised: August 20, 2019 | Published: December 1, 2016

Citation

Dauwe D., E. Jonardi, R.D. Friese, S. Pasricha, A.A. Maciejewski, D.A. Bader, and D.A. Bader, et al. 2016. HPC Node Performance and Energy Modeling with the Co-Location of Applications. Journal of Supercomputing 72, no. 12:4771-4809. PNNL-SA-118182. doi:10.1007/s11227-016-1783-y