June 26, 2025
Conference Paper

Programming the Future: the Essential Role of System Topology Awareness in Heterogeneous Disaggregated Environments

Abstract

Heterogeneous disaggregated systems are a promising way to deliver the performance required by next-generation High-Performance Computing (HPC) workloads. Their heterogeneity, however, significantly complicates application development, making it urgent to find ways to program these systems productively and efficiently without sacrificing performance. In this paper, we explore how system topology information can help address this challenge. We examine the foundations of system topology and how this information could be exploited in high-level programming libraries across all layers of the software infrastructure, from runtime systems to user-facing APIs. We use K-Nearest Neighbors (KNN) as a simple case study, implemented with a distributed programming library prototyped to exploit topology awareness. We demonstrate the solution's effectiveness on a commodity cluster with GPU-equipped nodes and illustrate how it will apply to next-generation disaggregated hardware.

Published: June 26, 2025

Citation

Branchini B., I. Di Dio Lavore, V.G. Castellana, and M. Santambrogio. 2024. Programming the Future: the Essential Role of System Topology Awareness in Heterogeneous Disaggregated Environments. In Proceedings of the International Symposium on Memory Systems (MEMSYS 2024), September 30-October 3, 2024, Washington, D.C., 186-191. New York, New York: Association for Computing Machinery. PNNL-SA-202876. doi:10.1145/3695794.3695811