May 30, 2017
Conference Paper

Generating Performance Models for Irregular Applications

Abstract

Many applications have irregular behavior --- non-uniform input data, input-dependent solvers, irregular memory accesses, unbiased branches --- that cannot be captured using today's automated performance modeling techniques. We describe new hierarchical critical path analyses for the \Palm model generation tool. To create a model's structure, we capture tasks along representative MPI critical paths. We create a histogram of critical tasks with parameterized task arguments and instance counts. To model each task, we identify hot instruction-level sub-paths and model each sub-path based on data flow, instruction scheduling, and data locality. We describe application models that generate accurate predictions for strong scaling when varying CPU speed, cache speed, memory speed, and architecture. We present results for the Sweep3D neutron transport benchmark; Page Rank on multiple graphs; Support Vector Machine with pruning; and PFLOTRAN's reactive flow/transport solver with domain-induced load imbalance.

Revised: July 26, 2017 | Published: May 30, 2017

Citation

Friese R.D., N.R. Tallent, A. Vishnu, D.J. Kerbyson, and A. Hoisie. 2017. Generating Performance Models for Irregular Applications. In IEEE International Parallel and Distributed Processing Symposium (IPDPS 2017), May 29-June 2, 2017, Orlando, Florida, 317-326. Piscataway, New Jersey:IEEE. PNNL-SA-123945. doi:10.1109/IPDPS.2017.61