Workflows are taking an Workflows are taking an increasingly
important role in orchestrating complex scientific processes in
extreme scale and highly heterogeneous environments. However,
to date we cannot reliably predict, understand, and optimize
workflow performance. Sources of performance variability and in
particular the interdependencies of workflow design, execution
environment and system architecture are not well understood.
While there is a rich portfolio of tools for performance analysis,
modeling and prediction for single applications in homogenous
computing environments, these are not applicable to workflows,
due to the number and heterogeneity of the involved workflow
and system components and their strong interdependencies. In this
paper, we investigate workflow performance goals and identify
factors that could have a relevant impact. Based on our analysis,
we propose a new workflow performance provenance ontology,
the Open Provenance Model-based WorkFlow Performance
Provenance, or OPM-WFPP, that will enable the empirical study
of workflow performance characteristics and variability including
complex source attribution.
Revised: January 16, 2017 |
Published: November 15, 2015
Citation
Kleese van Dam K., E.G. Stephan, B. Raju, I. Altintas, T.O. Elsethagen, and S. Krishnamoorthy. 2015.Enabling Structured Exploration of Workflow Performance Variability in Extreme-Scale Environments. In 8th Workshop on Many-Task Computing on Clouds, Grids, and Supercomputers (MTAGS) 2015, November 15, 2015, Austin, Texas. Chicago, Illinois:Data-Intensive Distributed Systems Laboratory.PNNL-SA-120941.