October 29, 2020
Conference Paper

Geomancy: Automated Performance Enhancement through Data Layout Optimization

Abstract

Large distributed storage systems such as high- performance computing (HPC) systems used by national or international laboratories require sufficient performance and scale for demanding scientific workloads and must handle shifting workloads with ease. Ideally, data is placed in locations to optimize performance, but the size and complexity of large storage systems inhibit rapid effective restructuring of data layouts to maintain performance as workloads shift. To address these issues, we have developed Geomancy, a tool that models the placement of data within a distributed storage system and reacts to drops in performance. Using a combination of machine learning techniques suitable for temporal modeling, Geomancy determines when and where a bottleneck may happen due to changing workloads and suggests changes in the layout that mitigate or prevent them. Our approach to optimizing throughput offers benefits for storage systems such as avoiding potential bottlenecks and increasing overall I/O throughput from 11% to 30%.

Revised: September 2, 2020 | Published: October 29, 2020

Citation

Bel O.M., K. Chang, N.R. Tallent, D. Duellmann, E.L. Miller, F. Faisal Nawab, and D. Long. 2020. "Geomancy: Automated Performance Enhancement through Data Layout Optimization." In Proceedings of the 36th International Conference on Massive Storage Systems and Technology (MSST 2020), October 29-30, 2020, Santa Clara, CA. Santa Cruz, California:Center for Research in Storage Systems, University of California, Santa Cruz. PNNL-SA-155741.

Research topics