August 26, 2020
Conference Paper

Detecting Anomalous Computation with RNNs on GPU-Accelerated HPC Machines

Abstract

This paper presents a workload classification framework that discriminates illicit computation from authorized workloads on GPU-accelerated HPC systems. As such systems become more and more powerful, they are exploited by attackers to run malicious and for-profit programs that typically require extremely high computing ability to be successful. Our classification framework leverages the distinctive signatures between illicit and authorized workloads, and explore machine learning methods to learn the workloads and classify them. The framework uses lightweight, non-intrusive workload profiling to collect model input data, and explores multiple machine learning methods, particularly recurrent neural network (RNN) that is suitable for online anomalous workload detection. Evaluation results on three generations of GPU machines demonstrate that the workload classification framework can tell apart the illicit authorized workloads with a high accuracy of over 95%.

Revised: November 4, 2020 | Published: August 26, 2020

Citation

Zou P., A. Li, K.J. Barker, and R. Ge. 2020. Detecting Anomalous Computation with RNNs on GPU-Accelerated HPC Machines. In Proceedings of the 49th International Conference on Parallel Processing (ICPP 2020) August 17-20, 2020, Online., Article No.3404435. New York, New York:Association for Computing Machinery. PNNL-SA-148325. doi:10.1145/3404397.3404435