December 16, 2019
Conference Paper

Fingerprinting Anomalous Computation with RNN for GPU-accelerated HPC Machines

Abstract

This paper presents a workload classification framework that discriminates illicit computation from authorized workloads on GPU-accelerated HPC systems. As such systems become more and more powerful, they are exploited by attackers to run malicious and for-profit programs that typically require extremely high computing ability to be successful. Our classification framework leverages the distinctive signatures between illicit and authorized workloads, and explore machine learning methods to learn the workloads and classify them. The framework uses lightweight, non-intrusive workload profiling to collect model input data, and explores multiple machine learning methods, particularly recurrent neural network (RNN) that is suitable for online anomalous workload detection. Evaluation results on four generations of GPU machines demonstrate that the workload classification framework can tell apart the illicit authorized workloads with a high accuracy of over 95%.

Revised: May 7, 2020 | Published: December 16, 2019

Citation

Zou P., A. Li, K.J. Barker, and R. Ge. 2019. Fingerprinting Anomalous Computation with RNN for GPU-accelerated HPC Machines. In IEEE International Symposium on Workload Characterization (IISWC 2019), November 3-5, 2019, Orlando, FL, 253-256. Piscataway, New Jersey:IEEE. PNNL-SA-144356. doi:10.1109/IISWC47752.2019.9042165