July 15, 2020
Conference Paper

Direction-optimizing Label Propagation and its Application for Community Detection

Abstract

Label Propagation is a machine learning algorithm typically used for classification. It has also been found to be an effective method for detecting communities in networks. It has two attractive features as a community detection method: it has nearly linear runtime and it requires no \textit{a priori} community information. We propose a new Direction Optimizing Label Propagation Algorithm (DOLPA) that relies on the use of {\em frontiers} and alternates between label {\em push} and label {\em pull} operations to enhance the performance of LPA. Specifically, DOLPA has parameters for tuning the processing order of vertices in a graph. This reduces the number of edges visited and improves the quality of solution. We apply DOLPA to community detection and present the design and implementation of the algorithm as well as its shared-memory parallelization using OpenMP. Empirically, we evaluate our algorithm using synthetic graphs as well as real-world networks. Compared with the state-of-art \textit{Parallel Label Propagation} algorithm, we achieve at least two times the F-Score while reducing the runtime by 50\% for synthetic graphs with overlapping communities. We also compare DOLPA against the state-of-art parallel implementation of the Louvain method using the same graphs and show that DOLPA achieves about three times the F-Score at 10\% the runtime. On real-world graphs, we get a speedup of up to $10\times$ using 64 threads.

Revised: July 24, 2020 | Published: July 15, 2020

Citation

Liu X., M. Halappanavar, K.J. Barker, A. Lumsdaine, and A. Gebremedhin. 2020. Direction-optimizing Label Propagation and its Application for Community Detection. In Proceedings of the 17th ACM International Conference on Computing Frontiers (CF 2020), June 1-10, 2020. Catania, Italy, 192–201. New York, New York:ACM. PNNL-SA-152667. doi:10.1145/3387902.3392634