October 19, 2020
Conference Paper

Hypergraph Random Walks, Laplacians, and Clustering

Abstract

We propose a flexible framework for clustering hypergraph-structured data based on recently proposed random walks utilizing edge-dependent vertex weights. When incorporating edge-dependent vertex weights (EDVW), a weight is associated with each vertex-hyperedge pair, yielding a weighted incidence matrix of the hypergraph. Such weightings have been utilized in term-document representations of text data sets. We explain how random walks with EDVW serve to construct different hypergraph Laplacian matrices, and then develop a suite of clustering methods that use these incidence matrices and Laplacians for hypergraph clustering. Using 20Newsgroup, U.S. patent, Reuters' Corpus Volume 1, and genetics data sets, we compare the performance of these clustering algorithms experimentally against a variety of existing hypergraph clustering methods. We show that the proposed methods produce higher-quality clusters.

Revised: October 29, 2020 | Published: October 19, 2020

Citation

Hayashi K., S.G. Aksoy, C. Park, and H. Park. 2020. Hypergraph Random Walks, Laplacians, and Clustering. In Proceedings of the 29th ACM International Conference on Information & Knowledge Management (CIKM 2020), October 19-23, 2020, Virtual Event Ireland, 495-504. New York, New York:ACM. PNNL-SA-153213. doi:10.1145/3340531.3412034