September 25, 2021
Conference Paper

Explaining Missing Data in Graphs: A Constraint-based Approach

Abstract

Abstract: This paper introduces a constraint-based approach to clarify missing values in graphs. Our method capitalizes on a set S of graph data constraints. An explanation is a sequence of operational enforcement of S towards the recovery of interested yet missing data (e.g., attribute values, edges). We show that constraint-based approach helps us to understand not only why a value is missing, but also how to recover the missing value. We study S-explanation problem, which is to compute the optimal explanations with guarantees on the informativeness and conciseness. We show the problem is in ?P^2 for established graph data constraints such as graph keys and graph association rules. We develop an efficient bidirectional algorithm to compute optimal explanations, without enforcing S on the entire graph. We also show our algorithm can be easily extended to support graph refinement within limited time, and to explain missing answers. Using real-world graphs, we experimentally verify the effectiveness and efficiency of our algorithms.

Published: September 25, 2021

Citation

Song Q., P. Lin, H. Ma, and Y. Wu. 2021. Explaining Missing Data in Graphs: A Constraint-based Approach. In IEEE 37th International Conference on Data Engineering (ICDE 2021), April 19-22, 2021, Chania, Greece, 1476-1487. Piscataway, New Jersey:IEEE. PNNL-SA-159919. doi:10.1109/ICDE51399.2021.00131