Abstract: This paper introduces a constraint-based approach to clarify missing values in graphs. Our method capitalizes on a set S of graph data constraints. An explanation is a sequence of operational enforcement of S towards the recovery of interested yet missing data (e.g., attribute values, edges). We show that constraint-based approach helps us to understand not only why a value is missing, but also how to recover the missing value. We study S-explanation problem, which is to compute the optimal explanations with guarantees on the informativeness and conciseness. We show the problem is in ?P^2 for established graph data constraints such as graph keys and graph association
rules. We develop an efficient bidirectional algorithm to compute optimal explanations, without enforcing S on the entire graph. We also show our algorithm can be easily extended to support
graph refinement within limited time, and to explain missing answers. Using real-world graphs, we experimentally verify the effectiveness and efficiency of our algorithms.
Published: September 25, 2021
Citation
Song Q., P. Lin, H. Ma, and Y. Wu. 2021.Explaining Missing Data in Graphs: A Constraint-based Approach. In IEEE 37th International Conference on Data Engineering (ICDE 2021), April 19-22, 2021, Chania, Greece, 1476-1487. Piscataway, New Jersey:IEEE.PNNL-SA-159919.doi:10.1109/ICDE51399.2021.00131