The capability of OLAP database software systems to handle data complexity comes at a high price for analysts, presenting them a combinatorially vast space of views of a relational database. We respond to the need to deploy technologies sufficient to allow users to guide themselves to areas of local structure by casting the space of ``views'' of an OLAP database as a combinatorial object of all projections and subsets, and ``view discovery'' as an search process over that lattice. We equip the view lattice with statistical information theoretical measures sufficient to support a combinatorial optimization process. We outline ``hop-chaining'' as a particular view discovery algorithm over this object, wherein users are guided across a permutation of the dimensions by searching for successive two-dimensional views, pushing seen dimensions into an increasingly large background filter in a ``spiraling'' search process. We illustrate this work in the context of data cubes recording summary statistics for radiation portal monitors at US ports.
Revised: January 26, 2010 |
Published: May 1, 2009
Citation
Joslyn C.A., E.J. Burke, and T.J. Critchlow. 2009.View Discovery in OLAP Databases through Statistical Combinatorial Optimization. In Proceedings of the 21st International Conference on Scientific and Statistical Database Management, Lecture Notes in Computer Science, 5566, 37-55. New York:Springer Verlag.PNNL-SA-64183.