Reasoning and querying over data streams rely on the abil-
ity to deliver a sequence of stream snapshots to the processing algo-
rithms. These snapshots are typically provided using windows as views
into streams and associated window management strategies. Generally,
the goal of any window management strategy is to preserve the most im-
portant data in the current window and preferentially evict the rest, so
that the retained data can continue to be exploited. A simple timestamp-
based strategy is rst-in-rst-out (FIFO), in which items are replaced in
strict order of arrival. All timestamp-based strategies implicitly assume
that a temporal ordering reliably re
ects importance to the processing
task at hand, and thus that window management using timestamps will
maximize the ability of the processing algorithms to deliver accurate
interpretations of the stream. In this work, we explore a general no-
tion of semantic importance that can be used for window management
for streams of RDF data using semantically-aware processing algorithms
like deduction or semantic query. Semantic importance exploits the infor-
mation carried in RDF and surrounding ontologies for ranking window
data in terms of its likely contribution to the processing algorithms.
We explore the general semantic categories of query contribution, prove-
nance, and trustworthiness, as well as the contribution of domain-specic
ontologies. We describe how these categories behave using several con-
crete examples. Finally, we consider how a stream window management
strategy based on semantic importance could improve overall processing
performance, especially as available window sizes decrease.
Revised: December 2, 2016 |
Published: November 4, 2016
Citation
Yan R., M.T. Greaves, W.P. Smith, and D.L. McGuinness. 2016.Remembering the Important Things: Semantic Importance in Stream Reasoning. In Stream Reasoning Workshop 2016, October 18, 2016, Kobe, Japan.PNNL-SA-120179.