October 20, 2003
Conference Paper

Multi-scale Science: Supporting Emerging Practice with Semantically Derived Provenance

Abstract

Scientific progress is becoming increasingly dependent on our ability to study phenomena at multiple scales and from multiple perspectives. The ability to recontextualize third-party data within the semantic and syntactic framework of a given research project is increasingly seen as a primary barrier in multi-scale science. Within the Collaboratory for Multiscale Chemical Science (CMCS) project, we are developing a general-purpose informatics-based approach that emphasizes “on-demand” metadata creation, configurable data translations, and semantic mapping to support the rapidly increasing and continually evolving requirements for managing data, metadata, and data relationships in such projects. A concrete example of this approach is the design of the CMCS provenance subsystem. The concept of provenance varies across communities, and multiple independent applications contribute to and use provenance. In CMCS, we have developed generic tools for viewing provenance relationships and for using them to, for example, scope notifications and searches. These tools rely on a configurable concept of provenance defined in terms of other relationships. The result is a very flexible mechanism capable of tracking data provenance across many disciplines and supporting multiple uses of provenance information.
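The idea of a configurable concept of provenance defined in terms of other relationships can be illustrated with a minimal sketch. It is not the CMCS implementation: the relationship names, resource names, and the `provenance` function are all hypothetical, chosen only to show how a configured set of relationships could determine what counts as provenance when scoping a search.

```python
from collections import deque

# Hypothetical relationship names (assumption); the vocabulary actually
# used in CMCS is not given in the abstract.
PROVENANCE_RELATIONS = {"derived-from", "translated-from", "has-input"}

# Illustrative (subject, relation, object) triples, invented for this sketch.
TRIPLES = [
    ("table.xml", "translated-from", "table.csv"),
    ("table.csv", "derived-from", "run42.out"),
    ("run42.out", "has-input", "run42.inp"),
    ("table.xml", "annotated-by", "note1.txt"),
]


def provenance(resource, triples, relations):
    """Return every resource reachable from `resource` by following
    only the relationships configured as provenance."""
    seen = set()
    queue = deque([resource])
    while queue:
        current = queue.popleft()
        for subj, rel, obj in triples:
            if subj == current and rel in relations and obj not in seen:
                seen.add(obj)
                queue.append(obj)
    return seen


# Scope a search to the provenance of one resource.
scope = provenance("table.xml", TRIPLES, PROVENANCE_RELATIONS)
```

In this sketch, changing the configured set (for example, passing only `{"translated-from"}`) narrows the scope without touching the underlying relationship data, which is the kind of flexibility the abstract describes.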

Revised: August 15, 2005 | Published: October 20, 2003

Citation

Myers J.D., C.M. Pancerella, C.S. Lansing, K.L. Schuchardt, and B.T. Didier. 2003. Multi-scale Science: Supporting Emerging Practice with Semantically Derived Provenance. In Proceedings of the Workshop on Semantic Web Technologies for Searching and Retrieving Scientific Data, edited by N. Ashish and C. Goble. Aachen: Sun SITE Central Europe (CEUR). PNNL-SA-39195.