Aviation safety reports are the best available source of information about why a flight incident happened. However, stream of consciousness permeates the narratives making difficult the automation of the information extraction task. We propose an approach and infrastructure based on a common pattern specification language to capture relevant information via normalized template expression matching in context. Template expression matching handles variants of multi-word expressions. Normalization improves the likelihood of correct hits by standardizing and cleaning the vocabulary used in narratives. Checking for the presence of negative modifiers in the proximity of a potential hit reduces the chance of false hits. We present the above approach in the context of a specific application, which is the extraction of human performance factors from NASA ASRS reports. While knowledge infusion from experts plays a critical role during the learning phase, early results show that in a production mode, the automated process provides information that is consistent with analyses by human subjects.
Revised: February 1, 2006 |
Published: May 12, 2005
Citation
Posse C., B.D. Matzke, C.M. Anderson, A.J. Brothers, M.M. Matzke, and T.A. Ferryman. 2005.Extracting Information from Narratives: An Application to Aviation Safety Reports. In 2005 IEEE Aerospace Conference, 1-14. Manhattan Beach, California:IEEE Conference Publications. PNWD-SA-6702.