June 24, 2011
Conference Paper

Complex Biological Event Extraction from Full Text using Signatures of Linguistic and Semantic Features

Abstract

Building on technical advances from the BioNLP 2009 Shared Task Challenge, the 2011 challenge sets forth to generalize techniques to other complex biological event extraction tasks. In this paper, we present the implementation and evaluation of a signature-based machine-learning technique to predict events from full texts of infectious disease documents. Specifically, our approach uses novel signatures composed of traditional linguistic features and semantic knowledge to predict event triggers and their candidate arguments. Using a leave-one out analysis, we report the contribution of linguistic and shallow semantic features in the trigger prediction and candidate argument extraction. Lastly, we examine evaluations and posit causes for errors of infectious disease track subtasks.

Revised: October 27, 2011 | Published: June 24, 2011

Citation

McGrath L.R., K.O. Domico, C.D. Corley, and B.M. Webb-Robertson. 2011. Complex Biological Event Extraction from Full Text using Signatures of Linguistic and Semantic Features. In The 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: Proceedings of BioNLP Shared Task 2011Workshop, June 24th, Portland, Oregon, 130-137. Stroudsburg, Pennsylvania:Association for Computational Linguistics. PNNL-SA-79051.