We describe MedstractPlus, a resource for mining relations from the Medline bibliographic database that is currently under construction. It was built on the remains of Medstract, a previously created resource that included a biorelation server and an acronym database. MedstractPlus uses simple and scalable natural language processing modules to structure text, is designed with reusability and extendibility in mind, and adheres to the philosophy of the Linguistic Annotation Framework. We show how MedstractPlus has been used to provide seeds for a novel approach to inferring transcriptional regulatory networks from gene expression data.
Revised: November 30, 2011 |
Published: September 19, 2011
Citation
Verhagen M., J. Pustejovsky, R.C. Taylor, and A.P. Sanfilippo. 2011.Modular Semantic Tagging of Medline Abstracts and its Use in Inferring Regulatory Networks. In Proceedings of the 5th IEEE International Conference on Semantic Computing (ICSC 2011), September 18-21, 2011, Palo Alto, California, 498-505. Los Alamitos, California:IEEE Computer Society Press.PNNL-SA-81610.doi:10.1109/ICSC.2011.78