August 4, 2017
Conference Paper

Intrinsic and Extrinsic Evaluation of Spatiotemporal Text Representations in Twitter Streams

Abstract

Language in social media is a dynamic system, constantly evolving and adapting, with words and concepts rapidly emerging, disappearing, and changing their meaning. These changes can be estimated using word representations in context, over time and across locations. A number of methods have been proposed to track these spatiotemporal changes but no general method exists to evaluate the quality of these representations. Previous work largely focused on qualitative evaluation, which we improve by proposing a set of visualizations that highlight changes in text representation over both space and time. We demonstrate usefulness of novel spatiotemporal representations to explore and characterize specific aspects of the corpus of tweets collected from European countries over a two-week period centered around the terrorist attacks in Brussels in March 2016. In addition, we quantitatively evaluate spatiotemporal representations by feeding them into a downstream classification task – event type prediction. Thus, our work is the first to provide both intrinsic (qualitative) and extrinsic (quantitative) evaluation of text representations for spatiotemporal trends.

Revised: June 1, 2018 | Published: August 4, 2017

Citation

Phillips L.A., K.J. Shaffer, D.L. Arendt, N.O. Hodas, and S. Volkova. 2017. "Intrinsic and Extrinsic Evaluation of Spatiotemporal Text Representations in Twitter Streams." In Proceedings of the 2nd Workshop on Representation Learning for NL (Rep4NLP@ACL 2017), August 3, 2017, Vancouver Canada, 201-210. Stroudsburg, Pennsylvania:Association for Computational Linguistic. PNNL-SA-122225.

Research topics