February 22, 2010
Journal Article

Text and Structural Data Mining of Influenza Mentions in Web and Social Media

Abstract

Text and structural data mining of Web and social media (WSM) provides a novel disease surveillance resource and can identify online communities for targeted public health communications (PHC) to assure wide dissemination of pertinent information. WSM that mention influenza are harvested over a 24-week period, 5-October-2008 to 21-March-2009. Link analysis reveals communities for targeted PHC. Text mining is shown to identify trends in flu posts that correlate to real-world influenza-like-illness patient report data. We also bring to bear a graph-based data mining technique to detect anomalies among flu blogs connected by publisher type, links, and user-tags.

Revised: July 4, 2010 | Published: February 22, 2010

Citation

Corley C.D., D. Cook, A.R. Mikler, and K.P. Singh. 2010. Text and Structural Data Mining of Influenza Mentions in Web and Social Media. International Journal of Environmental Research and Public Health 7, no. 2:596-615. PNNL-SA-69450. doi:10.3390/ijerph7020596