A variety of data analysis problems is reviewed. A common computational bottleneck encountered in each of these problems is described and diagnosed as analysis tools and algorithms with unbounded memory. Analysis of the problems suggest a research and development path that could greatly extend the scale of problems that can be addressed with routine data analysis tools.
Revised: September 9, 2013 |
Published: October 1, 2004
Citation
Whitney P.D. 2004.Toward the Routine Analysis of Moderate- to Large-Size Data. In Statistical Analysis of Massive Data Streams: Proceedings of a Workshop, December 13-14, 2002, Washington DC. Washington Dc:National Academies Press.PNNL-SA-40643.