This book chapter presents an approach to analysis of large-scale time-series sensor information based on our experience with power grid data. We use the R-Hadoop Integrated Programming Environment (RHIPE) to analyze a 2TB data set and present code and results for this analysis.
Revised: February 13, 2014 |
Published: January 1, 2014
Citation
Hafen R.P., T.D. Gibson, K. Kleese van Dam, and T.J. Critchlow. 2014.Power Grid Data Analysis with R and Hadoop. In Data Mining Applications with R, edited by Y Zhao and Y Cen. 1-34. Waltham, Massachusetts:Academic Press.PNNL-SA-89931.