Crunching large graphs is the basis of many emerging appli- cations, such as social network analysis and bioinformatics. Graph analytics algorithms exhibit little locality and therefore present signi?cant performance challenges. Hardware multi- threading systems (e.g, Cray XMT) show that with enough concurrency, we can tolerate long latencies. Unfortunately, this solution is not available with commodity parts. Our goal is to develop a latency-tolerant system built out of commodity parts and mostly in software. The proposed system includes a runtime that supports a large number of lightweight contexts, full-bit synchronization and a memory manager that provides a high-latency but high-bandwidth global shared memory. This paper lays out the vision for our system, and justi?es its feasibility with a performance analysis of the run- time for latency tolerance.
Revised: June 6, 2011 |
Published: May 26, 2011
Citation
Nelson J.E., B.D. Myers, A.H. Hunter, P. Briggs, L. Ceze, W.C. Ebeling, and D. Grossman, et al. 2011.Crunching Large Graphs with Commodity Processors. In HotPar 11: Third USENIX Workshop on Hot Topics in Parallelism, May 26-27, 2011, Berkeley, CA. Berkeley, California:USENIX.PNNL-SA-77533.