January 21, 2013
Conference Paper

Comparing the Performance of Blue Gene/Q with Leading Cray XE6 and InfiniBand Systems

Abstract

Abstract—Three types of systems dominate the current High Performance Computing landscape: the Cray XE6, the IBM Blue Gene, and commodity clusters using InfiniBand. These systems have quite different characteristics making the choice for a particular deployment difficult. The XE6 uses Cray’s proprietary Gemini 3-D torus interconnect with two nodes at each network endpoint. The latest IBM Blue Gene/Q uses a single socket integrating processor and communication in a 5-D torus network. InfiniBand provides the flexibility of using nodes from many vendors connected in many possible topologies. The performance characteristics of each vary vastly along with their utilization model. In this work we compare the performance of these three systems using a combination of micro-benchmarks and a set of production applications. In particular we discuss the causes of variability in performance across the systems and also quantify where performance is lost using a combination of measurements and models. Our results show that significant performance can be lost in normal production operation of the Cray XT6 and InfiniBand Clusters in comparison to Blue Gene/Q.

Revised: July 30, 2013 | Published: January 21, 2013

Citation

Kerbyson D.J., K.J. Barker, A. Vishnu, and A. Hoisie. 2013. Comparing the Performance of Blue Gene/Q with Leading Cray XE6 and InfiniBand Systems. In Proceedings of 18th IEEE International Conference on Parallel and Distributed Systems (ICPADS), December 17-19, 2012, Singapore, 556-563. Los Alamitos, California:IEEE Computer Society. PNNL-SA-91632. doi:10.1109/ICPADS.2012.81