The programming techniques supported and employed on these GPUs and Multi-GPUs systems are not sufficient to address problems exhibiting irregular, and unbalanced workload such as Molecular Dynamic (MD) simulations of systems with non-uniform densities. In this paper, we propose a task-based dynamic load-balancing solution to employ on MD simulations for single- and multi-GPU systems. The solution allows load balancing at a finer granularity than what is supported in existing APIs such as NVIDIA’s CUDA. Experimental results with a single-GPU configuration show that our fine-grained task solution can utilize the hardware more efficiently than the CUDA scheduler. On multi-GPU systems, our solution achieves near-linear speedup, load balance, and significant performance improvement over techniques based on standard CUDA APIs.
Revised: August 18, 2010 |
Published: May 30, 2010
Citation
Villa O., L. Chen, and S. Krishnamoorthy. 2010.High Performance Molecular Dynamic Simulation on Single and Multi-GPU Systems. In IEEE International Symposium on Circuits and Systems (ISCAS 2010), 3805-3808. Piscataway, New Jersey:Institute of Electrical and Electronics Engineers.PNNL-SA-70332.doi:10.1109/ISCAS.2010.5537723