CHINESE JOURNAL OF COMPUTATIONAL PHYSICS ›› 2018, Vol. 35 ›› Issue (5): 554-562. DOI: 10.19596/j.cnki.1001-246x.7698


Performance Optimization of 3D Pseudopotential Multi-Relaxation-Time Lattice Boltzmann Model on GPU

PENG Hao1, SHAN Minglei1,2, ZHU Changping1,2, YAO Cheng1,2   

    1. Changzhou Key Laboratory of Sensor Networks and Environmental Sensing, Jiangsu Key Laboratory of Power Transmission and Distribution Equipment Technology, Hohai University, Changzhou 213022, China;
    2. Jiangsu Provincial Collaborative Innovation Center of World Water Valley and Water Ecological Civilization, Nanjing 211100, China
  • Received: 2017-05-17  Revised: 2017-07-21  Online: 2018-09-25  Published: 2018-09-25

Abstract: The pseudopotential model of the lattice Boltzmann method (LBM) is partially non-local: computing the pseudopotential couples neighboring lattice nodes, which requires thread synchronization in a parallel implementation. In addition, the computation uses a large number of registers and spends considerable time on global memory accesses. Together, these factors lead to low computational efficiency. In this paper, a three-dimensional multi-relaxation-time (MRT) pseudopotential model on the D3Q15 lattice is taken as an example to investigate the performance of GPU-based parallel computing. To address the limitations of the pseudopotential model in parallel computing, the efficiency of global memory reads and writes is improved by using coalesced (merged) memory access. To improve the efficiency with which grid nodes on the lattice boundary retrieve data, a "Directional Transfer" algorithm is proposed. The effect of computing resource configuration is investigated for different block sizes, and an optimal resource configuration scheme is obtained.
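
As a rough illustration of why memory layout matters for coalesced access, the sketch below compares two common ways of storing the 15 distribution functions of a D3Q15 lattice in a flat array. The abstract does not state which layout the authors use; the function names (`aos_index`, `soa_index`, `warp_stride`), the 64 × 64 × 64 grid, and the warp size of 32 are assumptions made only for this example, and the code models index arithmetic on the host rather than an actual GPU kernel.

```python
Q = 15  # number of discrete velocities in the D3Q15 lattice


def aos_index(node, q, n_nodes):
    """Array-of-structures: the 15 distributions of one node are contiguous."""
    return node * Q + q


def soa_index(node, q, n_nodes):
    """Structure-of-arrays: one direction's values for all nodes are contiguous."""
    return q * n_nodes + node


def warp_stride(index_fn, q, n_nodes, warp_size=32):
    """Set of address gaps between consecutive threads that each read
    direction q of their own node (thread t handles node t)."""
    addrs = [index_fn(t, q, n_nodes) for t in range(warp_size)]
    return {b - a for a, b in zip(addrs, addrs[1:])}


n_nodes = 64 * 64 * 64  # hypothetical grid size, chosen only for illustration

print(warp_stride(aos_index, 3, n_nodes))  # {15}: strided, poorly coalesced
print(warp_stride(soa_index, 3, n_nodes))  # {1}: consecutive, coalescable
```

With the structure-of-arrays layout, consecutive threads touch consecutive addresses, which is the pattern a GPU can combine into few global memory transactions. With the array-of-structures layout, each thread's address is 15 elements away from its neighbor's, so a warp's reads are spread over many more memory segments.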

Key words: LBM, pseudopotential model, GPU, parallel computing, performance optimization
