计算物理 ›› 2024, Vol. 41 ›› Issue (4): 403-417.DOI: 10.19596/j.cnki.1001-246x.8737

•   • 上一篇    下一篇

几类典型应用的代数多重网格算法并行可扩展瓶颈分析

毛润彰1,2(), 杜皓3, 田鸿运2, 黄思路2, 张鹏2,4, 徐小文2,4,*()   

  1. 1. 中国工程物理研究院研究生院,北京 100088
    2. 北京应用物理与计算数学研究所,北京 100094
    3. 南京大学匡亚明学院,江苏 南京 210023
    4. 中国工程物理研究院高性能数值模拟软件中心,北京 100088
  • 收稿日期:2023-03-23 出版日期:2024-07-25 发布日期:2024-08-24
  • 通讯作者: 徐小文
  • 作者简介:毛润彰,男,硕士研究生,研究方向为大规模并行计算, E-mail:maorunzhang20@gscaep.ac.cn
  • 基金资助:
    国家自然科学基金项目(62032023)

Analysis of Parallel Scalability Bottleneck for Algebraic Multigrid in Typical Real Applications

Runzhang MAO1,2(), Hao DU3, Hongyun TIAN2, Silu HUANG2, Peng ZHANG2,4, Xiaowen XU2,4,*()   

  1. 1. Graduate School of China Academy of Engineering Physics, Beijing 100088, China
    2. Institute of Applied Physics and Computational Mathematics, Beijing 100094, China
    3. Kuang Yaming Honors School of Nanjing University, Nanjing, Jiangsu 210023, China
    4. Software Center for High Performance Numerical Simulation, China Academy of Engineering Physics, Beijing 100088, China
  • Received:2023-03-23 Online:2024-07-25 Published:2024-08-24
  • Contact: Xiaowen XU

摘要:

对于大规模稀疏线性代数方程组,代数多重网格(AMG)是具有最优计算复杂度的求解算法,但由于其算法流程复杂,导致难以取得理想的并行可扩展性能,难以定位和分析其并行可扩展瓶颈。通过分析AMG算法的性能骨架和通信模式,归纳了三类可扩展性能瓶颈,并引入稀疏矩阵通信域的概念来刻画稀疏模式对并行通信性能的影响。针对辐射流体力学、结构力学、航空发动机三类实际应用的6个具有不同稀疏模式特征的典型算例,实现了多粒度并行可扩展性能瓶颈的定位与分析,总结了未来AMG并行性能优化方向。

关键词: 代数多重网格, 并行预条件算法, 并行可扩展性, 性能分析, 性能瓶颈

Abstract:

Algebraic multigrid (AMG) is an optimal algorithm for solving large-scale sparse linear systems. However, its complexity makes it challenging to achieve ideal parallel scalability and identify parallel scalability bottlenecks. In this paper, we analyze the performance skeletons and communication patterns of the AMG algorithm to identify three categories of scalability bottlenecks. Additionally, we introduce the concept of the sparse matrix communication domain to characterize the influence of sparse patterns on parallel communication performance. We examine six typical examples with varying sparse pattern features in practical applications such as radiation fluid dynamics, structural mechanics, and aero-engines. Through our analysis, we identify and analyze multi-granularity parallel scalability bottlenecks and provide insights into future directions for improving AMG parallel performance.

Key words: algebraic multigrid, parallel preconditioning algorithms, parallel scalability, performance analysis, performance bottleneck

中图分类号: