1 |
MO Zeyao , ZHANG Aiqing , CAO Xiaolin , et al. JASMIN: A parallel software infrastructure for scientific computing[J]. Frontiers of Computer Science in China, 2010, 4 (4): 480- 488.
DOI
|
2 |
任健, 武林平, 申卫东. 基于JASMIN框架多物理耦合程序的性能优化及分析[J]. 计算物理, 2015, 32 (4): 431- 436.
DOI
|
3 |
程汤培, 莫则尧, 邵景力. 基于JASMIN的地下水流大规模并行数值模拟[J]. 计算物理, 2013, 30 (3): 317- 325.
|
4 |
郭红, 曹小林, 胡晓燕. 基于JASMIN框架的FFT并行解法器及其应用[J]. 计算物理, 2011, 28 (4): 475- 480.
|
5 |
LIU Qingkai, ZHAO Weibo, CHENG Jie, et al. A programming framework for large scale numerical simulations on unstructured mesh[C]//2016 IEEE 2nd International Conference on Big Data Security on Cloud (BigDataSecurity), IEEE International Conference on High Performance and Smart Computing (HPSC), and IEEE International Conference on Intelligent Data and Security (IDS). New York, NY, USA: IEEE, 2016: 310-315.
|
6 |
WISSINK A M, HORNUNG R D, KOHN S R, et al. Large scale parallel structured AMR calculations using the SAMRAI framework[C]//SC '01: Proceedings of the 2001 ACM/IEEE Conference on Supercomputing. Denver, CO, USA: IEEE, 2001: 22-22.
|
7 |
TOP500. org. Top500 list[EB/OL]. [2023-05-10]. https://www.top500.org/lists/top500/2023/06/.
|
8 |
CARTER EDWARDS H , TROTT C R , SUNDERLAND D . Kokkos: Enabling manycore performance portability through polymorphic memory access patterns[J]. Journal of Parallel and Distributed Computing, 2014, 74 (12): 3202- 3216.
DOI
|
9 |
REGULY I Z, MUDALIGE G R, GILES M B, et al. The OPS domain specific abstraction for multi-block structured grid computations[C]//2014 Fourth International Workshop on Domain-Specific Languages and High-Level Frameworks for High Performance Computing. New Orleans, LA, USA: IEEE, 2014: 58-67.
|
10 |
BONDHUGULA U, HARTONO A, RAMANUJAM J, et al. A practical automatic polyhedral parallelizer and locality optimizer[C]//Proceedings of the 29th ACM SIGPLAN Conference on Programming Language Design and Implementation. Tucson, AZ, USA: Association for Computing Machinery, 2008: 101-113.
|
11 |
BECKINGSALE D A, BURMARK J, HORNUNG R, et al. RAJA: Portable performance for Large-scale scientific applications[C]//2019 IEEE/ACM International Workshop on Performance, Portability and Productivity in HPC (P3HPC). Denver, CO, USA: IEEE, 2019: 71-81.
|
12 |
TANG Yuan, CHOWDHURY R A, KUSZMAUL B C, et al. The pochoir stencil compiler[C]//Proceedings of the Twenty-third Annual ACM Symposium on Parallelism in Algorithms and Architectures. San Jose, California, USA: Association for Computing Machinery, 2011: 117-128.
|
13 |
TANG Yuan, CHOWDHURY R A, LUK C K, et al. Coding stencil computations using the pochoir stencil-specification language[C]. Proceedings of the 3rd USENIX Workshop on Hot Topics in Parallelism (HotPar 2011). Berkeley, California: USENIX Association, 2011.
|
14 |
HENRETTY T, VERAS R, FRANCHETT F, et al. A stencil compiler for short-vector SIMD architectures[C]//Proceedings of the 27th International ACM Conference on International Conference on Supercomputing. Eugene, Oregon, USA: Association for Computing Machinery, 2013: 13-24.
|
15 |
INTEL C. Intel cilk plus language specification[EB/OL]. [2023-05-11]. http://software.intel.com/sites/products/cilk-plus.
|
16 |
HENRETTY T, STOCK K, POUCHET L N, et al. Data layout transformation for stencil computations on short-vector SIMD architectures[C]//International Conference on Compiler Construction-CC 2011: Compiler Construction. Saarbrücken, Germany: Springer, 2011: 225-245.
|
17 |
CHRISTEN M, SCHENK O, BURKHART H. PATUS: A code generation and autotuning framework for parallel iterative stencil computations on modern microarchitectures[C]//2011 IEEE International Parallel & Distributed Processing Symposium. Anchorage, AK, USA: IEEE, 2011: 676-687.
|
18 |
MUDALIGE G R, GILES M B, REGULY I, et al. OP2: An active library framework for solving unstructured mesh-based applications on multi-core and many-core architectures[C]//2012 Innovative Parallel Computing (InPar). San Jose, CA, USA: IEEE, 2012: 1-12.
|
19 |
ACHARYA A, BONDHUGULA U. Pluto+: Near-complete modeling of affine transformations for parallelism and locality[C]//Proceedings of the 20th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming. San Francisco, CA, USA: Association for Computing Machinery, 2015: 54-64.
|
20 |
CAAMAÑO J M M, SUKUMARAN-RAJAM A, BALOIAN A, et al. APOLLO: Automatic speculative polyhedral loop optimizer[C]//IMPACT 2017-7th International Workshop on Polyhedral Compilation Techniques. Stockholm, Sweden: HAL, 2017: hal-01533692.
|
21 |
RAGAN-KELLEY J, BARNES C, ADAMS A, et al. Halide: A language and compiler for optimizing parallelism, locality, and recomputation in image processing pipelines[C]//Proceedings of the 34th ACM SIGPLAN Conference on Programming Language Design and Implementation. Seattle, Washington, USA: Association for Computing Machinery, 2013: 519-530.
|
22 |
RAGAN-KELLEY J , ADAMS A , PARIS S , et al. Decoupling algorithms from schedules for easy optimization of image processing pipelines[J]. ACM Transactions on Graphics, 2012, 31 (4): 32.
|
23 |
卢兴敬, 刘雷, 贾海鹏, 等. ParaC: 面向GPU平台的图像处理领域的编程框架[J]. 软件学报, 2017, 28 (7): 1655- 1675.
|
24 |
STONE J E , GOHARA D , SHI Guochun . OpenCL: A parallel programming standard for heterogeneous computing systems[J]. Computing in Science & Engineering, 2010, 12 (3): 66- 72.
|
25 |
ECP. Overview of the ECP[EB/OL]. [2023-05-11]. https://www.exascaleproject.org/about.
|
26 |
赵捷, 李颖颖, 赵荣彩. 基于多面体模型的编译"黑魔法"[J]. 软件学报, 2018, 29 (8): 2371- 2396.
|