当前位置:主页 > 科技论文 > 计算机论文 >

神威太湖之光上OpenFOAM的移植与优化

发布时间:2018-09-19 07:01
【摘要】:神威太湖之光是最新一期Top500榜单上排名第一的超级计算机,峰值性能为125.4PFlops,其计算能力主要归功于国产SW26010众核处理器。OpenFOAM(Open Source Field Operation and Manipulation)是计算流体力学领域使用最广泛的开源软件包,但是由于其基于C++实现,与神威太湖之光上的异构众核处理器SW26010的编译器不兼容,因此无法直接在该架构上有效运行。基于SW26010的主核/从核的体系架构移植了OpenFOAM的核心计算代码,并采用混合语言编程实现的方式来解决编译不兼容的问题。此外,通过寄存器通信、向量化和双缓冲等优化手段,单核组的性能较优化后的主核代码提高了8.03倍,较Intel(R)Xeon(R)CPU E5-2695v3的串行执行性能提高了1.18倍。同时,将单核组的实现扩展到了神威太湖之光的大规模集群上,并进行了强可扩展性测试,256个核组上实现了184.9倍的加速。采用的移植方式和优化手段也可以为其他复杂C++程序在神威太湖之光上的应用提供借鉴。
[Abstract]:Shenwei Taihu Light is the No. 1 supercomputer on the latest Top500 list, with peak performance of 125.4 PFlops.The computational power is mainly attributed to the domestic SW26010 multicore processor. OpenFOAM (Open Source Field Operation and Manipulation) is the most widely used open source software package in computational fluid dynamics. However, because it is based on C, and is incompatible with the compiler of SW26010, a heterogeneous multi-core processor on the light of Taihu Lake, it can not run effectively on this architecture directly. The core computing code of OpenFOAM is transplanted based on the architecture of master / slave core of SW26010, and the problem of compiling incompatibility is solved by mixed language programming. In addition, by means of register communication, vectorization and double buffering, the performance of the single core group is 8.03 times higher than that of the optimized main core code, and 1.18 times higher than that of the serial execution performance of Intel (R) Xeon (R) CPU E5-2695v3. At the same time, the implementation of mononuclear group is extended to the large-scale cluster of light in Shenwei Lake Taihu, and a strong extensibility test is carried out, and the acceleration of 184.9 times is achieved on 256 core groups. The transplant method and optimization method can also be used for reference in the application of other complex C programs in the light of Shenwei Lake Taihu.
【作者单位】: 上海交通大学高性能计算中心;东京工业大学;
【基金】:国家重点研发计划(2016YFB0201400,2016YFB0201800) 日本学术振兴会JSPS的RONPAKU项目 并行计算机工程技术研究中心资助
【分类号】:TP338


本文编号:2249387

资料下载
论文发表

本文链接:https://www.wllwen.com/kejilunwen/jisuanjikexuelunwen/2249387.html


Copyright(c)文论论文网All Rights Reserved | 网站地图 |

版权申明:资料由用户24fe8***提供,本站仅收录摘要或目录,作者需要删除请E-mail邮箱bigeng88@qq.com