多核异构环境下通用并行计算框架关键技术研究
发布时间:2018-12-10 07:59
【摘要】:随着科技的发展,尤其是计算机技术的发展,各个行业中的数据量都开始呈指数型增长,传统的串行计算能力,已经远远不能满足日益增长的数据处理需求。在这种背景下,并行计算技术应运而生,其主要目的是快速解决大型且复杂的计算问题。并行计算不仅和国家的科技和经济发展密切相关,而且直接影响到国防能力和国家安全,如核爆炸模拟,复杂系统精确解算、基因研究和国家机要通信的加密与解密等等。并行计算能力是衡量国家实力的重要标志。 虽然并行计算已发展多年,在一些具体问题的解决上也已经有了较为实用的方案,总结了相当多的经验,但还远远不及串行算法那样丰富,因此这门学科研究尚不成熟。并行算法与串行算法的最大不同之处在于,不仅需要考虑问题本身的解决方法,还需要考虑问题所适用的并行模型,若要达到效率最大化,还需要考虑处理器架构、网络连接等因素,这必然会增加并行算法的设计和实现难度。 本论文在分析了并行计算中的各种难题以及国内外研究现状的基础上,针对并行计算模型的种种难题,提出了一种新的满足多核处理器机群计算需求的分层异构并行的通用计算模型,并对其中的关键技术做了初步研究,具体内容如下: (1)提出了一种新的满足多核处理器机群计算需求的分层异构并行的通用计算模型,该模型将目标问题的开发划分为程序模型算法设计、并行程序设计、并行程序执行三个阶段。程序模型算法设计阶段,开发人员面对参数化的并行机设计程序模型算法;并行程序设计阶段,开发人员利用并行开发平台开发具体的并行程序,实现并行任务;并行程序执行阶段,并行程序运行在相应的软硬件架构下,通过解释系统优化的计算参数优化指令执行效率。 (2)对模型框架进行细化和实现,针对分层异构模型的分层,在不同的层次设计相应的方法库、参数库、程序复用库等工具对模型功能以匹配,分层实现并行算法设计与实现的过程,最终实现并行计算模型的动态性、自适应性、可重构性与通用性。 (3)在不同的层次之间,提出了语言解释系统和编译系统,,实现层次之间的链接,保证了系统模型的框架的完整性和可实现性。 (4)利用该异构并行通用计算模型,实现了叠前偏移程序并行算法的设计。叠前偏移程序算法是石油探测中经典算法之一,该算法的串行程序已经较为完善,但其并行算法一直是个较为复杂的问题。应用该模型很好地解决了其并行性。 该通用并行计算框架对应用开发人员提供简单易用的设计语言,实现并行计算程序设计的高效性、正确性、普适性。因而具有广泛的应用前景和显著的社会效益。分层异构并行的通用计算模型为应用开发人员提供独立于硬件的可扩展的编程接口,为具有普遍性的问题建立方法库,为程序运行平台建立参数库,综合管理不同的计算资源,合理分配计算任务,减轻程序的开发难度以及应用开发人员的工作量。
[Abstract]:With the development of science and technology, especially the development of computer technology, the amount of data in each industry is beginning to be exponential growth, and the traditional serial computing power is far from meeting the increasing demand of data processing. In this background, the parallel computing technology has come into being, and its main purpose is to quickly solve large and complex computing problems. Parallel computing is not only closely related to the state's science and technology and economic development, but also directly affects national defense capability and national security, such as nuclear explosion simulation, accurate solution of complex system, gene research and encryption and decryption of communication between national machine and so on. The ability of parallel computation is an important symbol to measure the national strength. Although the parallel computing has been developed for many years, a more practical solution has been made in the solution of some specific problems, and a considerable amount of experience has been summarized, but it is far less abundant than the serial algorithm, so the research of this subject is not yet feasible. The maximum difference between the parallel algorithm and the serial algorithm is not only to consider the solution of the problem itself, but also to consider the parallel model applicable to the problem. If the efficiency is maximized, the processor architecture, network connection, etc. need to be taken into account. In addition, this will increase the design and implementation of the parallel algorithm. In this paper, based on the analysis of the various problems in parallel computation and the current research situation at home and abroad, a new, multi-core processor cluster, which can meet the needs of multi-core processor cluster, is put forward in order to solve the problems of parallel computing model. In this paper, the key technology in the model is studied and the key technology in the model is studied in detail. The method is as follows: (1) A new, hierarchical and heterogeneous parallel general-purpose computing model for multi-core processor cluster computing needs is proposed, which divides the development of the target problem into a program model algorithm design, parallel program design and parallel program execution In the design stage of the program model, the developer faces the parameterized parallel machine design program model algorithm, and the developer uses the parallel development platform to develop the specific parallel program in the design phase of the parallel program, so as to realize the parallel task, in that sequence execution stage, the parallel program is run under the corresponding software and hardware structure, and the optimization of the calculation parameter of the system optimization is and (2) refining and implementing the model framework, and aiming at the layering of the hierarchical heterogeneous model, and designing the corresponding method library, the parameter library, the program multiplexing library and the like at different levels to match the model functions so as to realize the parallel algorithm in the layering manner. and finally, the dynamic and self-adaptive of the parallel computing model can be realized, (3) At different levels, the language interpretation system and the compiling system are put forward, the link between the levels is realized, and the framework of the system model is guaranteed. and (4) utilizing the heterogeneous parallel general-purpose computing model to realize the pre-stack The algorithm of pre-stack migration is one of the classical algorithms in petroleum exploration, and the serial program of the algorithm is perfect, but its parallel algorithm It's a more complex problem. Application of the model The general parallel computing framework provides a simple and easy-to-use design language for the application developer to realize the parallel computing program. The invention has the advantages of high efficiency, correctness and universality. The application developer is provided with an extensible programming interface which is independent of the hardware, a method library is established for the problem of universality, a parameter library is established for the program running platform, and the comprehensive management is different computing resources, reasonably allocating computing tasks, and reducing the development of programs
【学位授予单位】:中国海洋大学
【学位级别】:博士
【学位授予年份】:2013
【分类号】:TP338.6
本文编号:2370249
[Abstract]:With the development of science and technology, especially the development of computer technology, the amount of data in each industry is beginning to be exponential growth, and the traditional serial computing power is far from meeting the increasing demand of data processing. In this background, the parallel computing technology has come into being, and its main purpose is to quickly solve large and complex computing problems. Parallel computing is not only closely related to the state's science and technology and economic development, but also directly affects national defense capability and national security, such as nuclear explosion simulation, accurate solution of complex system, gene research and encryption and decryption of communication between national machine and so on. The ability of parallel computation is an important symbol to measure the national strength. Although the parallel computing has been developed for many years, a more practical solution has been made in the solution of some specific problems, and a considerable amount of experience has been summarized, but it is far less abundant than the serial algorithm, so the research of this subject is not yet feasible. The maximum difference between the parallel algorithm and the serial algorithm is not only to consider the solution of the problem itself, but also to consider the parallel model applicable to the problem. If the efficiency is maximized, the processor architecture, network connection, etc. need to be taken into account. In addition, this will increase the design and implementation of the parallel algorithm. In this paper, based on the analysis of the various problems in parallel computation and the current research situation at home and abroad, a new, multi-core processor cluster, which can meet the needs of multi-core processor cluster, is put forward in order to solve the problems of parallel computing model. In this paper, the key technology in the model is studied and the key technology in the model is studied in detail. The method is as follows: (1) A new, hierarchical and heterogeneous parallel general-purpose computing model for multi-core processor cluster computing needs is proposed, which divides the development of the target problem into a program model algorithm design, parallel program design and parallel program execution In the design stage of the program model, the developer faces the parameterized parallel machine design program model algorithm, and the developer uses the parallel development platform to develop the specific parallel program in the design phase of the parallel program, so as to realize the parallel task, in that sequence execution stage, the parallel program is run under the corresponding software and hardware structure, and the optimization of the calculation parameter of the system optimization is and (2) refining and implementing the model framework, and aiming at the layering of the hierarchical heterogeneous model, and designing the corresponding method library, the parameter library, the program multiplexing library and the like at different levels to match the model functions so as to realize the parallel algorithm in the layering manner. and finally, the dynamic and self-adaptive of the parallel computing model can be realized, (3) At different levels, the language interpretation system and the compiling system are put forward, the link between the levels is realized, and the framework of the system model is guaranteed. and (4) utilizing the heterogeneous parallel general-purpose computing model to realize the pre-stack The algorithm of pre-stack migration is one of the classical algorithms in petroleum exploration, and the serial program of the algorithm is perfect, but its parallel algorithm It's a more complex problem. Application of the model The general parallel computing framework provides a simple and easy-to-use design language for the application developer to realize the parallel computing program. The invention has the advantages of high efficiency, correctness and universality. The application developer is provided with an extensible programming interface which is independent of the hardware, a method library is established for the problem of universality, a parameter library is established for the program running platform, and the comprehensive management is different computing resources, reasonably allocating computing tasks, and reducing the development of programs
【学位授予单位】:中国海洋大学
【学位级别】:博士
【学位授予年份】:2013
【分类号】:TP338.6
【参考文献】
相关期刊论文 前10条
1 陈鹏,张立昂;PRAM模型模拟RMESH模型的2种方案[J];北京大学学报(自然科学版);2005年03期
2 周浩;钟波;罗志才;张坤;;OpenMP并行算法在卫星重力场模型反演中的应用[J];大地测量与地球动力学;2011年05期
3 刘劲松;刘福田;刘俊;郝天珧;;地震层析成像LSQR算法的并行化[J];地球物理学报;2006年02期
4 于秀敏,李建中,郭风;高性能并行计算的曙光:机群系统[J];哈尔滨学院学报;2004年02期
5 陶应龙;王建国;牛胜利;范如玉;;MCATNP蒙特卡罗粒子输运程序的MPI并行化[J];核电子学与探测技术;2011年05期
6 沈绪榜;;MPP系统芯片体系结构技术的发展[J];中国科学(E辑:信息科学);2008年06期
7 张理论;叶红;吴建平;宋君强;;基于最大负载偏移率的并行负载平衡性能分析[J];计算机研究与发展;2010年06期
8 乔香珍,杨晔;基于LogP模型的并行计算模拟器[J];计算机研究与发展;1997年09期
9 申俊,郑纬民;异构并行工作站机群系统的性能评价指标[J];计算机研究与发展;1998年03期
10 林智华;对阵列处理机和多处理机的深入比较[J];福建金融管理干部学院学报;2004年01期
相关博士学位论文 前1条
1 胡悦;并行计算时间与存储空间关系研究[D];上海大学;2011年
本文编号:2370249
本文链接:https://www.wllwen.com/kejilunwen/jisuanjikexuelunwen/2370249.html