当前位置:主页 > 科技论文 > 计算机论文 >

虚拟计算集群下高性能应用性能分析与优化技术研究

发布时间:2018-01-09 05:15

  本文关键词:虚拟计算集群下高性能应用性能分析与优化技术研究 出处:《国防科学技术大学》2013年硕士论文 论文类型:学位论文


  更多相关文章: 虚拟化技术 虚拟计算集群 资源调度 科学工作流 Bin Packing


【摘要】:虚拟化带来的性能开销阻碍了高性能计算在虚拟集群环境下的大规模应用。首先,虚拟化层次的引入在传统操作系统任务调度中加入了虚拟CPU调度这一额外层次,带来了额外的调度开销。其次,同一台物理主机上所有虚拟机的通信通过虚拟机监视器统一处理,增加了应用在虚拟集群环境下的通信延迟。最后,同一台物理主机上的虚拟机对Cache等共享资源的非亲和性使用,使得高性能应用在虚拟集群环境下的性能显著下降。分析高性能应用在虚拟集群环境下的性能开销并由此提出相应的优化技术是解决上述问题的有效手段。本文利用Xen虚拟化软件、SLURM集群作业管理系统,Eclipse+PTP远程并行开发调试环境等构建了一个虚拟化高性能集群系统,通过在该系统上执行高性能测试集,对高性能应用的虚拟化性能开销进行量化分析,提出了基于VCPU亲和性的虚拟资源优化调度技术。具体研究内容包括以下三个方面:1.课题研究了虚拟化技术与基于虚拟集群的高性能计算国内外现状,为虚拟集群系统软件栈的构建提供了选择依据;对传统高性能环境下的典型应用进行调研,选取了三个具有代表性的科学计算应用作为虚拟化性能开销的测试集;对用户可能关心的性能指标进行了综合,选取合适的性能评价标准衡量虚拟集群环境下高性能应用的性能;最后阐述了虚拟集群系统的构建流程。2.通过在虚拟集群和物理集群上分别执行高性能应用程序对虚拟化性能开销进行了量化分析。特别地,通过执行中长期天气预报模式WRF,对虚拟集群环境下实际科学应用程序的性能开销进行了分析;通过执行NPB标准测试集对虚拟集群下不同类型高性能应用的性能开销进行了分析;通过执行科学工作流数据放置策略测试了虚拟集群下工作流应用不同规模通信量下的性能开销。最后,基于量化分析结果,指出了基于虚拟集群的高性能计算当前存在的问题与改进方向。3.根据虚拟集群下通信复用不明显而导致通信开销较大的特点,提出了基于Bin Packing的虚拟资源优化调度算法。该算法在基于解决传统Bin Packing问题的三种启发式算法的基础上,通过选取剩余时间和内存作为权值估计函数,设计了VM状态转换信息表。与先前算法相比,本文算法考虑了VCPU亲和性对高性能应用的性能影响,同时针对虚拟集群自身特点加入了通信感知与物理节点标记机制,提出了考虑通信量的改进Bin Packing算法。模拟实验结果表明,基于Bin Packing的虚拟资源调度算法相对于比先前的资源调度算法在通信复用上具有较大优势。
[Abstract]:The performance overhead brought by virtualization hinders the large-scale application of high performance computing in virtual cluster environment. The introduction of virtualization layer adds virtual CPU scheduling to the traditional operating system task scheduling which brings additional scheduling overhead. Secondly. The communication of all virtual machines on the same physical host is processed uniformly by the virtual machine monitor, which increases the communication delay applied in the virtual cluster environment. Finally. The non-affinity usage of virtual machines on the same physical host to shared resources such as Cache. The performance of high performance applications in virtual cluster environment is significantly reduced. It is an effective method to solve the above problems by analyzing the performance overhead of high performance applications in virtual cluster environment and putting forward corresponding optimization techniques. Using Xen virtualization software. The SLURM Cluster Job Management system (SLURM), Eclipse PTP remote parallel development and debugging environment, has constructed a virtualized high performance cluster system, which executes the high performance test set on the system. The virtualization performance overhead of high performance applications is analyzed quantitatively. A virtual resource scheduling technology based on VCPU affinity is proposed. The specific research contents include the following three aspects: 1.The subject studies the virtualization technology and high performance computing based on virtual cluster at home and abroad. It provides the basis for choosing the software stack of virtual cluster system. This paper investigates the typical applications in the traditional high performance environment and selects three representative scientific computing applications as the test set of virtualization performance overhead. The performance indexes that the users may care about are synthesized, and the performance of high performance applications in virtual cluster environment is evaluated by selecting appropriate performance evaluation criteria. Finally, the construction process of virtual cluster system. 2. The virtualization performance overhead is analyzed quantitatively by executing high performance application programs on virtual cluster and physical cluster respectively. The performance overhead of actual scientific applications in virtual cluster environment is analyzed by implementing the medium and long term weather forecast model (WRFs). The performance overhead of different types of high performance applications in virtual cluster is analyzed by executing NPB standard test set. The performance overhead of workflow application under virtual cluster is tested by implementing scientific workflow data placement strategy. Finally, based on the results of quantitative analysis. This paper points out the existing problems and improvement direction of high performance computing based on virtual cluster. 3. According to the characteristics that communication reuse is not obvious in virtual cluster, which leads to high communication cost. A virtual resource scheduling algorithm based on Bin Packing is proposed, which is based on three heuristic algorithms to solve the traditional Bin Packing problem. By selecting the residual time and memory as the weight estimation function, the VM state transition information table is designed. Compared with the previous algorithm, this algorithm considers the effect of VCPU affinity on the performance of high performance applications. At the same time, according to the characteristics of virtual cluster, the communication perception and physical node marking mechanism are added, and an improved Bin Packing algorithm considering traffic is proposed. The virtual resource scheduling algorithm based on Bin Packing is superior to the previous resource scheduling algorithm in communication reuse.
【学位授予单位】:国防科学技术大学
【学位级别】:硕士
【学位授予年份】:2013
【分类号】:TP302

【参考文献】

相关期刊论文 前1条

1 刘杰;张亦添;迟利华;徐涵;蒋杰;胡庆丰;;并行作业容错自动调度环境设计[J];计算机工程与科学;2009年11期



本文编号:1400127

资料下载
论文发表

本文链接:https://www.wllwen.com/kejilunwen/jisuanjikexuelunwen/1400127.html


Copyright(c)文论论文网All Rights Reserved | 网站地图 |

版权申明:资料由用户24a2e***提供,本站仅收录摘要或目录,作者需要删除请E-mail邮箱bigeng88@qq.com