CUDA图形处理器对聚类算法的加速实现
[Abstract]:In recent years, with the rapid development of information processing and communication technology, it has promoted the further improvement of output and data acquisition in many fields. This has brought us a large amount of data to be processed. Nowadays, GB or TB orders of magnitude of data are very common. Therefore, the traditional serial data mining technology has been unable to be used. It is effective to deal with these data, which is replaced by parallel data mining technology. Nowadays, multi-core CPU (Central Processing Unit) is very common as a computing platform for parallel data mining. However, as the scale of data is increasing, the complexity of the algorithms involved in data mining is getting higher and higher, which brings high level. Density data operation will consume a large number of computing time of the processor, which is very bad for the performance and power of the whole system. In order to reduce the computation of this part of large-scale data, we need to choose a computing unit that is more suitable for this kind of operation to reduce the load of the system. And the graphic processor - GPU (Graphics Processing Unit), with its special architecture, is very suitable for parallel computing of large scale and high density data. At the beginning of the design, GPU designers have configured a large number of data computing units and high memory access bandwidth to meet the demands of increasingly harsh graphics processing and electronic games, especially since NV When IDIA launched the GPU products of the CUDA (Compute Unified Device Architecture) architecture in 2007 and its supporting environment, the development of GPU based platform is very similar to the development of the CPU platform, allowing developers to quickly start developing GPU based on CUDA architecture. In many fields, such as scientific computing, financial engineering, Data mining, and so on, developers are trying to use the CUDA architecture GPU to improve the computing power of the system. In addition to the implementation of GPU operation in the traditional single machine environment, and more and more research on the application of GPU in the distributed environment. This paper will select two kinds of clustering algorithms commonly used in the data mining domain, K mean clustering And single chain combined hierarchical clustering, and based on a NVIDIA GTX260 series CUDA architecture GPU respectively to the two clustering algorithms to carry out parallel implementation, thus demonstrating the effectiveness and feasibility of GPU clustering algorithm performance improvement. Finally, this paper will combine the enterprise customer relationship management system (Customer Relationship Management, CRM). The actual requirement is to realize the GPU clustering operation under the Hadoop framework. Clustering is a very common operation in the process of data mining. The main purpose of this paper is to assemble the randomly scattered objects together to form one or more clusters according to some agreed similarity or correlation. The K mean clustering algorithm implemented in this paper mainly refers to N is divided into the nearest cluster according to the distance of its spatial Euclidean distance. After several iterations, K clusters are formed after several iterations. As a classic clustering algorithm, the algorithm has been widely used in data mining, biological information, image recognition, artificial intelligence and so on. The famous Apache Mahout The K means clustering algorithm is used in its clustering operation. For another clustering algorithm implemented in the experiment, combined hierarchical clustering, first N independent data objects are initialized to N independent subclusters, and the Euclidean distance is used as the measure of similarity between subclusters, and the nearest sub cluster is merged and repeated multiple times. After generation, all data objects in the whole data set are included in a cluster as the end of the algorithm. Although the operation and final results are different from the K mean, all the algorithms contain large data operations and there is a high independence between the computing. It is precisely because of this commonality that we can use it. Multi core CPU and GPU parallel clustering algorithm to reduce the running time of the program and improve the throughput of the data. Because enterprise CRM needs to run in the distributed computing environment, the choice of the open source software Hadoop.Haoop framework allows the users to cluster the common computer cluster for large-scale data. Distributed computing, at the same time, provides strong fault tolerance and reliability. At present, such as Google, Facebook and other companies are using the distributed computing based on the Hadoop framework. In this paper, three groups of different sizes of input data and target clustering number are designed to perform K mean operation. The layer clustering algorithm has designed a smaller size of data than the K mean clustering. According to the programming model of CUDA, the program is divided into two parts, in which the program of non high density calculation runs in the CPU end, and the mathematical operation of the large floating point number involved in the clustering operation is run at the GPU end, and this model is also called the heterogeneous calculation of CPU+GPU. At the same time, I We also calculate the same object based on the multi core CPU. Finally, by comparing the time consuming of the two implementations, the acceleration ratio based on the GPU implementation is obtained. Finally, this paper will implement the GPU clustering operation based on the Hadoop framework, and the realization method and the result can be used as the prototype and reference of the enterprise CRM design.
【学位授予单位】:上海交通大学
【学位级别】:硕士
【学位授予年份】:2013
【分类号】:TP311.13
【相似文献】
相关期刊论文 前10条
1 ;NVIDIA GeForce FX被评为2002年最佳图形处理器[J];CAD/CAM与制造业信息化;2003年Z1期
2 李海燕;张春元;李礼;任巨;;图形处理器的流执行模型[J];计算机工程;2008年22期
3 ;MathWorks为MATLAB提供GPU支持[J];电子与电脑;2010年10期
4 杨毅;郭立;史鸿声;郭安泰;;面向移动设备的3D图形处理器设计[J];小型微型计算机系统;2009年08期
5 ;MathWorks为MATLAB提供GPU支持[J];电信科学;2010年10期
6 ;MathWorks为MATLAB提供GPU支持[J];中国电子商情(基础电子);2010年10期
7 ;MathWorks为MATLAB提供GPU支持[J];电信科学;2010年S2期
8 韩俊刚;刘有耀;张晓;;图形处理器的历史现状和发展趋势[J];西安邮电学院学报;2011年03期
9 ;产品推介[J];电子产品世界;2012年09期
10 ;产业信息[J];单片机与嵌入式系统应用;2013年12期
相关会议论文 前7条
1 张春燕;;一种基于图形处理器的数据流计算模式[A];全国第19届计算机技术与应用(CACIS)学术会议论文集(下册)[C];2008年
2 徐侃;陈如山;杜磊;朱剑;杨阳;;可编程图形处理器加速无条件稳定的Crank-Nicolson FDTD分析三维微波电路[A];2009年全国微波毫米波会议论文集(下册)[C];2009年
3 周国亮;冯海军;何国明;陈红;李翠平;王珊;;基于图形处理器的Cuboid算法[A];第26届中国数据库学术会议论文集(B辑)[C];2009年
4 毕文元;陈志强;;利用可编程图形处理器加速CT重建与体数据的绘制[A];第十一届中国体视学与图像分析学术会议论文集[C];2006年
5 刘伟峰;杨权一;曹邦功;孟凡密;周洁;;基于GPU的高度并行Marching Cubes改进算法[A];2008年全国开放式分布与并行计算机学术会议论文集(上册)[C];2008年
6 林旭生;田绪红;冯志炜;陈茂资;;GPU加速的蚁群算法在HP模型中的应用[A];第十四届全国图象图形学学术会议论文集[C];2008年
7 方建文;于金辉;陈海英;;三维卡通水与物体交互作用的动画建模[A];中国计算机图形学进展2008--第七届中国计算机图形学大会论文集[C];2008年
相关重要报纸文章 前10条
1 乐山 乐水;图形处理技术的全球专利布局形势[N];中国知识产权报;2010年
2 严威川;明明白白显卡“芯”[N];中国电脑教育报;2007年
3 ;NEC图形处理器每秒运行50.2G条指令[N];计算机世界;2003年
4 游讯;图形处理器GPU[N];人民邮电;2011年
5 本报记者 姜姝;AMD嵌入式技术为波音飞机保驾护航[N];中国信息化周报;2014年
6 均儿;人人都有台超级计算机[N];电脑报;2008年
7 ;AMD启动“Fusion”企业品牌推广计划[N];人民邮电;2008年
8 本报记者 田梦;Adobe CS4全面支持GPU加速[N];计算机世界;2009年
9 赵欣;“玩”3D,笔记本也行![N];中国计算机报;2003年
10 ;HP Compaq Evo D210教育信息化的好帮手[N];中国计算机报;2003年
相关博士学位论文 前7条
1 祖渊;基于图形处理器的高速并行算法研究[D];中国科学技术大学;2014年
2 李雪;基于图形处理的多点地质统计算法及模型评价[D];中国科学技术大学;2016年
3 曹小鹏;图形处理器关键技术和光线追踪并行结构研究[D];西安电子科技大学;2015年
4 杨珂;基于图形处理器的数据管理技术研究[D];浙江大学;2008年
5 穆帅;针对不规则应用的图形处理器资源调度关键技术研究[D];清华大学;2013年
6 夏健明;基于图形处理器的大规模结构计算研究[D];华南理工大学;2009年
7 黄涛;基于GPU的多点地质统计逐点模拟并行算法的研究[D];中国科学技术大学;2013年
相关硕士学位论文 前10条
1 刘锐;GPU在FD-OCT系统数据处理及实时图像显示中的应用[D];北京理工大学;2015年
2 彭欢;基于GPU的二维FDTD加速算法研究[D];西安电子科技大学;2013年
3 徐蔚;基于图形处理器的窗口系统的研究[D];西安工程大学;2015年
4 孙修宇;基于GPU的三维图像重建方法研究[D];中国民航大学;2011年
5 刘伍锋;基于PCI总线的主设备功能仿真与验证[D];西安电子科技大学;2016年
6 李天骥;图形处理器存储系统的高精度System Verilog模型与自动化仿真验证[D];西安电子科技大学;2016年
7 陈贵华;基于RDMA高性能通信库的设计与实现[D];华中科技大学;2015年
8 马仁骏;CUDA图形处理器对聚类算法的加速实现[D];上海交通大学;2013年
9 黄伟钿;面向移动平台的3D图形处理器的设计[D];华南理工大学;2011年
10 王旭;图形处理器的仿真验证[D];哈尔滨工业大学;2007年
,本文编号:2148850
本文链接:https://www.wllwen.com/guanlilunwen/kehuguanxiguanli/2148850.html