大型社交网络中社团挖掘算法的研究
[Abstract]:It is of great significance to excavate the community structure in social networks for revealing the potential laws of the network and grasping the macroscopic characteristics of the network. At present, many community mining algorithms have emerged, some of which have the advantage of near linear complexity, but there are still two limitations in the application of large networks. One is the need to predict the number of network communities. Second, the low complexity of the algorithm is achieved at the expense of accuracy. Therefore, in order to make the existing community mining algorithms applicable to large networks, the two limitations are studied and improved accordingly in this paper. The main work is as follows: (1) the problems of common community mining algorithms in large networks are discussed. It is concluded that low time complexity algorithms have advantages but still have two limitations. Therefore, the current situation of community number estimation methods and labeling propagation community mining algorithms at home and abroad are investigated, and the advantages and disadvantages of these two existing methods are analyzed. (2) the accuracy of existing community number estimation methods is low. In this paper a method of estimating the number of communities using regular loop-free matrix is presented in which the computation is not efficient and the scope of application is limited. In this method, a regular loop free matrix is defined to describe the network, and the distribution of eigenvalues is observed and analyzed. Finally, the number of network communities is estimated by using the maximum position of the eigenspace. The proposed method is verified on artificial networks composed of two classical network generation models. (3) an improved labeling propagation algorithm is presented to deal with the problem of low complexity at the expense of the accuracy and stability of the label propagation algorithm. In this algorithm, all nodes are prioritized by the newly defined composite weights in the label propagation sequence, and the candidate labels are filtered by the contribution degree of the nodes in the process of label propagation. Finally, the new balanced node filtering mechanism is used to optimize the convergence conditions of the algorithm. The algorithm is validated on two standard large social network datasets. The experimental results show that the influence of degree heterogeneity distribution on the estimation of community number is mainly eliminated by using the regular loop free matrix estimation method, thus the accuracy of the estimation results is improved, and the range of application is not limited. Compared with label propagation algorithm and other two large network community mining algorithms, the improved labeling propagation algorithm not only has significant advantages in performance and quality, but also improves the efficiency of community mining.
【学位授予单位】:西安理工大学
【学位级别】:硕士
【学位授予年份】:2017
【分类号】:TP311.13;TP393.09
【参考文献】
相关期刊论文 前10条
1 刘功申;张浩霖;孟魁;苏波;;非随机的标签传播社区划分算法[J];上海交通大学学报;2015年08期
2 赵宝峰;赵菊敏;李灯熬;;一种稳定的标签传播社区发现算法[J];太原理工大学学报;2013年04期
3 张聪;沈惠璋;;基于谱方法的复杂网络中社团结构的模块度[J];系统工程理论与实践;2013年05期
4 康旭彬;贾彩燕;;一种改进的标签传播快速社区发现方法[J];合肥工业大学学报(自然科学版);2013年01期
5 黄健斌;钟翔;孙鹤立;茆婉婷;;基于相似性模块度最大约束标记传播的网络社团发现算法[J];北京大学学报(自然科学版);2013年03期
6 季青松;赵郁忻;陈乐生;陈秀真;李生红;;有效改善标签传播算法鲁棒性的途径[J];信息安全与通信保密;2012年09期
7 赵卓翔;王轶彤;田家堂;周泽学;;社会网络中基于标签传播的社区发现新算法[J];计算机研究与发展;2011年S3期
8 金弟;刘杰;杨博;何东晓;刘大有;;局部搜索与遗传算法结合的大规模复杂网络社区探测[J];自动化学报;2011年07期
9 汪小帆;刘亚冰;;复杂网络中的社团结构算法综述[J];电子科技大学学报;2009年05期
10 李晓佳;张鹏;狄增如;樊瑛;;复杂网络中的社团结构[J];复杂系统与复杂性科学;2008年03期
相关硕士学位论文 前3条
1 王锦锦;层次凝聚和主动学习半监督社团检测算法研究[D];兰州大学;2014年
2 黄中杰;社交网络中的视频观看质量优化[D];复旦大学;2012年
3 李亚飞;复杂网络中的社团结构检测算法研究[D];北京交通大学;2011年
,本文编号:2159165
本文链接:https://www.wllwen.com/guanlilunwen/ydhl/2159165.html