Research on LBSN Link Prediction Fusing Multi-Dimensional Check-in Information
[Abstract]: With the rapid development of mobile Internet technology and the growing number of location-based services, more and more people share geotagged pictures, videos, and text through online social networks. Such networks are called location-based social networks (LBSNs). Data mining on social networks is also known as link mining, and friend link prediction in LBSNs is a branch of link mining that has become an active research topic. The large volume of check-in information that LBSNs provide along the time and space dimensions opens a new direction for link prediction. However, the sparse check-in distribution of LBSN users and the reliance on a single dimension of analysis make it difficult to improve prediction performance. To address these problems, this thesis mines the user similarity features contained in check-in information from four dimensions (user, time, location, and location semantics) and combines these features through a supervised learning strategy for link prediction. Simulation results on real network data sets show that the proposed method significantly improves link prediction performance. This work is supported by the National Natural Science Foundation of China (No. 61172072, 61271308), the Beijing Natural Science Foundation (No. 4112045), and the Specialized Research Fund for the Doctoral Program of Higher Education (No. 20100009110002).
The main work and contributions of this thesis are as follows:
(1) The distribution characteristics of LBSN data sets built on check-in behavior are analyzed from three dimensions: user, location, and time. The analysis shows that the check-in distribution of LBSN users is sparse, which makes it difficult to make full use of the check-in information.
(2) To address the sparse distribution of check-in locations, a hierarchical clustering algorithm is used to cluster the check-in locations, and the concept of a generalized location is introduced. A generalized location relationship network is then constructed, which greatly reduces the number of outliers in the network while retaining as many of its users as possible. To address the sparse distribution of user check-ins along the time dimension, the similarity of a single user's check-in behavior at different times is used to correct the similarity of check-in behavior between two users at different times.
(3) A UTP model is proposed to mine user similarity features in the spatio-temporal dimensions, yielding similarity features that integrate user, location, and check-in time information. Verification on real network data sets shows that the two resulting features can effectively distinguish friends from non-friends.
(4) User similarity features are mined from the location semantic dimension. Following the idea of LDA topic modeling of documents, location topics are modeled over the semantic POI information of all users' check-ins, and a user similarity feature based on check-in location semantics is proposed. Verification on real network data sets shows that this feature can effectively distinguish friends from non-friends.
(5) LBSN network structure information, check-in location information, and location semantic information are combined into a multi-dimensional similarity feature vector, and a supervised strategy is used for link prediction. Experiments on real network data sets show that the proposed link prediction algorithm based on multi-dimensional information significantly improves LBSN link prediction performance compared with traditional link prediction algorithms.
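The generalized-location idea in contribution (2) can be illustrated with a minimal sketch. Everything specific in it is an assumption made for illustration and is not taken from the thesis: single-linkage clustering cut at a fixed distance threshold, the haversine distance, the Jaccard overlap used as the similarity feature, the names `generalized_locations` and `location_similarity`, and the toy check-in data. The abstract does not state which linkage, threshold, or similarity measure the author actually used.

```python
import math
from itertools import combinations

def haversine_km(a, b):
    """Great-circle distance in kilometres between two (lat, lon) points."""
    lat1, lon1, lat2, lon2 = map(math.radians, (a[0], a[1], b[0], b[1]))
    h = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * 6371.0 * math.asin(math.sqrt(h))

def generalized_locations(points, max_km):
    """Map each location id to a cluster label (hypothetical helper).

    Single-linkage hierarchical clustering cut at max_km is equivalent to
    taking connected components of the graph that links every pair of
    locations at most max_km apart, so a union-find structure suffices.
    """
    parent = {loc: loc for loc in points}

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for p, q in combinations(points, 2):
        if haversine_km(points[p], points[q]) <= max_km:
            parent[find(p)] = find(q)
    return {loc: find(loc) for loc in points}

def jaccard(a, b):
    """Jaccard overlap of two sets; 0.0 when both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def location_similarity(checkins, user_a, user_b, mapping=None):
    """Jaccard similarity over raw or generalized check-in locations."""
    def visited(user):
        locs = checkins[user]
        return {mapping[l] for l in locs} if mapping is not None else set(locs)
    return jaccard(visited(user_a), visited(user_b))

# Toy data (invented): two venues about 50 m apart and one several km away.
points = {
    "cafe_a": (39.9500, 116.3400),
    "cafe_b": (39.9503, 116.3404),
    "museum": (39.9000, 116.4000),
}
checkins = {"alice": ["cafe_a"], "bob": ["cafe_b", "museum"]}

mapping = generalized_locations(points, max_km=0.2)
raw_sim = location_similarity(checkins, "alice", "bob")               # 0.0
general_sim = location_similarity(checkins, "alice", "bob", mapping)  # 0.5
```

On the raw venues the two users share nothing, so the feature is zero. After the two nearby venues are merged into one generalized location, the overlap becomes visible, which is the effect on sparsity that the abstract describes.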
[Degree-granting institution]: Beijing Jiaotong University
[Degree level]: Master's
[Year degree granted]: 2017
[Classification number]: TP393.09;TP311.13
Article ID: 2281341
Article link: https://www.wllwen.com/guanlilunwen/ydhl/2281341.html