融合用户标签和微博内容的用户兴趣社区发现
发布时间:2019-04-26 19:08
【摘要】:随着社交网络的不断发展,微博已经成为了人们日常生活中不可或缺的一个重要组成部分。微博当中,微博用户的自定义用户标签以及用户发布微博,转发微博等微博行为,反映了用户的兴趣爱好,如何通过这些信息,挖掘用户兴趣,发现用户兴趣社区,具有重要的研究意义和价值。本文针对微博用户兴趣社区发现,主要从以下几个方面展开了深入研究与探讨: (1)提出了一种基于特征映射的用户标签兴趣建模方法。 针对用户标签反映用户兴趣的特点,选择用户标签作为用户兴趣模型的特征,并为了解决标签表述不一和长标签所带来的数据稀疏问题和噪声影响,引入特征映射的思想,将长标签进行分词处理,表示成子标签集,通过计算标签之间的相似度,将用户标签映射到相似度最高的特征维标签,利用标签相似度和标签频率的乘积作为特征维的特征值,构建用户标签兴趣模型,并利用模糊聚类方法,验证用户标签兴趣建模的有效性。 (2)提出了一种基于有指导LDA的用户微博内容兴趣建模方法。 针对微博文本交互对微博主题分布的影响,提出了一种有指导的LDA微博生成模型,综合考虑了转发微博,评论微博,回复微博,他人评论四个因素对用户微博兴趣主题分布的影响,在传统LDA模型的基础上,构建有指导的LDA微博生成模型,得到微博主题分布,并进一步得到用户主题分布,从微博内容角度对用户进行兴趣建模。 (3)提出了一种融合用户标签和微博内容的用户兴趣社区发现方法。 在(1)和(2)的基础上,利用用户模型相似度,构建用户标签兴趣关系网和微博内容兴趣关系网,与微博当中本身存在的用户关注关系网进行融合,并在此基础上,考虑微博用户可能属于多个社区所造成的社区重叠问题,提出了一种基于k-clique的用户兴趣社区发现方法,通过对社区重叠矩阵进行求解,得到社区连接矩阵,最后得到包含多个相连k-clique的用户兴趣社区。 (4)利用上述研究成果,设计实现了微博用户兴趣社区发现原型系统。
[Abstract]:With the development of social network, Weibo has become an indispensable part of people's daily life. In Weibo, the custom user tags of Weibo users and the Weibo behaviors such as publishing Weibo and forwarding Weibo by users reflect the interests of users, how to discover user interests through these information, and how to discover the community of interests of users. It has important research significance and value. In this paper, Weibo user interest community discovery, mainly from the following aspects of in-depth research and discussion: (1) A user tag interest modeling method based on feature mapping is proposed. In view of the characteristics of user tag reflecting user interest, the user tag is selected as the feature of user interest model. In order to solve the problem of data sparsity and noise caused by different label representation and long label, the idea of feature mapping is introduced. The long label is partitioned to represent the sub-tag set. By calculating the similarity between the tags, the user tag is mapped to the feature dimension tag with the highest similarity, and the product of the label similarity and the label frequency is used as the characteristic value of the feature dimension. User tag interest model is constructed, and fuzzy clustering method is used to verify the effectiveness of user tag interest modeling. (2) A user Weibo content interest modeling method based on guided LDA is proposed. Aiming at the influence of Weibo text interaction on the topic distribution of Weibo, this paper presents a guided generation model of Weibo, which considers forwarding Weibo, commenting Weibo and replying to Weibo. Other people comment on the influence of four factors on user Weibo interest theme distribution. Based on the traditional LDA model, a guided LDA Weibo generation model is constructed, and the Weibo theme distribution is obtained, and the user theme distribution is further obtained. Modeling the user's interest from the point of view of Weibo content. (3) A community of interest discovery method combining user tags and Weibo content is proposed. On the basis of (1) and (2), the user tag interest relation network and Weibo content interest relation network are constructed by using the similarity of user model, and are merged with the user concern relation network in Weibo itself, and on this basis, the user tag interest relation network and the content interest relationship network in Weibo are constructed. Considering the problem of community overlap caused by Weibo users belonging to more than one community, this paper proposes a community of interest community discovery method based on k-clique, and obtains the community connection matrix by solving the community overlap matrix. Finally, the user interest community containing multiple connected k-clique is obtained. (4) based on the above research results, a prototype system of Weibo user interest community discovery is designed and implemented.
【学位授予单位】:昆明理工大学
【学位级别】:硕士
【学位授予年份】:2014
【分类号】:TP393.092
本文编号:2466320
[Abstract]:With the development of social network, Weibo has become an indispensable part of people's daily life. In Weibo, the custom user tags of Weibo users and the Weibo behaviors such as publishing Weibo and forwarding Weibo by users reflect the interests of users, how to discover user interests through these information, and how to discover the community of interests of users. It has important research significance and value. In this paper, Weibo user interest community discovery, mainly from the following aspects of in-depth research and discussion: (1) A user tag interest modeling method based on feature mapping is proposed. In view of the characteristics of user tag reflecting user interest, the user tag is selected as the feature of user interest model. In order to solve the problem of data sparsity and noise caused by different label representation and long label, the idea of feature mapping is introduced. The long label is partitioned to represent the sub-tag set. By calculating the similarity between the tags, the user tag is mapped to the feature dimension tag with the highest similarity, and the product of the label similarity and the label frequency is used as the characteristic value of the feature dimension. User tag interest model is constructed, and fuzzy clustering method is used to verify the effectiveness of user tag interest modeling. (2) A user Weibo content interest modeling method based on guided LDA is proposed. Aiming at the influence of Weibo text interaction on the topic distribution of Weibo, this paper presents a guided generation model of Weibo, which considers forwarding Weibo, commenting Weibo and replying to Weibo. Other people comment on the influence of four factors on user Weibo interest theme distribution. Based on the traditional LDA model, a guided LDA Weibo generation model is constructed, and the Weibo theme distribution is obtained, and the user theme distribution is further obtained. Modeling the user's interest from the point of view of Weibo content. (3) A community of interest discovery method combining user tags and Weibo content is proposed. On the basis of (1) and (2), the user tag interest relation network and Weibo content interest relation network are constructed by using the similarity of user model, and are merged with the user concern relation network in Weibo itself, and on this basis, the user tag interest relation network and the content interest relationship network in Weibo are constructed. Considering the problem of community overlap caused by Weibo users belonging to more than one community, this paper proposes a community of interest community discovery method based on k-clique, and obtains the community connection matrix by solving the community overlap matrix. Finally, the user interest community containing multiple connected k-clique is obtained. (4) based on the above research results, a prototype system of Weibo user interest community discovery is designed and implemented.
【学位授予单位】:昆明理工大学
【学位级别】:硕士
【学位授予年份】:2014
【分类号】:TP393.092
【参考文献】
相关期刊论文 前7条
1 林友芳;王天宇;唐锐;周元炜;黄厚宽;;一种有效的社会网络社区发现模型和算法[J];计算机研究与发展;2012年02期
2 赵妍;赵学民;;基于CURE的用户聚类算法研究[J];计算机工程与应用;2012年11期
3 毛晓星;薛安荣;鞠时光;;基于加权语义网和有效信息的个性化用户兴趣建模[J];计算机应用研究;2010年09期
4 黄发良;肖南峰;;基于线图与PSO的网络重叠社区发现[J];自动化学报;2011年09期
5 蔡国永;林航;文益民;;社会语义网社区发现标签传递算法研究[J];计算机科学;2013年02期
6 闫光辉;舒昕;马志程;李祥;;基于主题和链接分析的微博社区发现算法[J];计算机应用研究;2013年07期
7 王志超;于剑;柴变芳;;基于集聚系数的链接社区发现方法[J];南京大学学报(自然科学版);2013年04期
,本文编号:2466320
本文链接:https://www.wllwen.com/guanlilunwen/ydhl/2466320.html