当前位置:主页 > 管理论文 > 移动网络论文 >

基于融合特征的微博主客观分类方法研究

发布时间:2019-01-23 18:23
【摘要】:越来越多的用户喜欢通过微博来实时分享自己的观点或者表达自己的情感,因此,面向微博的意见挖掘或情感分析成为了研究热点。其中,微博主客观分类研究是意见挖掘和情感分析研究的基础,其主要任务是区分微博中表达主观观点的文本和表达客观事实的文本,并进一步从主观性文本中挖掘潜在有价值的信息。此外,面向微博的主客观分类研究对意见问答系统、观点摘要等工作也具有重要意义。本文针对中文微博的主客观分类问题,分别研究了语法和语义特征结合不同特征选择方法对微博主客观分类的影响,同时对基于融合特征的微博主客观分类方法进行了探索性研究。本文主要研究成果如下:(1)针对语法特征,提出了基于2-gram的词、词性特征的提取算法。本文借鉴2-gram模型分别提取了微博文本的连续双词(2-word)、连续双词词性(2-pos)组合模式特征作为语法特征来进行微博主客观分类研究。(2)针对语义特征,充分考虑了情感分析经验以及微博文本特点,提出了微博内容特征、比重特征等丰富的语义特征,并引入了微博文本情感词库来进行微博主客观分类研究。(3)针对微博文本特征选择问题,分别对两类特征选择方法进行了分类性能比较。本文分别利用不同特征选择方法对语法和语义特征进行评估来获取最优特征集,并结合分类模型对分类效果进行了比较。(4)针对微博主客观分类问题,提出了一种基于融合特征的微博主客观分类方法。该方法通过设计特征融合算法对不同特征选择方法进行有效组合来获取融合特征,并结合机器学习方法来进行微博主客观分类研究。本文研究构建了更丰富的主客观分类特征,并设计了一种特征融合算法来探索特征选择方法组合后对主客观分类效果的影响。实验证明,本文提出的特征融合算法可以有效提高主客观分类效果,同时构建了相对通用的主客观分类模型。
[Abstract]:More and more users like to share their views or express their emotions through Weibo in real time. Among them, Weibo's subjective and objective classification is the basis of opinion mining and emotional analysis, and its main task is to distinguish the text expressing subjective views from the text expressing objective facts in Weibo. And further mining the potentially valuable information from the subjective text. In addition, the subjective and objective classification research for Weibo is also of great significance to the quizzes and abstracts of opinions. Aiming at the subjective and objective classification of Chinese Weibo, this paper studies the influence of grammar and semantic features combined with different feature selection methods on the subjective and objective classification of Weibo. At the same time, the subjective and objective classification method of Weibo based on fusion features is studied. The main results of this paper are as follows: (1) an algorithm of extracting word and part of speech features based on 2-gram is proposed for grammatical features. This paper draws on the 2-gram model to extract the 2-word of Weibo's text and the combination pattern feature of continuous two-word (2-pos) as grammatical features. (2) aiming at the semantic features, we study the subjective and objective categorization of Weibo. Having fully considered the experience of emotional analysis and the characteristics of Weibo's text, we put forward the abundant semantic features such as the content features and specific weight features of Weibo. Weibo text emotion thesaurus is introduced to study the subjective and objective classification of Weibo. (3) two feature selection methods are compared in order to solve the feature selection problem. In this paper, different feature selection methods are used to evaluate the grammar and semantic features to obtain the optimal feature collection, and the classification effect is compared with the classification model. (4) aiming at the objective and subjective classification problem of Weibo, This paper presents a subjective and objective classification method for Weibo based on fusion features. In this method, the feature fusion algorithm is designed to effectively combine different feature selection methods to obtain the fusion features, and the machine learning method is used to study Weibo's subjective and objective classification. In this paper, a more abundant subjective and objective classification features are constructed, and a feature fusion algorithm is designed to explore the effect of the combination of feature selection methods on the subjective and objective classification results. Experimental results show that the proposed feature fusion algorithm can effectively improve the subjective and objective classification effect, and build a relatively common subjective and objective classification model.
【学位授予单位】:山西大学
【学位级别】:硕士
【学位授予年份】:2014
【分类号】:TP391.1;TP393.092

【参考文献】

相关硕士学位论文 前1条

1 张博;基于SVM的中文观点句抽取[D];北京邮电大学;2011年



本文编号:2414082

资料下载
论文发表

本文链接:https://www.wllwen.com/guanlilunwen/ydhl/2414082.html


Copyright(c)文论论文网All Rights Reserved | 网站地图 |

版权申明:资料由用户d98ea***提供,本站仅收录摘要或目录,作者需要删除请E-mail邮箱bigeng88@qq.com