基于情感字集的中文情感倾向性分类研究
[Abstract]:Emotional preference classification generally refers to the emotional polarity of the text, such as positive, negative, neutral, etc. In big data's time, it was mainly used to investigate the attitude of the public towards a certain event, person or group. Traditional methods are especially time-consuming and have great limitations. Nowadays, it is more rapid and convenient to get the opinions of others by searching the vast amount of information on the Internet, and the reliability of the opinions obtained from these information is often higher. This paper first analyzes the situation of Chinese affective preference classification based on affective dictionary and carries on the traditional Chinese affective tendency classification experiment by using ICTCLAS participle and Know-net emotion dictionary. After analyzing and summing up the experimental results, It is found that no matter which kind of word segmentation tool or emotion dictionary is used, it will bring some uncertain interference to the classification results of affective tendency, especially different emotion dictionaries have great differences in reliability and category of analysis. In view of the above, this paper proposes the concept of "affective word set", which is not only independent of the usage category but also does not need Chinese word segmentation. So the first thing here is to find out such a set of emotional words: the words themselves can affect the emotional tendency of the words after the words, or the word itself has a strong emotional tendency. In this paper, two different versions of "affective word sets" are mined from two different sources, and the two versions are experimented with to obtain different experimental results. Finally, the better version of the experiment is chosen to improve the calculation method of affective tendency. Because there is no participle process, the common negative words and degree words are summed up and arranged separately, and the influence of these negative words and degree words on the affective words is added to the experimental algorithm. The affective preference classification based on the affective word set is calculated according to the emotion value of each word when calculating the affective tendency value of the sentence, and all the words are completely independent. Some special phrases may affect the emotional tendency of sentences after they are split, so we use the maximum forward matching method to identify these words. Finally, by searching the correlation between words, the information entropy of the words of the same continuous type is reduced, and the accuracy of the experiment is further improved. The highest accuracy rate is nearly 20% higher than that of the traditional words.
【学位授予单位】:昆明理工大学
【学位级别】:硕士
【学位授予年份】:2017
【分类号】:TP391.1
【参考文献】
相关期刊论文 前10条
1 李婷婷;姬东鸿;;基于SVM和CRF多特征组合的微博情感分析[J];计算机应用研究;2015年04期
2 朱玺;董喜双;关毅;刘志广;;基于半监督学习的微博情感倾向性分析[J];山东大学学报(理学版);2014年11期
3 杨丽tD;;汉字字体设计中造型的情感表现方式[J];设计;2014年09期
4 刘姗;胡勇;;中文网络话题评论文本语义倾向分析[J];信息安全与通信保密;2012年06期
5 谢丽星;周明;孙茂松;;基于层次结构的多策略中文微博情感分析和特征抽取[J];中文信息学报;2012年01期
6 吴亮;;一种改进的最大匹配分词算法研究[J];现代商贸工业;2010年09期
7 杨超;冯时;王大玲;杨楠;于戈;;基于情感词典扩展技术的网络舆情倾向性分析[J];小型微型计算机系统;2010年04期
8 张小艳;宋丽平;;论文本分类中特征选择方法[J];现代情报;2009年03期
9 吴志杰;;机器翻译中汉语词语切分的现状——汉语分词与汉英机器翻译研究系列之一[J];外语研究;2009年01期
10 周立柱;贺宇凯;王建勇;;情感分析研究综述[J];计算机应用;2008年11期
相关博士学位论文 前1条
1 吴苑斌;情感倾向分析中的结构化方法[D];复旦大学;2012年
相关硕士学位论文 前8条
1 李明;面向微博电影评论的情感分类研究[D];云南财经大学;2014年
2 樊小超;基于机器学习的中文文本主题分类及情感分类研究[D];南京理工大学;2014年
3 陈晓东;基于情感词典的中文微博情感倾向分析研究[D];华中科技大学;2012年
4 张立;基于新闻评论数据的K-means聚类算法的研究[D];太原理工大学;2010年
5 杨超;基于情感词典扩展技术的网络舆情倾向性分析[D];东北大学;2009年
6 邸锦;基于支持向量机的文本分类问题的研究[D];北京交通大学;2008年
7 刘伟;基于限定领域的问句相似度[D];天津师范大学;2008年
8 郑伟;基于类别均衡的文本分类算法研究[D];西安电子科技大学;2006年
,本文编号:2391262
本文链接:https://www.wllwen.com/kejilunwen/ruanjiangongchenglunwen/2391262.html