基于网络评论挖掘的商品综合评分模型研究
[Abstract]:As the Internet has become an important channel for Chinese Internet users to shop, online reviews are also flooded with every shopping platform, and the information provided by online product reviews also affects consumers' shopping behavior. Because there is a huge amount of data in web product reviews, and there are a lot of meaningless, even malicious spam comments, it is a waste of time for users to browse such a large number of comments. And the information obtained is not necessarily true and reliable. By analyzing the current situation of major shopping websites, we find that at present websites generally use a five-point rating system to visually show consumers' scoring of products. This independent rating and comment content leads users to see not only the rating but also the content of the comment to determine the exact information expressed by the comment. In view of the complex status of the above series, this paper based on the network comment mining, and combined with the garbage comment recognition research how to build a product comprehensive scoring model. The final result of this study is to construct a comprehensive commodity scoring model, in which the main process is the emotional analysis of the content of comment. In the emotional analysis of comment content, the word segmentation system is used to preprocess the comment, and the Apriori algorithm and pruning method are used to extract the feature words. Then the number of polar words is expanded with HowNet and "synonym forest", and the polar words are annotated with reference to "Chinese emotional Vocabulary Noumenon" to improve the content of polarity word dictionary. Finally, we use the method of membership degree to extract the Feature-Viewpoint word pair, and analyze the influence of degree adverb and negative word on the opinion word, and calculate the emotional value of the comment content to reflect the reviewer's emotion effectively. And on the basis of emotional analysis of comment content, this paper proposes a method of garbage comment recognition, which combines the behavior of commenters and the content of comments, and analyzes the characteristics of commenters' behaviors and comments' content. KNN classifier is used to classify comments effectively. The final model consists of four factors: the rating, the professional ability of the reviewer, the emotional value of the content of the comment, and whether the comment belongs to the spam. The model consists of two parts: the single comment scoring model and the commodity rating model. In the end, the experimental data are obtained from two mobile phones and one notebook provided by Datacom. In this paper, product feature word extraction, feature-viewpoint word pair extraction, comment content emotion analysis, garbage comment recognition and comprehensive scoring model are tested, and the results are analyzed. The experimental results show that, The method proposed in this paper is reasonable and effective.
【学位授予单位】:杭州电子科技大学
【学位级别】:硕士
【学位授予年份】:2016
【分类号】:F713.36;F713.55
【相似文献】
相关期刊论文 前10条
1 尹群;;信用评分模型中的拒绝推断[J];商场现代化;2010年27期
2 李雪含;杨吉峰;;信用风险评分模型初探[J];现代经济信息;2012年11期
3 张景肖;魏秋萍;姜玉霞;张波;;基于两阶段思想处理拒绝推断的信用评分模型[J];数理统计与管理;2012年06期
4 陈强;陈文彬;;关于零售评分模型部署方式的理论及实证研究[J];金融监管研究;2013年04期
5 刘树红;黄擎明;;可加性评分模型指标间相关性处理和权重确定的研究[J];数量经济技术经济研究;1990年11期
6 陈文华;;信用评分模型与方法[J];中国信用卡;2007年06期
7 黎玉华;;信用卡行为评分模型的开发[J];征信;2013年09期
8 李瑞丽,沈薇;利用统计学方法建立消费信贷评分模型[J];经济与管理;2005年04期
9 邓超;胡威;唐莹;;基于拒绝推论的小企业信用评分模型研究[J];国际金融研究;2011年04期
10 魏秋萍;张景肖;张波;;基于核函数法进行拒绝推断的信用评分模型[J];统计与决策;2012年12期
相关会议论文 前4条
1 毛建军;蔡卫民;;个人信用评分模型比较研究[A];科学发展观与系统工程——中国系统工程学会第十四届学术年会论文集[C];2006年
2 杨文斌;陈恩强;白浪;陈学兵;冯萍;唐红;;三种终末期肝病评分模型对早中期慢性重型乙型肝炎患者短期预后的评估价值[A];第6届全国疑难及重症肝病大会论文集[C];2011年
3 秦宛顺;石庆焱;;一个基于Logistic回归的个人信用评分模型[A];21世纪数量经济学(第4卷)[C];2003年
4 陈希镇;;关于评分模型的研究[A];全国教育与心理统计测量学术年会论文摘要集[C];2006年
相关重要报纸文章 前2条
1 记者王宙;我国个人通用信用评分模型研究取得阶段性成果[N];中国社会科学报;2010年
2 记者 周萃;信用评分模型有助于缓解小企业融资瓶颈[N];金融时报;2006年
相关博士学位论文 前1条
1 朱艳敏;基于信用评分模型的小微企业贷款的可获得性研究[D];苏州大学;2014年
相关硕士学位论文 前10条
1 谢荣华;多维项目反应理论等级评分模型的参数估计[D];西南大学;2015年
2 刘炯;科研论文影响力评估方法研究[D];华南理工大学;2015年
3 肖洋洋;汉语L_2口语的文本特征及评分模型初探[D];南京大学;2014年
4 张红艳;基于河南省某农村人群的2型糖尿病风险评分模型研究[D];郑州大学;2016年
5 陈薇;乙型肝炎病毒相关性肝细胞癌预测模型的验证及优化[D];福建医科大学;2015年
6 史玲玲;基于网络评论挖掘的商品综合评分模型研究[D];杭州电子科技大学;2016年
7 徐伟;基于信用评分模型的民营企业信用评级研究[D];广东商学院;2010年
8 肖涵敏;基于项目节点的项目评分模型及其应用研究[D];西南大学;2012年
9 傅强;基于集成化的个人信用评分模型研究[D];华中科技大学;2009年
10 曾辉;基于数据挖掘的银行个人客户信用评分模型的研究[D];对外经济贸易大学;2007年
,本文编号:2188213
本文链接:https://www.wllwen.com/jingjilunwen/guojimaoyilunwen/2188213.html