Phrase-based hashtag recommendation for microblog posts
本文选题:recommendation + topic ; 参考:《Science China(Information Sciences)》2017年01期
【摘要】:In microblogs, authors use hashtags to mark keywords or topics. These manually labeled tags can be used to benefit various live social media applications(e.g., microblog retrieval, classification). However,because only a small portion of microblogs contain hashtags, recommending hashtags for use in microblogs are a worthwhile exercise. In addition, human inference often relies on the intrinsic grouping of words into phrases.However, existing work uses only unigrams to model corpora. In this work, we propose a novel phrase-based topical translation model to address this problem. We use the bag-of-phrases model to better capture the underlying topics of posted microblogs. We regard the phrases and hashtags in a microblog as two different languages that are talking about the same thing. Thus, the hashtag recommendation task can be viewed as a translation process from phrases to hashtags. To handle the topical information of microblogs, the proposed model regards translation probability as being topic specific. We test the methods on data collected from realworld microblogging services. The results demonstrate that the proposed method outperforms state-of-the-art methods that use the unigram model.
[Abstract]:In microblogs, authors use hashtags to mark keywords or topics. These manually labeled tags can be used to benefit various live social media applications(e.g., microblog retrieval, classification). However,because only a small portion of microblogs contain hashtags, recommending hashtags for use in microblogs are a worthwhile exercise. In addition, human inference often relies on the intrinsic grouping of words into phrases.However, existing work uses only unigrams to model corpora. In this work, we propose a novel phrase-based topical translation model to address this problem. We use the bag-of-phrases model to better capture the underlying topics of posted microblogs. We regard the phrases and hashtags in a microblog as two different languages that are talking about the same thing. Thus, the hashtag recommendation task can be viewed as a translation process from phrases to hashtags. To handle the topical information of microblogs, the proposed model regards translation probability as being topic specific. We test the methods on data collected from realworld microblogging services. The results demonstrate that the proposed method outperforms state-of-the-art methods that use the unigram model.
【作者单位】: Shanghai
【基金】:funded by National Natural Science Foundation of China (Grant Nos. 61473092, 61472088, 61532011) National High Technology Research and Development Program of China (Grant No. 2015AA015408) Shanghai Science and Technology Development Funds (Grant Nos. 13dz2260200, 13511504300)
【分类号】:TP391.3
【相似文献】
相关期刊论文 前4条
1 ;Small-world patterns in Chinese phrase networks[J];Chinese Science Bulletin;2005年03期
2 苏劲松;刘洋;刘群;董槐林;;基于图的统计机器翻译词汇化调序模型(英文)[J];中国通信;2014年05期
3 陈毅东;史晓东;;Improving Phrase-Based Statistical Machine Translation Models by Incorporating Syntax-Based Language Models[J];Journal of Donghua University(English Edition);2010年02期
4 ;[J];;年期
相关会议论文 前2条
1 ;RESEARCH ON THE PROFESSIONAL TRAINING OF PHYSICAL EDUCATION TEACHERS[A];第五届全国青年体育科学学术会议、第二届中国体育博士高层论坛论文集[C];2008年
2 ;A Synchronous Grammar for Statistical MachineTranslation[A];机器翻译研究进展——第四届全国机器翻译研讨会论文集[C];2008年
相关硕士学位论文 前3条
1 曾嘉鸣;汉语主语省略机制及其法译研究[D];广东外语外贸大学;2015年
2 张元;法汉翻译中的否定现象[D];外交学院;2009年
3 杨宁;汉语助词在外向型汉法词典中的处理[D];广东外语外贸大学;2003年
,本文编号:1891280
本文链接:https://www.wllwen.com/kejilunwen/ruanjiangongchenglunwen/1891280.html