基于稀疏编码理论的图像多标签排序算法研究
[Abstract]:In today's high-speed Internet era, the popularity of many digital imaging devices, coupled with the advancement of Internet technology, Internet images are playing an increasingly important role in our lives. Network image search has become a very active and challenging research topic in the field of computer vision. Unlike a decade ago, the Internet now makes it easy to create, upload, share, and distribute digital images on the Internet. Social media, such as Facebook, YouTube, Flickr, allow image uploaders to provide a set of keywords (also known as Social Tags) that describe the image. To index images, the semantic annotation of images is accomplished by the cooperation of users through the network, so this kind of image set is also called Collaborative-Tagged Images. There are a large number of tagged image sets on image sharing websites such as R. This sharing method based on social tagging will greatly improve the performance of mass image organization and retrieval on the Internet. Therefore, how to use these tagged image sets more effectively is the key to improve the performance of automatic image annotation. One of the problems is that users usually upload tags corresponding to images in Random Order, i.e. the set of tags submitted by users is not always sorted according to the size of tag-to-image semantic relevance (Tag Relevance). At present, Flickr does not provide a Relevance-based Ranking-based retrieval sorting mechanism. The random sorting of label sets restricts further application of massive image retrieval performance. Most Interesting: that is, according to the click rate of users, the number of comments and so on, but Flickr can not provide the retrieval mode according to semantic relevance at present. In other words, although community-based annotation sharing greatly improves the performance of mass image organization and retrieval on the Internet, users usually upload tags corresponding to images in random order, i.e. the set of tags submitted by users is not always in accordance with the semantic relevance between tags and image content (Tag R). The random ordering of label sets restricts the further application of massive image retrieval performance. Therefore, label ordering is becoming a new hotspot in multimedia research field. It should be pointed out that the image set ordered by semantic correlation can be used as the key to represent semantics. As mentioned earlier, community labeling has become a popular way to capture, classify, and retrieve content on the Internet, and has been successfully applied in the management and retrieval of real social media systems. Although users provide tags to describe the content of a community image, because these tags are from different cultural backgrounds, network users with knowledge structures have their own subjective understanding of the content of the image. The label quality of community image can not be directly used as a reliable image indexing keyword for keyword-based image retrieval. At present, the label of community image mainly exists the disorder of label arrangement and the imprecision of label content, so the semantic understanding of labeled community image mainly focuses on. Some research institutes (such as MSRA) have studied the Tag Ranking problem. Since an image may be labeled with several semantic conceptual markers at the same time, it is a typical multi-marker learning problem. The image itself has a certain degree of semantic ambiguity. However, the realization of tag set sorting according to semantic relevance is abstracted as a typical multi-label Ranking problem. At present, there are many researches on multi-label learning, while there are relatively few researches on multi-label sorting problem. Most of the existing Tag Ranking algorithms focus on Relevance-based Tag Ranking. Intuitively, given an image and a label set, if the correlation of a label A in the label set is higher than that of label B, it is shown that In other words, the frequency of tag A appearing in the subset of K-nearest neighbor image of a given image is higher than that of tag B. This kind of algorithm mainly has two representative works. (1) Statistical model-based algorithm. Modeling sorting algorithm; (2) Data-driven sorting algorithm. Statistical model-based sorting algorithm uses kernel density estimation to estimate the semantic correlation between each tag in an image and the image itself. Its essence is to estimate the Typicality of the sample, if the image represents the region of a semantic tag. Low-level visual features are more typical, that is, if the feature vectors of the region with the same label are closer in the feature space, the semantic relevance of the label will be high; considering the semantic correlation between the labels, random walk algorithm is used to improve the sorting results and achieve the final label sorting. Because global low-level visual features represent images with multi-label semantics, it is impossible to estimate the density of each label in the feature space. Data-driven sorting algorithm gets a subset of the nearest neighbor images of a given image by simple image global feature matching, and counts them by Neighbor-voting strategy. Different from the sorting algorithm based on statistical model, the data-driven sorting algorithm only uses the visual features of the image when selecting the nearest neighbor sample set of the sorted image without considering the label information of the image. Intuitively, tag sorting algorithm based on nearest neighbor voting mechanism shows good scalability in massive image datasets because of its simplicity. However, it should be pointed out that this kind of algorithm ignores the semantic correlation between tags, so its sorting performance is not very good. Secondly, the algorithm also uses global visual features. In this paper, an improved image multi-label sorting algorithm is proposed. By introducing the sparse representation theory in the field of signal processing, the nearest neighbor image retrieval problem is transformed into a sparse reconstruction problem, which improves the selection of nearest neighbor image sets. In recent years, the combination of Compressed Sensing (CS) and feature selection theory and method to form more effective sparse representations for images has become a hot topic in the field of computer vision and machine learning. Tibshirani and Breiman of the University of California, Berkeley, et al. almost simultaneously proposed the idea of associating feature selection sparsely with (?) 1-norm constraints to make the selected features as sparse as possible and to improve the interpretability and accuracy of the data processing process. For statistical analysis of high-dimensional data, the theory and method of image semantic understanding can be studied on the basis of sparse representation. The idea of image multi-label sorting algorithm based on sparse representation proposed in this paper is as follows: Firstly, the algorithm essentially belongs to image multi-label sorting based on semantic correlation sorting. Given a test image to be sorted and a large set of labeled community images, we consider the test image to be sorted as a test sample to be reconstructed, and the large set of labeled community images as an over-complete dictionary. The sparse reconstruction of a few samples from the over-complete dictionary can be used to characterize the semantic similarity and correlation between each labeled image and the sample image in the bullet based on the sparse coefficient vector obtained from the learning. Therefore, each dimension of the sparse coefficient vector obtained from the learning represents the test sample image and the dictionary. Finally, the nearest neighbor image subset of the test image is obtained based on the acquired semantic correlation, and the frequency of each key word in the tag sequence is counted by the nearest neighbor voting strategy, and the tag sequence is sorted according to the frequency. Considering the semantic correlation between tags (i.e. the symbiosis relationship), the Random Walk algorithm is used to improve the sorting result and achieve the final sorting. We implement the algorithm proposed in this paper by using MATLAB programming language and carry out experimental verification on the NUS-WIDE image data set. Comparison is made to verify the effectiveness of our proposed sparse representation based image label sorting algorithm.
【学位授予单位】:北京交通大学
【学位级别】:硕士
【学位授予年份】:2017
【分类号】:TP391.41
【相似文献】
相关期刊论文 前10条
1 安朝辉;钱剑敏;;一种新的排序算法——端点排序算法[J];现代电子技术;2011年24期
2 卢敏;黄亚楼;谢茂强;王扬;刘杰;廖振;;代价敏感的列表排序算法[J];计算机研究与发展;2012年08期
3 张正铀;;散列排序算法[J];广西科学院学报;1982年01期
4 全惠云;;基于矩阵分裂法的一类异步N&行排序算法[J];计算技术与自动化;1991年01期
5 董德林;两个高效排序算法的APPLESOFT BASIC程序[J];丽水师专学报;1992年S1期
6 王晓东;最优堆排序算法[J];小型微型计算机系统;2000年05期
7 吴江,张德同;二次分“档”链接排序算法分析[J];计算机研究与发展;2001年08期
8 李德启,王雄;一种新型快速的排序算法[J];计算机工程;2001年03期
9 赵忠孝;一种新的散列排序算法[J];电脑开发与应用;2001年03期
10 许善祥,朱学东,邵敬春;选择排序算法的改进[J];佳木斯大学学报(自然科学版);2001年04期
相关会议论文 前10条
1 周晓方;金志权;;寻找最佳分布式排序算法[A];第九届全国数据库学术会议论文集(上)[C];1990年
2 张艳秋;李建中;;一种基于蛇型磁带的排序算法[A];第十八届全国数据库学术会议论文集(研究报告篇)[C];2001年
3 刘春阳;叶君峰;母海龙;陆秋霞;陈沧;高莺;;一种商品标题主题词的重要性排序算法[A];第五届全国信息检索学术会议论文集[C];2009年
4 王少帅;汤庆新;姚路;;并行独立集排序算法的改进与实现[A];第十六届全国青年通信学术会议论文集(上)[C];2011年
5 于芳;王大玲;于戈;陈冬玲;鲍玉斌;;面向用户的排序算法研究[A];第二十四届中国数据库学术会议论文集(研究报告篇)[C];2007年
6 闫泼;马军;陈竹敏;;面向主题的网页排序算法研究[A];第三届全国信息检索与内容安全学术会议论文集[C];2007年
7 张健沛;李连江;杨静;;个性化搜索引擎排序算法的研究与改进[A];第三届全国信息检索与内容安全学术会议论文集[C];2007年
8 吴志彬;陈义华;;ANP中超矩阵排序算法研究[A];2006中国控制与决策学术年会论文集[C];2006年
9 陈丛丛;石冰;陈健;;面向主题的查询相关网页排序算法[A];第三届中国智能计算大会论文集[C];2009年
10 齐曼;张珩;;实时视觉仿真中帧连贯性应用[A];'2000系统仿真技术及其应用学术交流会论文集[C];2000年
相关重要报纸文章 前1条
1 广东 黄陀;基本算法简介(三)[N];电脑报;2001年
相关博士学位论文 前3条
1 赵立军;基于归并的高效排序算法的研究[D];中国科学院研究生院(计算技术研究所);1998年
2 崔筠;无向基因组的移位排序算法[D];山东大学;2006年
3 郝凡昌;有向基因组复合操作重组排序算法研究[D];山东大学;2011年
相关硕士学位论文 前10条
1 徐林龙;基于商品特征属性的排序算法研究[D];西南交通大学;2015年
2 陈浩;基于图理论的图像搜索结果重排序的研究[D];安徽大学;2016年
3 双全;基于用户行为分析的搜索排序算法研究[D];华中科技大学;2014年
4 王麒深;面向网络舆情的社会情感排序算法研究[D];中国民航大学;2012年
5 郭佳;一种SDN环境中的网络节点重要性排序算法[D];西安电子科技大学;2015年
6 冯少泳;两层哈希的重排序算法[D];华南理工大学;2016年
7 陆沛栋;基于可重构SoC平台的排序算法设计和自相关算法优化[D];南京大学;2017年
8 祁洋;RankNet学习排序算法的一种改进[D];吉林大学;2017年
9 RAPHAEL DE-SOUZA;基于稀疏编码理论的图像多标签排序算法研究[D];北京交通大学;2017年
10 王靖;数据库管理系统中高能效排序算法[D];浙江工业大学;2012年
,本文编号:2225130
本文链接:https://www.wllwen.com/kejilunwen/ruanjiangongchenglunwen/2225130.html