图像特征表示的学习算法研究
[Abstract]:In many computer vision tasks, one of the intrinsic difficulties is to generate well-discriminatory image representation, i.e. high-performance image features. Since image features are robust enough to deal with intra-class variations and discriminant enough to deal with inter-class variations, designing excellent image features is a challenging task. Image features are generally divided into image block hierarchical features and image level features (i.e. local features and global features), the former is used to describe an image block and the latter is used to describe a complete image. The main research results are summarized as follows: (1) Firstly, a new image layer feature representation is proposed for image classification. The traditional Bag-of-Words model completely discards the spatial distribution information of features and loses some discriminant power. Spatial Correlogram (SCR) is a feature representation method, which describes the spatial distribution of local features by capturing the frequency of common occurrence of visual word pairs in the spatial range, thus improving the discriminant ability of image recognition. In addition, we combine the correlation graph features with the spatial pyramid model to generate a hybrid feature. Detailed experiments on the scene/object database show that the proposed correlation graph features and hybrid features can achieve higher image classification accuracy than the traditional word packet model. (2) Secondly, this paper proposes a new image classification method. Efficient Kernel Descriptor (EKD) is a new feature representation of image blocks. The design of image block features also belongs to the basic research content in the field of computer vision. Excellent image block feature representation can effectively improve the performance of image classification, object recognition and other related algorithms, but artificially designed images. Kernel Descriptor (KD) method provides a new way to generate image block features. Kernel Principal Component Analysis (KPCA) method is applied to feature representation based on matching kernel functions between image blocks. However, this method needs all joint basis vectors to generate kernel descriptor features, which results in high time complexity. Therefore, we design an efficient kernel descriptor algorithm. The algorithm is based on the incomplete Cholesky decomposition and automatically selects a small number of Pivot associations. The experimental results show that the efficient kernel descriptor (EKD) achieves better performance than the original kernel descriptor (KD) in image / scene classification applications. (3) Thirdly, on the basis of constructing an efficient kernel descriptor (EKD), we propose a new image layer feature representation, which is efficient. Efficient Hierarchical Kernel Descriptor (EHKD). Primitive Kernel Descriptor (KD) features can only be used to describe image blocks, so Bo et al. proposed Hierarchical Kernel Descriptor (HI KD) to describe the whole image in the framework of kernel descriptor (KD) algorithm. The construction process is similar to that of the kernel descriptor (KD), so the generation hierarchical kernel descriptor (HKD) algorithm will also encounter the computational efficiency problem in the generation kernel descriptor (KD) algorithm. To overcome this problem, we design an efficient hierarchical kernel descriptor algorithm. The experimental results show that the efficient hierarchical kernel descriptor (EHKD) has advantages over the hierarchical kernel descriptor (HKD) in computational efficiency and feature representation ability. (4) Finally, a supervised image block feature representation is proposed. Supervised Efficient Kernel Descriptor (SEKD). The previously mentioned kernel descriptor (KD) methods and efficient kernel descriptor (EKD) methods belong to the category of unsupervised learning. They design block-level features through similarity between image blocks and display them. Compared with the hand-designed image block features, these two methods give the interpretation of gradient-oriented histogram from the point of view of kernel, and use the information of pixels to "grow" the image block hierarchical features. Considering the label information of the image block itself, it is necessary to design a feature learning method which integrates the label information of the image in supervised mode. For this reason, we propose an efficient kernel descriptor algorithm based on supervised learning. The algorithm is based on the incomplete Cholesky decomposition algorithm which integrates the label information of the image class. Supervised Learning Efficient Kernel Descriptor (SEKD) has the advantage of shorter representation dimension and stronger discriminant ability than unsupervised learning.
【学位授予单位】:北京交通大学
【学位级别】:博士
【学位授予年份】:2016
【分类号】:TP391.41
【相似文献】
相关期刊论文 前10条
1 陈芳;一种基于错切原理的图像旋转方法[J];淮阴师范学院学报(自然科学版);2004年04期
2 李少芳;陈德礼;;数字图像旋转实现的探讨[J];计算机与现代化;2007年09期
3 李峰;;交互式、可控制图像旋转[J];电脑编程技巧与维护;2008年09期
4 赵琰;魏为民;;用于图像认证和窜改检测的稳健图像摘要[J];计算机应用研究;2011年05期
5 王滨海;许正飞;陈西广;张海龙;邵瑞雪;;图像旋转算法的分析与对比[J];光学与光电技术;2011年02期
6 陶德元,李舒平,周激流;消除图像旋转失真的方法[J];数据采集与处理;1991年04期
7 李伟青;图像旋转的快速显示技术[J];计算机应用研究;1994年03期
8 沈定刚,,戚飞虎;任意图像的主方向定位[J];上海交通大学学报;1995年04期
9 曹建;变换图像及与其它图像程序的结合使用技术[J];软件世界;1996年06期
10 丁宏庆;数字图像旋转的硬件实现[J];电子技术;1998年12期
相关会议论文 前4条
1 鲁传运;黄言平;季托;;图像旋转不变特征特性研究[A];第九届全国光电技术学术交流会论文集(下册)[C];2010年
2 唐振军;王朔中;魏为民;张新鹏;;利用分块相似系数构造感知图像Hash[A];第八届全国信息隐藏与多媒体安全学术大会湖南省计算机学会第十一届学术年会论文集[C];2009年
3 王彦锟;刘方;;一种快速稳健的图像旋转角度估计算法[A];计算机技术与应用进展·2007——全国第18届计算机技术与应用(CACIS)学术会议论文集[C];2007年
4 王炳健;楼红斌;卢刚;刘上乾;;多模光电图像配准算法性能评估[A];2011西部光子学学术会议论文摘要集[C];2011年
相关重要报纸文章 前3条
1 奇妙天堂;PowerPoint XP玩转图象轻松做[N];中国电脑教育报;2003年
2 晓峰;EPC图像转换专家:批量转换的得力助手[N];中国摄影报;2005年
3 小鸭;扫描一点通[N];电脑报;2001年
相关博士学位论文 前4条
1 谢博捚;图像特征表示的学习算法研究[D];北京交通大学;2016年
2 林春雨;图像/视频的多描述编码及传输[D];北京交通大学;2010年
3 高光勇;基于混沌和图像矩的鲁棒零水印技术研究[D];南京邮电大学;2012年
4 李长松;空间太阳望远镜稳像系统中图像相关器的研究[D];中国科学院研究生院(国家天文台);2008年
相关硕士学位论文 前10条
1 刘霞;基于尺度不变与视觉显著特征的图像感知哈希技术研究[D];西南大学;2015年
2 史力如;图像与思维及重叠图像式绘画的探索[D];天津美术学院;2015年
3 王开芳;照片/素描及跨年龄阶段异质人脸的识别研究[D];山东大学;2015年
4 董爱萍;小尺度图像旋转失真分析与矫正方法研究[D];大连海事大学;2015年
5 袁征帆;基于安卓的火车客票管理系统的设计与实现[D];南京大学;2014年
6 黄韵;基于词袋模型和词汇树的图像检索技术研究[D];西安电子科技大学;2014年
7 王东旭;基于快速检索的图像溯源软件平台[D];西安电子科技大学;2014年
8 孙洁;基于隐支持向量机模型的个性化图像推荐和检索[D];北京交通大学;2014年
9 宋宝林;基于图像特征的图像哈希算法及实现[D];山东师范大学;2014年
10 石晟;普通光照下叶片图像特征信息抽取[D];北京林业大学;2014年
本文编号:2183454
本文链接:https://www.wllwen.com/shoufeilunwen/xxkjbs/2183454.html