基于卷积神经网络的场景分类的研究与应用
[Abstract]:Scene classification is one of the important research directions in the field of image processing. With the development of computer technology and the Internet, a large number of image data flow into people's lives and work. In the face of such huge image information, traditional scene classification methods and techniques show a lot of shortcomings. In recent years, convolutional neural network (Convolutional Neural Network,CNN) has made many breakthroughs in the field of image processing. It extracts image features directly from image pixels by simulating the learning process of human brain. The feature extraction and classifier are combined into a learning framework to classify and recognize the related objects. In addition, the local connection, weight sharing and down-sampling of convolutional neural networks greatly reduce the training parameters of the network, simplify the network model, and further improve the training efficiency of the network. Aiming at the complex variability of scene image and the weak generalization ability of traditional scene classification methods, this paper combines convolution neural network method to classify scene. The classification performance of convolutional neural networks is mainly determined by the hierarchical structure of the network. Therefore, the factors influencing the classification performance of convolutional neural networks are studied in this paper, based on which a convolution neural network model is designed. Applied to scene classification. The specific work is as follows: 1. Aiming at the problem of how to select hierarchical structure in the model of convolution neural network applied in scene classification design, a shallow convolution neural network model is designed in this paper, which is applied to the task of scene image classification in Scene-15 dataset and SUN-397 dataset. The effects of different size and number of convolution kernels, different activation functions and different sampling methods on the classification performance of convolution neural networks are studied. It is shown that the classification performance of convolutional neural networks can be improved by using smaller convolutional kernels and more kernel numbers, maximum sampling and ReLU activation function. 2. In order to better meet the requirements of the actual scene image, this paper improves the neural network model based on the above research, and designs an 8-layer convolution neural network. The convolution layer of the network uses smaller convolution cores and increases the number of convolution cores which can extract more image features and improve classification performance. At the same time, the maximum sampling method and the ReLU activation function are used in the sampling layer. In this paper, the improved convolution neural network model is compared with AlexNet model and VGGNet model on Scene-15 data set and SUN-397 data set. The experimental results show that the model has good classification effect in scene classification. In this paper, the structure design and parameter optimization of convolution neural network are carried out by using MatConvNet toolbox on MATLAB software. The factors influencing the classification performance of convolution neural network are analyzed, and the convolution neural network model is designed. Applied to scene classification. A large number of experiments show that the network model has good classification performance and generalization ability in scene classification.
【学位授予单位】:兰州理工大学
【学位级别】:硕士
【学位授予年份】:2017
【分类号】:TP391.41;TP183
【参考文献】
相关期刊论文 前7条
1 顾广华;韩晰瑛;陈春霞;赵耀;;图像场景语义分类研究进展综述[J];系统工程与电子技术;2016年04期
2 李学龙;史建华;董永生;陶大程;;场景图像分类技术综述[J];中国科学:信息科学;2015年07期
3 肖保良;;基于Gist特征与PHOG特征融合的多类场景分类[J];中北大学学报(自然科学版);2014年06期
4 赵理君;唐娉;霍连志;郑柯;;图像场景分类中视觉词包模型方法综述[J];中国图象图形学报;2014年03期
5 ZHOU Li;HU DeWen;ZHOU ZongTan;;Scene recognition combining structural and textural features[J];Science China(Information Sciences);2013年07期
6 杨昭;高隽;谢昭;吴克伟;;局部Gist特征匹配核的场景分类[J];中国图象图形学报;2013年03期
7 吴伟仁;王大轶;邢琰;龚小谨;刘济林;;月球车巡视探测的双目视觉里程算法与实验研究[J];中国科学:信息科学;2011年12期
相关硕士学位论文 前4条
1 吴正文;卷积神经网络在图像分类中的应用研究[D];电子科技大学;2015年
2 刘静;基于特征组合的图像场景分类[D];湘潭大学;2015年
3 周圣云;基于视觉感知的室内场景识别与理解[D];电子科技大学;2015年
4 李飞腾;卷积神经网络及其应用[D];大连理工大学;2014年
,本文编号:2295584
本文链接:https://www.wllwen.com/kejilunwen/zidonghuakongzhilunwen/2295584.html