基于卷积神经网络的人体行为识别研究
[Abstract]:In recent years, with the introduction of high-definition video equipment, artificial intelligence based on behavior recognition technology has been rapidly developed in the field of intelligent safe city, smart home and military security. Because of its wide application prospect and economic value, behavior analysis and recognition technology has become a hotspot in the field of computer vision. The traditional behavior recognition algorithms are usually divided into three steps: motion foreground detection, feature extraction and training recognition. Although the recognition rate of this method is acceptable, its robustness is not high and the workload is enormous. In addition, many factors such as occlusion between targets, complex background and uncertain shooting angle in the actual scene result in difficulty or even invalidation of traditional methods. This paper aims to improve these problems in traditional behavior recognition methods by using convolution neural network (Convolutional Neural Networks,CNN) to improve the robustness of the algorithm and improve the accuracy of recognition as much as possible. Aiming at the disadvantage that background subtraction and inter-frame differential can not extract the complete foreground without too much motion amplitude, this paper proposes a human body silhouette extraction algorithm based on Gao Si differential (Difference of Gaussian,DoG image. In this method, two subtraction images of adjacent Gao Si scale space are used to construct differential images containing human contour information, and then binary enhancement and morphological processing are performed to obtain rough human silhouette images. In the second step, the threshold is used to scan and detect the rough body silhouette area of each line, and then the complete and accurate human body silhouette image is obtained after the operations such as blocking operation. In order to fuse the temporal information of the image sequence, the human body silhouette image is accumulated in the period, and the two-dimensional feature map is generated, which is sent into the CNN for training and recognition. Finally, the average accuracy rate of 85.3% is obtained on the KTH common data set by the experiments of network parameter adjustment and 50% discount cross-validation, which proves the feasibility of the recognition framework. In order to better deal with video data, researchers extend the convolution neural network to 3 D. In this paper, 3D CNN is used to carry out experiments and it is found that the best recognition effect can be obtained by combining "optical flow graph, frame difference graph and three frame difference map". The average accuracy is 92.0% on the KTH common data set after the experiments of network parameter adjustment and 50% discount cross-validation. Secondly, by analyzing the proportional distribution of the number of samples in the KTH data set and the corresponding accuracy, this paper proposes the use of secondary training. The oversampling strategy and the extended data set are three improved methods to prove that the uneven distribution of the data has an effect on the experimental results, and thus to improve the recognition rate. Finally, the three improved methods reach the average accuracy of 93.5% and 94.7% respectively, which provide a solution to the classification problem of small sample or unbalanced data set. In addition, the method of behavior recognition using 3DCNN not only reduces the workload of feature extraction, but also improves the robustness of the algorithm, that is, it improves the problems existing in the traditional recognition methods.
【学位授予单位】:中国科学技术大学
【学位级别】:硕士
【学位授予年份】:2017
【分类号】:TP391.41;TP183
【参考文献】
相关期刊论文 前8条
1 刘琮;许维胜;吴启迪;;时空域深度卷积神经网络及其在行为识别上的应用[J];计算机科学;2015年07期
2 黄凯奇;陈晓棠;康运锋;谭铁牛;;智能视频监控技术综述[J];计算机学报;2015年06期
3 陶新民;郝思媛;张冬雪;徐鹏;;不均衡数据分类算法的综述[J];重庆邮电大学学报(自然科学版);2013年01期
4 雷庆;李绍滋;;动作识别中局部时空特征的运动表示方法研究[J];计算机工程与应用;2010年34期
5 卜涛涛;卢超;;图像分割算法研究[J];电脑知识与技术;2010年08期
6 田国会;;家庭服务机器人研究前景广阔[J];国际学术动态;2007年01期
7 何卫华;李平;文玉梅;叶波;;复杂背景下基于图像融合的运动目标轮廓提取算法[J];计算机应用;2006年01期
8 范莹,郭成安;一种运动图像的检测与识别技术[J];大连理工大学学报;2004年01期
相关博士学位论文 前2条
1 曹鹏;不均衡数据分类方法的研究[D];东北大学;2014年
2 何卫华;人体行为识别关键技术研究[D];重庆大学;2012年
相关硕士学位论文 前7条
1 张永刚;基于多特征融合的行为识别算法研究[D];内蒙古大学;2016年
2 陆霖霖;基于改进ISA深度网络的人体行为识别研究与实现[D];电子科技大学;2016年
3 吴杰;基于卷积神经网络的行为识别研究[D];电子科技大学;2015年
4 费凡;智能视频监控中运动人体异常行为的自动检测与识别算法的研究与实现[D];南京邮电大学;2014年
5 郭劲智;视频图像行人检测方法研究[D];华南理工大学;2012年
6 覃耀辉;视频中的人体动作行为识别研究[D];电子科技大学;2011年
7 赵凤娟;行人异常智能视频监控系统研究与实现[D];电子科技大学;2011年
,本文编号:2274394
本文链接:https://www.wllwen.com/shoufeilunwen/xixikjs/2274394.html