移动背景下视觉的行人检测、识别与跟踪技术研究
发布时间:2019-05-18 06:47
【摘要】:近些年来,基于计算机视觉的行人检测与跟踪技术得到了突飞猛进的发展,已有大量的应用出现在视频场景监控,目标行为分析,机器人控制,人机接口,智能交通等领域。一些比较热门而且尚处于发展阶段的应用包括汽车辅助驾驶、自主车系统、无人驾驶汽车等。由于面对越来越复杂的外部应用环境,特别是对于快速变化的环境背景,纯粹基于计算机视觉的目标识别技术正面临越来越大的挑战。就目前而言,依然缺乏一种鲁棒、精确并且快速的检测与跟踪算法。以行人检测为例,行人存在非常大的类内变化,特别是衣着、光照的显著变化,加上人体姿态和运动的随意性,以及人与人与环境之间的相互影响,和类人物体的干扰,使得当前各类行人检测算子,离复杂环境下的实际需求均还有一定差距。而对于应用范围更广的目标跟踪技术,静态背景下的相关研究已经比较成熟,而对于移动变化的背景,目前的研究还非常不足。本文的主要贡献包括以下几个方面:1)研究了单帧图像的快速行人检测算法。随着基于视觉的行人检测算法的研究深入,进一步提高行人检测算子的性能已面临非常大的困难。同时,复杂的算法带来了更高的计算复杂度,严重影响了检测系统的时效性。我们考虑在单帧图像条件下,通过适当的改进使算法在不降低时效性的情况下,进一步提高其鲁棒性。主要研究了基于Adaboost+Chn Ftrs的快速行人检测算法,通过选择不同的特征组合得到了相对最优的算子。此外,还在Adaboost的学习过程中采用了特征查找表LUT算法,大大提高了训练速度,并提高了训练数据的样本容量。2)对不同在线分类算法与目标特征模型进行了跟踪性能的测试,并提出基于Fern分类算法的在线目标颜色模型,和结构模型,经过测试取得了更好的跟踪效果。针对常用的特征,如颜色特征、结构特征、模板等,结合不同的在线分类算法建立在线目标分类模型,其中颜色特征采用基于超像素颜色直方图的SPT算法;结构特征采用基于压缩特征的CT算法,对于CT,本文也提出采用基于多通道图像的压缩特征,可改善算法效果;模板匹配采用最近邻分类器(NNClassifier)和TLD算法,通过测试指出了各种目标特征在线分类模型的适应场景和各自的不足。在此基础上,本文提出了采用Fern算法结合颜色超像素特征,和多通道空间小块特征,实现了更好的跟踪效果,可以较好地适应包括存在各种遮挡、光照变化、目标姿态变化、目标尺度变化,和背景多样变化的复杂场景。3)考虑目标跟踪的一般情况,为了更好地适应目标尺度的变化,提出采用基于粒子群优化的粒子滤波作为跟踪滤波算法,实验证明,粒子群能显著提高粒子滤波的滤波性能,并对跟踪整体性能的提高有很大作用。4)提出了一种移动背景和复杂场景下的针对行人等目标的相对实时鲁棒的跟踪算法,可处理包括目标姿态、尺度变化,光照变化,遮挡和干扰等各种跟踪场景。算法采用多元特征在线目标分类模型,结合基于粒子群优化的粒子滤波算法进行跟踪。其多元特征模型包含Fern颜色模型、Fern结构模型、以及CT自适应结构模型,各特征模型之间具有很强的互补性,综合后可达到相当不错的跟踪效果。此外,在该算法中引入基于直接模板匹配的NNClassifier作为监督模型,在不增加计算复杂度的情况下利用模板的慢自适应性抵制跟踪漂移。我们还提出了如何解决光照变化问题的方法(其中颜色模型对光照变化特别敏感),使得算法可以处理复杂光照变化条件下的目标跟踪问题。
[Abstract]:In recent years, the technology of pedestrian detection and tracking based on computer vision has been developed by leaps and bounds, and a large number of applications appear in the fields of video scene monitoring, target behavior analysis, robot control, man-machine interface, intelligent traffic and the like. Some of the most popular and developing applications include auto-assisted driving, autonomous vehicle systems, driverless cars, and more. Because of the increasingly complex external application environment, especially for rapidly changing environment, the target recognition technology based on computer vision is facing more and more challenges. At present, a robust, accurate and fast detection and tracking algorithm is still lacking. By taking the pedestrian detection as an example, the pedestrian has a very large intra-class change, in particular the remarkable change of the clothes and the illumination, the random nature of the human body posture and the movement, the mutual influence between the human and the environment, and the interference of the human-like object, so that the present various pedestrian detection operators, There is still a gap in the actual demand in a complex environment. For the target tracking technology with a wider application range, the related research in the static background has become more mature, and for the background of the moving change, the current research is still very low. The main contribution of this paper includes the following aspects:1) The fast pedestrian detection algorithm for single-frame image is studied. With the study of the visual-based pedestrian detection algorithm, it is very difficult to further improve the performance of the pedestrian detection operator. At the same time, the complicated algorithm brings higher computational complexity, which seriously affects the time-effectiveness of the detection system. We consider that, under the condition of single-frame image, the robustness of the algorithm is further improved by appropriate improvement so that the algorithm can not reduce the time-effectiveness. The fast pedestrian detection algorithm based on Adaboost + Chn Ftrs is mainly studied, and the relative optimal operator is obtained by selecting different feature combinations. In addition, the characteristic look-up table LUT algorithm is adopted in the learning process of the Adaboost, the training speed is greatly improved, and the sample capacity of the training data is improved. In addition, the online target color model and the structure model based on the Fern classification algorithm are put forward, and a better tracking effect is obtained through the test. Aiming at the common characteristics, such as color characteristics, structural features, templates and the like, an on-line target classification model is established in combination with different on-line classification algorithms, wherein the color characteristics adopt the SPT algorithm based on the hyper-pixel color histogram; the structure characteristic adopts a CT algorithm based on the compression feature, In this paper, a multi-channel image-based compression feature is proposed, which can improve the algorithm effect; the template matching adopts the nearest neighbor classifier (NNNN) and the TLD algorithm, and the adaptive scene and the respective deficiency of the on-line classification model of various target features are pointed out through the test. On the basis of this, this paper proposes that the Fn algorithm is used to combine the color super-pixel features and the multi-channel space tile features, and the better tracking effect can be realized, which can be better adapted to include various occlusion, light change, target attitude change, target scale change, and in order to better adapt to the change of the target scale, a particle filter based on the particle swarm optimization is adopted as a tracking filtering algorithm, And a relative real-time robust tracking algorithm for pedestrian and the like in a moving background and a complex scene is proposed, and various tracking scenes including target pose, scale change, illumination change, occlusion and interference can be processed. In this paper, a multi-element on-line target classification model is used to track the particle swarm optimization based on particle swarm optimization. The multivariate characteristic model contains the Fern color model, the Fern structure model, and the CT self-adaptive structure model. In addition, the NNNClassifier based on the direct template matching is introduced as the monitoring model in the algorithm, and the tracking drift is resisted by the slow self-adaptability of the template without increasing the computational complexity. We also put forward a method to solve the problem of illumination change (in which the color model is particularly sensitive to light change), so that the algorithm can deal with the problem of target tracking under the condition of complex illumination.
【学位授予单位】:中国科学院研究生院(上海技术物理研究所)
【学位级别】:博士
【学位授予年份】:2015
【分类号】:TP391.41
本文编号:2479766
[Abstract]:In recent years, the technology of pedestrian detection and tracking based on computer vision has been developed by leaps and bounds, and a large number of applications appear in the fields of video scene monitoring, target behavior analysis, robot control, man-machine interface, intelligent traffic and the like. Some of the most popular and developing applications include auto-assisted driving, autonomous vehicle systems, driverless cars, and more. Because of the increasingly complex external application environment, especially for rapidly changing environment, the target recognition technology based on computer vision is facing more and more challenges. At present, a robust, accurate and fast detection and tracking algorithm is still lacking. By taking the pedestrian detection as an example, the pedestrian has a very large intra-class change, in particular the remarkable change of the clothes and the illumination, the random nature of the human body posture and the movement, the mutual influence between the human and the environment, and the interference of the human-like object, so that the present various pedestrian detection operators, There is still a gap in the actual demand in a complex environment. For the target tracking technology with a wider application range, the related research in the static background has become more mature, and for the background of the moving change, the current research is still very low. The main contribution of this paper includes the following aspects:1) The fast pedestrian detection algorithm for single-frame image is studied. With the study of the visual-based pedestrian detection algorithm, it is very difficult to further improve the performance of the pedestrian detection operator. At the same time, the complicated algorithm brings higher computational complexity, which seriously affects the time-effectiveness of the detection system. We consider that, under the condition of single-frame image, the robustness of the algorithm is further improved by appropriate improvement so that the algorithm can not reduce the time-effectiveness. The fast pedestrian detection algorithm based on Adaboost + Chn Ftrs is mainly studied, and the relative optimal operator is obtained by selecting different feature combinations. In addition, the characteristic look-up table LUT algorithm is adopted in the learning process of the Adaboost, the training speed is greatly improved, and the sample capacity of the training data is improved. In addition, the online target color model and the structure model based on the Fern classification algorithm are put forward, and a better tracking effect is obtained through the test. Aiming at the common characteristics, such as color characteristics, structural features, templates and the like, an on-line target classification model is established in combination with different on-line classification algorithms, wherein the color characteristics adopt the SPT algorithm based on the hyper-pixel color histogram; the structure characteristic adopts a CT algorithm based on the compression feature, In this paper, a multi-channel image-based compression feature is proposed, which can improve the algorithm effect; the template matching adopts the nearest neighbor classifier (NNNN) and the TLD algorithm, and the adaptive scene and the respective deficiency of the on-line classification model of various target features are pointed out through the test. On the basis of this, this paper proposes that the Fn algorithm is used to combine the color super-pixel features and the multi-channel space tile features, and the better tracking effect can be realized, which can be better adapted to include various occlusion, light change, target attitude change, target scale change, and in order to better adapt to the change of the target scale, a particle filter based on the particle swarm optimization is adopted as a tracking filtering algorithm, And a relative real-time robust tracking algorithm for pedestrian and the like in a moving background and a complex scene is proposed, and various tracking scenes including target pose, scale change, illumination change, occlusion and interference can be processed. In this paper, a multi-element on-line target classification model is used to track the particle swarm optimization based on particle swarm optimization. The multivariate characteristic model contains the Fern color model, the Fern structure model, and the CT self-adaptive structure model. In addition, the NNNClassifier based on the direct template matching is introduced as the monitoring model in the algorithm, and the tracking drift is resisted by the slow self-adaptability of the template without increasing the computational complexity. We also put forward a method to solve the problem of illumination change (in which the color model is particularly sensitive to light change), so that the algorithm can deal with the problem of target tracking under the condition of complex illumination.
【学位授予单位】:中国科学院研究生院(上海技术物理研究所)
【学位级别】:博士
【学位授予年份】:2015
【分类号】:TP391.41
【参考文献】
相关期刊论文 前5条
1 范保杰;朱琳琳;崔书平;李向军;唐延东;;旋翼无人机视觉跟踪系统[J];红外与激光工程;2011年01期
2 刘足华;熊惠霖;;基于随机森林的目标检测与定位[J];计算机工程;2012年13期
3 钱慧佳;杨明;李榚;王春香;;一种基于激光雷达和视觉的行人检测方法[J];上海交通大学学报;2010年07期
4 李兵;刘磊;魏志强;;一种具有强实时性、强鲁棒性的图像匹配算法[J];软件学报;2014年07期
5 李娟;邵春福;杨励雅;李琦;;基于Kalman滤波的行人跟踪方法研究[J];交通运输系统工程与信息;2009年06期
,本文编号:2479766
本文链接:https://www.wllwen.com/shoufeilunwen/xxkjbs/2479766.html