基于视频的人体目标跟踪与识别技术研究

发布时间:2018-03-28 08:51

  本文选题:计算机视觉 切入点:智能视频系统 出处:《电子科技大学》2016年博士论文


【摘要】:基于视频的目标跟踪与识别技术是计算机视觉的主要研究方向之一,是诸如智能监控、人机交互、地形导航及视频智能标注检索等应用的基础和关键技术,是实现“智慧城市”、“平安城市”的重要手段,具有重要的理论研究与实际应用价值。然而,自然非受控条件下获取的视频中,环境复杂多变,对其中各类目标跟踪识别带来诸多挑战。针对各种复杂场景及不同目标,如何设计实现效率高、鲁棒性好、实时性强的目标跟踪识别技术仍然是当今业界研究的热点及难点。鉴于此认识,本文立足于前人一系列优秀成果,启发于人类自身视觉系统完美的目标发现跟踪与识别机制,主要针对视频中人体目标的跟踪与识别问题展开了深入的研究,取得的创新性成果如下:1.针对目标样本稀缺,致使目标特征初始不充分,抑或目标原有特征易被遮挡、伪装,甚或因时光荏苒而逐渐灭失消亡,从而导致目标已知特征数据逐渐失效,如此种种,最终导致目标跟踪识别效率极低,提出了一种基于单样本的自主在线的目标特征学习及更新算法(One Sample Based Autonomous Online Features Learning and Updating Algorithm,FLUA)。算法首先基于目标单样本获取的局部特征,对视频中目标的相似视图区域进行识别定位,尔后根据视频帧之间静态特征点的值、分布及其运动一致性等多重匹配的校验,在线学习更新目标新特征,一旦目标新特征得到确认,立即作用于随后的目标跟踪识别及其特征学习更新过程。实验结果表明,本章提出的FLUA算法无需大量目标样本图像,无需事先大量训练,纵然在单样本情形下,特征学习获取效果依然显著,有效的提高了目标跟踪过程的效率。特征学习无需复杂的迭代求解过程,更新速度快,能够满足跟踪系统实时性要求。2.针对人体目标头部姿态抑或脸部表情等变化、脸部化妆抑或伪装等等严重影响目标人脸部图像的跟踪获取,从而对基于人脸的目标跟踪与识别等应用系统产生极为不利的影响,提出了一种基于人体模糊跟踪的人脸跟踪获取算法(Human Body Fuzzy Tracking Based Face Tracking and Capturing Algorithm,B-FTC)。算法首先根据目标身体各肢体部分特征及运动一致性匹配跟踪定位目标的身体,尔后根据目标头部与身体的位置及运动相关性定位获取目标的脸部图像。在对目标身体跟踪识别的同时,引入了在线特征学习更新机制以应对目标的外观特征的逐步变化。实验结果证明,该算法对目标头部姿态、镜头视角、脸部表情等等变化,以及脸部局部遮挡、化妆、伪装等等不利因素具有完全的鲁棒性,同时具有极好的脸部跟踪获取及归属分类效果,在自然监控视频及四川变脸表演视频中,对目标脸部图像的跟踪获取率都在90%以上,正确率几近达到100%。3.针对监控视频不同于生活摄影,其中人物脸部表情及头部动作较多自然变化,获取的脸部图像多以不同视角的‘表情碎片’形式存在,从而导致基于正面或近正面表情平静的人脸识别算法失效,本文提出了一套可以相当程度免疫于头部姿态、表情、光线等诸多变化以及部分遮挡等不利情形下,N:M的video-to-video人脸自动识别算法(Space and Expression Double Weighted based Video-to-Video Face Recognition,SEDW-2VFR)。算法首先根据脸部碎片特征点的值、分布的双重匹配及其运动变换的误差大小对基准视频中跟踪获取的目标脸部碎片图像进行区域及表情的双重分类,对基准目标的每一脸部图像类集进行特征投影矩阵的生成及特征的提取,而后对待测视频中跟踪获取的目标人脸部碎片进行在线分权2D-PCA识别。实验表明,该算法对头部姿态及表情等变化具有很强的鲁棒性,在自然条件下的生活视频中,目标跟踪识别率依然达到90%以上。4.针对大多数现有步态识别算法预设条件苛刻,其中步态表示、提取及比对过程复杂,计算量大,识别效果差,提出了一种基于肢体区域及步态周期双重区分的步态特征异步提取,同步分权融合的2D-PCA步态识别算法(Limbs and Gait Period Double Distinguished Feature Asynchronous Extraction and Synchronous Weighted Fusion Based Gait Recognition,FAESWF-GR)。算法首先对基准目标各肢体部分进行异步特征提取,并根据步态特征周期的长度进行归类和‘时间片’划分,而后采用2D-PCA算法对不同周期长度的步态特征‘时间片’子集进行特征投影矩阵生成及特征提取,然后对待识别目标肢体各部分进行在线的步态周期特征提取及时间片划分,同时进行同步与分权相融合的2D-PCA步态识别。实验表明,算法的步态特征异步提取机制具备了对视频中身体局部碎片图像进行特征提取的能力,从而使算法对身体的视角、姿态、焦距的变化及身体的局部遮挡等等都具有了极强的鲁棒性。另外,算法的同步与分权相融合的综合比对机制中肢体各部分特征权重可调,从而使算法能针对不同情形对肢体各部分特征赋予不同权重,极易体现肢体各部分在不同情形下步态的整体性及权重的差异性,很好适应自然条件下视频中目标人身体及其所处环境的各种复杂变化,取得较高识别率。
[Abstract]:Target tracking and recognition technology based on video is one of the main research direction of computer vision, such as intelligent monitoring, human-computer interaction, and based retrieval applications such as terrain navigation and intelligent video annotation, is the realization of "smart city", "an important means of safe city, has the important value of theoretical research and practical application however, non natural obtained under controlled conditions in the video, including all kinds of complicated environment, target tracking and Recognition brings many challenges. For a variety of complex scenes and different goals, how to design and realize the high efficiency, good robustness, real-time target tracking and recognition technology is still the hotspot and difficulty in the research of industry. In view of this understanding, this paper is based on previous a series of excellent achievements, inspired by the human visual system perfect target tracking and recognition mechanism found, to video Focuses on tracking and recognition problems in human target, innovative results obtained are as follows: 1. for the samples are scarce, resulting in the initial target feature is not sufficient, or the original features easily obscured, camouflage, or even because of loss of time flies and gradually die, which leads to target known feature data is invalid. So eventually, the target tracking and recognition efficiency is very low, this paper proposes a target feature of single sample independent online learning and updating algorithm based on (One Sample Based Autonomous Online Features Learning and Updating Algorithm, FLUA). The first algorithm based on local feature single sample acquisition, identification of similar video objects in the view area then, according to the video frame between the static feature point value, check the distribution and motion consistency of multiple matching, learning update in line New features, new features of the target once confirmed, immediately for a subsequent target tracking and recognition and feature learning update process. The experimental results show that the proposed FLUA algorithm does not need a large number of target images, without a lot of training, even in a single sample, obtain the feature learning effect is still significant, effectively improve the efficiency of the process of target tracking. The characteristics of learning without iteration process complex, update speed, can meet the requirements of real-time tracking system for.2. human head pose or facial expression changes, facial makeup or camouflage so seriously affected the facial image of the target tracking and acquisition, target tracking and recognition of face the application system based on an extremely negative impact, puts forward a fuzzy tracking human face tracking algorithm based on Human (Body Fuzzy for Track Ing Based Face Tracking and Capturing Algorithm, B-FTC). Firstly, according to the target body characteristics and motion of each limb matching tracking and positioning the target's body, then according to the position and moving target correlation between the head and body of the target face images. In the target tracking and recognition of the body at the same time, the characteristics of online learning update mechanism to gradually change the appearance characteristics on the target. The experimental results show that the algorithm of head pose, camera angle, facial expressions and so on, and the face is partially occluded, makeup, camouflage and so on unfavorable factors has robustness, while having excellent face tracking acquisition and classification effect in natural video surveillance and Sichuan face video performance, the target tracking facial image acquisition rate is above 90%, the correct rate almost reached for 100%.3. Video monitoring is different from life photography, in which characters face and head movements more natural changes, the face image acquisition exist in the different perspectives of the "expression fragments' form, which leads to positive or positive expression in the calm face recognition algorithm based on failure, this paper presents a considerable degree can be immune to the head pose and expression. Light changes and partial occlusions and many other adverse circumstances, video-to-video face automatic recognition algorithm of N:M (Space and Expression Double Weighted based Video-to-Video Face Recognition, SEDW-2VFR). The algorithm first face feature points according to the values of the fragments, double classification distribution of double matching and motion transform error size and the expression of the target region face image fragments get reference in the video, the reference target each face image feature set cast Generation and characteristics of shadow matrix, and then to be obtained in video target tracking facial fragments online decentralization 2D-PCA recognition. Experiments show that the algorithm has strong robustness to the head pose and facial expression, under natural conditions in the live video, target tracking and recognition rate is still more than 90%.4. for most of the existing gait recognition algorithm design conditions, including gait representation, extraction and alignment process is complex, large amount of calculation, poor recognition effect, put forward a step behavior and gait cycle of double limb asynchronous regional differentiation based gait recognition algorithm 2D-PCA extraction, synchronous fusion (Limbs and Gait power Period Double Distinguished Feature Asynchronous Extraction and Synchronous Weighted Fusion Based Gait Recognition, FAESWF-GR). The algorithm first has the reference target each part of the body Asynchronous feature extraction, and classification and the "time slice" divided according to the gait cycle length, and use the 2D-PCA algorithm to the gait feature "on different period length of time slice 'subset feature projection matrix generation and feature extraction, classification features and extract the gait cycle time slice and then treat each part of body target identification online. At the same time the 2D-PCA gait recognition combined with synchronous decentralization. Experimental results show that the mechanism of gait feature extraction algorithm of asynchronous have the ability of feature extraction for video image fragments of body parts on the body, so that the algorithm from the perspective of attitude, and the focal length of the local change of body occlusion has a strong robustness. In addition, the comprehensive comparison mechanism integration and decentralization of the synchronization algorithm of each part of body feature weight can be adjusted, so that the algorithm can not for sympathy The shape gives different weights to each part of the limbs. It can easily reflect the difference of gait's integrity and weight between different parts of the body, so it can adapt to all kinds of complex changes of the target's body and its environment under natural conditions, and achieve a high recognition rate.

【学位授予单位】:电子科技大学
【学位级别】:博士
【学位授予年份】:2016
【分类号】:TP391.41

【参考文献】

相关期刊论文 前3条

1 李丹;王洪涛;;基于粒子重采样滤波算法的红外图像消噪[J];红外技术;2014年03期

2 刘雪超;吴志勇;黄德天;梁敏华;;结合视觉感知的调焦窗口构建[J];中国激光;2014年01期

3 罗海波;史泽林;;红外成像制导技术发展现状与展望[J];红外与激光工程;2009年04期



本文编号:1675611

资料下载
论文发表

本文链接:https://www.wllwen.com/shoufeilunwen/xxkjbs/1675611.html


Copyright(c)文论论文网All Rights Reserved | 网站地图 |

版权申明:资料由用户15c0a***提供,本站仅收录摘要或目录,作者需要删除请E-mail邮箱bigeng88@qq.com