Research on Video Object Extraction and Tracking Based on Low-Rank and Sparse Representation Models
[Abstract]: Video object extraction and tracking is a fundamental problem in computer vision and a core technology of intelligent video surveillance systems. Although research in this area has made remarkable progress, the complexity of data, scenes, and environments keeps it a very challenging topic. From the perspective of low-rank and sparse representation models, this thesis studies four problems: video object segmentation based on a regularized low-rank representation model, multi-modal moving object detection based on weighted low-rank decomposition, object tracking based on image patch representation and dynamic graph learning, and multi-modal object tracking based on a collaborative sparse representation model.
For video object extraction, a video object segmentation framework based on a regularized low-rank representation model is proposed. With supervoxels as graph nodes, a low-rank representation model is used to optimize the similarity relations among them, which effectively overcomes interference from large sparse noise and dense Gaussian noise. To improve the discriminability between supervoxels, the representation coefficient matrix is regularized, which gives the regularized representation model. Because video data are usually very large, an optimization algorithm based on sub-optimal low-rank decomposition is proposed to solve the model efficiently, with a theoretical convergence guarantee. A stream processing method is also proposed so that the segmentation method can handle arbitrarily long videos with limited computing and storage resources.
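The idea of refining supervoxel similarities with a low-rank representation can be illustrated in its simplest, noise-free form. The sketch below is not the regularized model of the thesis, which also handles sparse and Gaussian noise. It only assumes the standard noise-free problem min ||Z||_* subject to X = XZ, whose minimizer is Z = VV^T, where V holds the right singular vectors of X. The function name `lrr_affinity`, the tolerance, and the toy data are illustrative assumptions.

```python
import numpy as np

def lrr_affinity(X, tol=1e-8):
    """Affinity matrix from the closed-form solution of noise-free LRR.

    X is a (d, n) matrix with one feature vector per column (for example,
    one column per supervoxel). The minimizer of ||Z||_* subject to X = XZ
    is Z = V V^T, with X = U S V^T the skinny SVD of X.
    """
    _, s, vt = np.linalg.svd(X, full_matrices=False)
    rank = int(np.sum(s > tol * s[0]))   # numerical rank of X
    v = vt[:rank].T                      # (n, rank) right singular vectors
    z = v @ v.T                          # low-rank representation coefficients
    return (np.abs(z) + np.abs(z.T)) / 2 # symmetric, non-negative affinity

# Toy data: 5 points from each of two independent 2-D subspaces of R^10.
rng = np.random.default_rng(0)
basis_a = rng.standard_normal((10, 2))
basis_b = rng.standard_normal((10, 2))
X = np.hstack([basis_a @ rng.standard_normal((2, 5)),
               basis_b @ rng.standard_normal((2, 5))])

W = lrr_affinity(X)
within = W[:5, :5].max()   # strongest link inside the first group
cross = W[:5, 5:].max()    # strongest link between the two groups
```

For independent subspaces the affinity is block-diagonal, so `cross` is numerically zero while `within` is not. A graph built from `W` therefore links only points that belong to the same group.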
To verify its effectiveness, the optimized supervoxel similarity relations are applied to unsupervised and interactive video object segmentation tasks.
To cope with the complexity of scenes and environments, a general framework for multi-modal moving object detection based on weighted low-rank decomposition is proposed. Because visible-spectrum information is degraded by complex scenes, illumination, and haze, thermal infrared information is introduced to complement it. Specifically, a quality weight is introduced for each modality, and the low-rank structure of the background, a sparse foreground template shared across modalities, and continuity constraints on foreground and background pixels are modeled jointly, so that multi-modal data are fused adaptively and moving objects are detected robustly. To improve detection efficiency while maintaining accuracy, an efficient algorithm based on edge-preserving filtering is proposed, which brings the running speed close to real time. In addition, a multi-modal moving object detection platform containing 25 video pairs is constructed, which addresses the lack of a standard evaluation system in this field and supports research in related fields.
For object tracking, a patch-based dynamic graph learning method is proposed to address model drift in the tracking-by-detection framework. First, the tracking bounding box is divided into non-overlapping image patches, and each patch is assigned a weight that represents its importance to the object.
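The patch layout described above, together with the conventional 8-neighborhood graph over the patches, can be sketched as follows. The grid size, the box coordinates, the uniform initial weights, and the helper names `split_into_patches` and `eight_neighbor_adjacency` are assumptions made for illustration. The abstract does not specify them.

```python
def split_into_patches(box, rows, cols):
    """Split a box (x, y, w, h) into rows * cols non-overlapping patches."""
    x, y, w, h = box
    xs = [x + (w * c) // cols for c in range(cols + 1)]  # vertical cut positions
    ys = [y + (h * r) // rows for r in range(rows + 1)]  # horizontal cut positions
    return [(xs[c], ys[r], xs[c + 1] - xs[c], ys[r + 1] - ys[r])
            for r in range(rows) for c in range(cols)]

def eight_neighbor_adjacency(rows, cols):
    """Boolean adjacency matrix linking each patch to its up to 8 grid neighbors."""
    n = rows * cols
    adj = [[False] * n for _ in range(n)]
    for r in range(rows):
        for c in range(cols):
            for dr in (-1, 0, 1):
                for dc in (-1, 0, 1):
                    rr, cc = r + dr, c + dc
                    if (dr or dc) and 0 <= rr < rows and 0 <= cc < cols:
                        adj[r * cols + c][rr * cols + cc] = True
    return adj

box = (10, 20, 64, 48)                        # hypothetical tracking box
patches = split_into_patches(box, rows=3, cols=4)
weights = [1.0 / len(patches)] * len(patches) # uniform starting weights (assumption)
adjacency = eight_neighbor_adjacency(3, 4)
```

In this fixed graph a corner patch has 3 neighbors and an interior patch has 8, regardless of image content.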
Because the traditional 8-neighborhood graph ignores the global structure of the graph and local linear relationships, the graph structure is learned dynamically with the image patches as graph nodes, using a global low-rank structure, sparse local linear relationships, and non-negative edge weights; meanwhile, the patch weight vector is optimized in a semi-supervised manner. Second, to improve tracking speed, a real-time optimization algorithm is proposed to solve the model. Finally, the optimized weight vector is embedded into object tracking and model updating, which greatly improves tracking performance.
To overcome the challenges of scene and environment complexity, a multi-modal object tracking method based on a collaborative sparse representation model is proposed. Traditional multi-modal tracking methods treat every modality equally, so the final tracking result suffers when the information from one modality is highly ambiguous. Robust tracking is therefore achieved by fusing the modalities adaptively, that is, by introducing a quality weight for each modality into the sparse representation model. Specifically, the weight of each modality is determined by its reconstruction error and by its discriminability between target and background, and it is optimized jointly with the sparse representation coefficients. In addition, because this problem lacks a standard evaluation platform, a standard multi-modal evaluation platform is constructed, comprising 50 matched video pairs, 22 baseline methods, and two evaluation metrics. The platform provides a standard evaluation system for this problem and related fields, which supports further research in the area.
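The role of the per-modality quality weight can be illustrated with a deliberately simplified sketch. It codes each modality separately with ISTA (a standard solver for the lasso problem) and then derives a weight from the reconstruction error alone. The thesis instead optimizes weights and coefficients jointly and also accounts for target-background discriminability, so this is not its algorithm. The weighting rule `exp(-error / sigma)`, the parameter values, and the synthetic data are assumptions.

```python
import numpy as np

def ista_lasso(D, y, lam=0.01, n_iter=300):
    """Minimize 0.5 * ||y - D c||^2 + lam * ||c||_1 with ISTA."""
    step = 1.0 / np.linalg.norm(D, 2) ** 2  # 1 / Lipschitz constant of the gradient
    c = np.zeros(D.shape[1])
    for _ in range(n_iter):
        c = c - step * (D.T @ (D @ c - y))                        # gradient step
        c = np.sign(c) * np.maximum(np.abs(c) - step * lam, 0.0)  # soft threshold
    return c

def modality_weights(dicts, obs, sigma=1.0):
    """Sparse-code each modality, then weight it by its reconstruction error."""
    errors = np.array([np.sum((y - D @ ista_lasso(D, y)) ** 2)
                       for D, y in zip(dicts, obs)])
    w = np.exp(-(errors - errors.min()) / sigma)  # hypothetical weighting rule
    return w / w.sum(), errors

# Synthetic example: 10 templates per modality, 50-dimensional features.
rng = np.random.default_rng(0)
c_true = np.zeros(10)
c_true[[1, 4, 7]] = 1.0
dicts, obs = [], []
for noise_std in (0.01, 1.0):  # modality 0 is reliable, modality 1 is heavily corrupted
    D = rng.standard_normal((50, 10))
    D /= np.linalg.norm(D, axis=0)  # unit-norm template columns
    dicts.append(D)
    obs.append(D @ c_true + noise_std * rng.standard_normal(50))

weights, errors = modality_weights(dicts, obs)
```

The corrupted modality reconstructs poorly from its templates, so it receives the smaller weight and contributes less to any fused score built from `weights`.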
[Degree-granting institution]: Anhui University
[Degree level]: Doctoral
[Year conferred]: 2016
[Classification number]: TP391.41
Article ID: 2267027
Article link: https://www.wllwen.com/shoufeilunwen/xxkjbs/2267027.html