能量受限条件下的手语视频编码方法研究
[Abstract]:Sign language is the most natural way for the deaf and mute to communicate with the visual language of expression, lip movement and other body potential expression. Different from head-shoulder video, sign language video is more complicated and more difficult to study because of the increase of hand shape and arm movement. Compared with the research of sign language video recognition and synthesis, the current coding research for sign language video is less, and most of them are rate-distortion (R-D) theory, and the relationship between coding rate and distortion is studied based on rate-distortion (R-D) theory, so that the distortion of reconstructed sign language video is minimized. However, with the rapid increase of wireless network bandwidth and the wide application of new generation video coding standard H.264, the restriction of coding rate has become weaker and stronger, while the limitation of wireless video terminal in power consumption is becoming stronger and stronger. Therefore, how to minimize the distortion of sign language video, reduce energy consumption and prolong battery renewal cycle has become an urgent problem under the condition of limited energy of wireless video terminal. This paper makes an in-depth study of sign language video coding under energy-limited conditions with the aim of realizing sign language video coding by using the visual selection attention mechanism of the deaf-mute, the power rate distortion theory and the energy distribution video coding method of the region of interest. the dynamic balance optimization between power consumption, coding code rate and coding distortion can reduce the overall power consumption of the wireless video terminal as much as possible while ensuring the subjective and objective coding quality of the sign language video, New theory and new method for optimizing parameter configuration and resource allocation for deaf-mute sign language video coding under energy-limited condition Methods: The research work of this thesis mainly comprises the following steps: (1) theoretical analysis and experiment statistics influence factors influencing the video coding complexity of H.264 sign language, and divides the parameters of the H.264 sign language video coder into four different levels according to the complexity, and then adaptively selects according to the energy of the battery and the complexity of the video motion of the wireless video terminal. The experiment results show that the method can reduce the computational complexity of the encoder and save the wireless video terminal system while ensuring the quality of the sign language video coding is basically unchanged. (2) the energy perception of the region of interest is established by comprehensively considering the imbalance of the energy of the wireless video terminal battery and the visual attention mechanism of the deaf-mute; the method comprises the following steps of: determining the reference frame number and the search element range according to the current available battery energy and the video frame complexity of the wireless video terminal according to the current available battery energy and the video frame complexity of the wireless video terminal; determining the macro block according to the visual importance of different macro block areas of the sign language video at the macro block layer; the measurement mode and the quantization coefficient are finally determined according to the frame layer and the macro block layer; The experimental results show that the method can reduce the computational complexity of the encoder and save the wireless video at the same time of guaranteeing the coding quality of the sign language video ROI. Power-Rate-Distance (P-R-D) characteristics of three coding modes of H. 264 frame, inter-frame and inter-frame coding modes are analyzed in detail. On this basis, the energy consumption model and P-R-D model of coded frame sign language video are respectively set up. An algorithm is used to optimize the number of macro blocks in frame, inter-frame and skip coding mode in one frame of video. The experiment results show that the proposed P-R-D model and reality The performance of P-R-D is matched. (4) The force field (Force F) is proposed for sign language video gesture detection under the shielding condition of hand face. The method comprises the following steps of: respectively calculating a force field image of a hand face shielding frame and a pure face frame, in that method, the gray component difference of each block histogram is obtain, and finally, the gray component difference of each block histogram is equal to that of each block histogram, The gray threshold is compared to obtain the hand position. The experiment proves that the method can be used in real time
【学位授予单位】:兰州理工大学
【学位级别】:博士
【学位授予年份】:2014
【分类号】:TN919.81
【参考文献】
相关期刊论文 前10条
1 刘鹏宇;何絮;贾克斌;;对特定模式进行预判的H.264帧间快速编码算法[J];兵工学报;2011年04期
2 崔玉斌;蔡安妮;;一种新颖的H.264帧内预测快速算法[J];北京邮电大学学报;2008年02期
3 韦耿;王亮;朱斌;;无线移动环境视频编码动态功耗模型研究[J];传感技术学报;2009年03期
4 张淑芳;李华;;基于H.264的多参考帧快速选择算法[J];电子学报;2009年01期
5 吴晓军;白世军;卢文涛;;基于H.264视频编码的运动估计算法优化[J];电子学报;2009年11期
6 周宇;陈熙霖;赵德斌;姚鸿勋;高文;;基于数据生成的手语识别自适应方法[J];高技术通讯;2009年12期
7 何书前;倪江群;石春;;一种分层判决结构的H.264/AVC快速帧间模式选择方法[J];电子学报;2013年11期
8 曹昕燕;赵继印;李敏;;基于肤色和运动检测技术的单目视觉手势分割[J];湖南大学学报(自然科学版);2011年01期
9 杨春玲;王华兴;;基于结构相似度的H.264快速运动估计算法[J];华南理工大学学报(自然科学版);2008年08期
10 张良国;高文;陈熙霖;陈益强;王春立;;面向中等词汇量的中国手语视觉识别系统[J];计算机研究与发展;2006年03期
相关博士学位论文 前2条
1 韦耿;视频编码功率率失真模型及低复杂度算法研究[D];华中科技大学;2007年
2 李斌;面向高性能视频编码标准的率失真优化技术研究[D];中国科学技术大学;2013年
,本文编号:2273345
本文链接:https://www.wllwen.com/kejilunwen/wltx/2273345.html