误差恢复视频压缩中的高级可伸缩编码和运动估计
[Abstract]:At present, we are in an era of high information development, and we will encounter a great deal of multimedia content data in our daily life, especially the pictures and video information to be transmitted through the network. The demand for rich media on the Internet and wireless networks is growing rapidly, driving these rich-media communication and entertainment services, not only for enhanced broadband access, but also strong media coding techniques to make the transmission more efficient. Some video coding standards, such as the iso/ iec mpeg series and the itu-t video coding standard, have been developed to significantly reduce the data rate. Most of these video compression methods use a block-based discrete cosine transform (dct) with motion compensation to eliminate spatial and temporal redundancy. In the video coding technology designed for network transmission, the two main problems are: the first is that the performance of any network system is the best to deliver the data, but it can't guarantee the reliability of the network sex. video data, compared to other data types, have a larger amount of data, so the network's limited transmission bandwidth, low processor power consumption, and available storage space may limit its propagation energy Force. For video applications, high transmission errors bring additional water, such as time delay, complexity, and product quality. Retransmission is an effective way to address network transmission errors, but it introduces a network-attached load that may not be suitable for requiring low latency with. Its main purpose is to protect the video data and to hide or restore the number of videos in the possible errors According to. Heterogeneity in large-area networks is another question of limiting video applications problem. Different types of networks have different bandwidth and flow negative The heterogeneous video network requires the provision of video services with variable quality and is capable of automatically and accurately meeting these requirements Please. The most critical part of video compression is the transport motion estimation. Motion estimation is a motion vector The process. These vectors determine the amount of transport generated in the previous frame to compensate for the predicted frame The real-time implementation of the algorithm is very important to the real-time realization of the algorithm. The motion estimation algorithm can be divided into time domain algorithm and frequency. Domain algorithm. The matching algorithm and the gradient-based algorithm are the weight of the time-domain algorithm. The matching algorithm can be divided into a block matching algorithm and a characteristic piece. The gradient-based algorithm can be divided into pixel recursion and block delivery. the method comprises the following steps of: applying phase correlation, wavelet domain matching and DCT domain matching in a frequency domain algorithm methods. gradient techniques are commonly used for image-to-image processing, The analysis of the sequence. The pixel recursive technique, as a subset of the gradient technique, is applied in the image sequence coding where the best match search is based on pixel-by-pixel on the basis of pixel-based technology requires very high computational complexity, discomfort, In-time application, the frequency-domain technique is the relation between the dependence and the transfer coefficient of the shift image, and it is not widely used in the image finally, the block matching technique, based on the idea of minimizing the particular cost function, becomes the most widely used method in the coding application, block-matched motion estimation in a variety of motion estimation algorithms is the most important method. To minimize the search time in a block match, a simple and effective calculation The block matching motion estimation (BMME) is the most popular and practical in the video coding The motion estimation method of the H.26X standard series and the MPEG standard series the bmme method is used. block matching is a related technique that looks for candidate images of a particular area in the current image block and the reference frame the best match between the blocks. The block matching process uses at least two frame pictures, that is, reference frames and current frames. the current frame is decomposed into individual macro blocks, the motion is estimated at each macro, a motion estimation algorithm finds the macro module to be encoded on the reference frame for the current frame the most matched macro-module, once the best-matched macro-module is found, the difference or the prediction error between the best-matched macro-module and the current macro-module is calculated, and then the DCT transformation is carried out, Quantization and run-length coding. In addition to coding differences between different macro blocks, the relative displacement between the two macro blocks The vector will also be encoded. In this paper, we first discuss various block-based fast motion estimation algorithms, which are based on the search speed and computational complexity. These algorithms are evaluated. The best performance will be The algorithms are carefully analyzed. These algorithms include exhaustive search or full search (FS: Full Search), three-step search (TSS: Three Step Search), new three-step search (NTSS: New Three Step Search), four-step search (4SS: Four Step Search), diamond search (DS: Diamond Search), and adaptive cross-mode search (ARPS: Adaptive Good Patte) (r n Search). Secondly, we put forward the new ARPS The dynamic adaptive cross search algorithm. It uses the spatial correlation between the adjacent blocks, so we use ARPS _ S The ARPS _ S is based on the assumption that the distribution of the motion vector is not only related to the predicted motion vector height, but also has a high degree of correlation in both the vertical and horizontal directions This constitutes a cross-form. The module around the module of interest, the maximum and minimum of the MV, can be considered to be the estimated deviation of the predicted MV, so that they can be used as an accurate estimate of the length of the arm, indicating the phase The dynamic range of motion in the direction. In contrast to ARPS, in ARPS The four arms in the _ S are not equal. The initial search point for ARPS is 5, and ARPS The number of initial search points for _ S is 6. In our lab, ARPS _ S is searching for speed and video The quality is better than the ARPS. In the end, the paper will discuss the use of the scalable the scalable video coding and decoding technique refers to a user encoding a video sequence into a plurality of bit streams, so as to support the various quality levels of the decoding end. The two types of scalable error recovery coding techniques are described and evaluated in this paper: layered coding and decoding (LC: Layered Coding) and multi-description codec (MDC: Multiple Descr) the properties of the compressed video bitstream are such that video difference recovery techniques have a great importance. for example, the error of a single bit in the vlc encoded video data may result in a loss of synchronization between the encoder and the decoder, a loss of a plurality of video blocks is further caused by the loss of a plurality of bit errors, which often occur in the case of a burst channel error or packet loss, which may result in partial or full video frames, The loss of the time-domain dimension is caused by the loss of the time-domain dimension. The direct result of using motion compensation techniques when using motion compensation techniques. Error recovery and scalability are apparent The scalable video coding and decoding technique refers to the fact that the user encodes a video sequence into a number of bits The stream, thus supporting the various quality levels of the decoding end. The scalability is in some acceptable information A good robustness is provided in the event of a loss. At the same time, it does not bring too much to the decoding The problem does not seriously affect the visual quality. The layered codec (LC: Layered Coding) and the multi-description codec (MDC: Multiple Descriptions Coding) are video transmission Two types of scalable coding techniques. Robust video coding and decoding techniques are limiting the propagation and enhancement of errors It plays an important role in the visual quality. The robust video coding and decoding can be achieved by simultaneously designing a reasonable result and maintaining the acceptable redundancy at the minimum complexity The invention can effectively solve the problem of error concealment, In several layers, each layer has a different importance to fidelity. The layer is also called a base layer and the base layer may be independently encoded. sometimes called the enhancement layer, their decoding depends on the base layer. The quality of the video at the base layer is the lowest, with the quality of the video will be improved with the enhancement of the enhancement layer. In the case of congestion, the network that supports the layered service the network first transmits a base layer packet for decoding the most important base layer packet. the layered video encoding method is first proposed to be used to combat the at least one of the at least one of the at least one of the at least one of The packet loss in the m network is lost and the robustness of the transmission is improved. the main error correction and the scalable coding method. this layered coding is also applied to the multicast in some ip With, for example, the internet multicast backbone. The entire bit stream in the MDC (description is equally important). The layered coding is often associated with unequal error protection (UEP: Unfair Error Protection), which in turn is the most important in the transmission the data, that is, the base layer data, provides a higher degree of protection. Nevertheless, if the base layer is lost (e.g., due to a server crash or a connection failure) or a large number of errors are received, The additional information in the enhancement layer is hardly useful in the nature of the structure. The frequency sequence is compressed into several bit streams with the same importance. Each bit stream (also called Description) Independent decoding, and they can be enhanced with each other. When the receiver the reconstructed video quality is higher when more description is received. thus, The parallel scalability is naturally occurring in multi-description coding. A part of this article This is how to generate a bit stream in the LC and MDC. Each frame first passes through D ct transforms and then quantized and zag-coded. in the layered coding, the most important dct coefficients (the first ten systems the number) is assigned to the base layer and the remaining allocated to the enhancement layer. hi the multi-description encoding,6 the 4 dct coefficients are equally divided into odd and even two parts, The simulation results show that the MDC scene is better than the LC scene. the method can obviously improve the robustness of the real-time video application. in the mdc coding, since all the received information is useful in the case of an error, the problem of layered coding in the best-effort network is avoided, so that the best-effort packet transmission network
【学位授予单位】:北京邮电大学
【学位级别】:博士
【学位授予年份】:2014
【分类号】:TN919.81
【相似文献】
相关期刊论文 前10条
1 李应兴;;基于子块运动估计补偿的视频误码块掩饰[J];微计算机信息;2006年36期
2 冯峗;方宗德;金晟毅;;基于统计学理论的参数模型运动估计方法[J];计算机工程与应用;2007年09期
3 戴卫恒,程宏煌,姚u&u&;一种基于云模型的运动估计快速算法[J];电视技术;2001年09期
4 洪波,余松煜;基于对象的菱形搜索运动估计方法[J];数据采集与处理;2001年01期
5 杨晓辉,李中科,吴乐南;模型基辅助编码中实时运动估计的自适应方法[J];信号处理;2002年06期
6 娄东升;一种新的运动估计与运动补偿算法[J];北京广播学院学报(自然科学版);2003年02期
7 陈良琴,陈新;基于提升小波变换域运动估计的序列图像压缩方法[J];陕西科技大学学报;2004年06期
8 齐兵;王群生;杨春玲;;一种运动估计快速算法的研究与实现[J];通信技术;2006年S1期
9 李志欣;李建华;侯建党;;一种改进的运动估计新算法[J];计算机工程与应用;2007年18期
10 刘彦辉;贾俊玲;张颜艳;;一种自适应的六边形-方形运动估计搜索算法[J];广东通信技术;2009年07期
相关会议论文 前10条
1 周露平;陈宗海;王海波;;运动估计中的不确定性分析[A];2007系统仿真技术及其应用学术会议论文集[C];2007年
2 孙明利;吴一全;;基于改进的粒子群算法的块匹配运动估计方法[A];2008通信理论与技术新发展——第十三届全国青年通信学术会议论文集(下)[C];2008年
3 邹晓春;冯燕;赵歆波;;一种快速的块匹配运动估计新算法[A];中国航空学会信号与信息处理专业全国第八届学术会议论文集[C];2004年
4 郭翌;汪源源;侯涛;;基于运动估计和非局部平均的超声心动图滤波[A];中国仪器仪表学会第十二届青年学术会议论文集[C];2010年
5 欧阳国胜;罗永伦;;一种用于视频编码运动估计的新算法[A];2006中国西部青年通信学术会议论文集[C];2006年
6 邹晓春;赵歆波;冯燕;;图像序列分析综述[A];信号与信息处理技术第三届信号与信息处理全国联合学术会议论文集[C];2004年
7 魏津瑜;孙静静;李欣;代中华;;基于运动估计的动态夜视图像的上色算法[A];2011年中国智能自动化学术会议论文集(第一分册)[C];2011年
8 高韬;于明;;基于冗余小波变换的运动估计及DSP实现[A];第十三届全国图象图形学学术会议论文集[C];2006年
9 李振亚;宋建斌;李波;;一种采用混合搜索模式的H.264运动估计快速算法[A];第四届和谐人机环境联合学术会议论文集[C];2008年
10 鲁小兵;肖创柏;;H.264运动估计搜索窗口的动态调整算法[A];图像图形技术研究与应用2009——第四届图像图形技术与应用学术会议论文集[C];2009年
相关重要报纸文章 前1条
1 田力;准确“锁定”交通肇事车辆[N];人民公安报;2010年
相关博士学位论文 前10条
1 陈运必;高性能运动估计的架构设计与优化的研究[D];中国科学技术大学;2011年
2 纪中伟;先进的运动估计与运动补偿算法在数字视频处理中的应用[D];电子科技大学;2002年
3 王镇道;视频压缩的运动估计与小波方法研究[D];湖南大学;2008年
4 魏伟;视频压缩编码的运动估计与补偿技术[D];天津大学;2008年
5 许晓中;视频编码标准中运动估计技术研究[D];清华大学;2009年
6 于雪松;基于单目无标记点的人体3D运动估计关键技术的研究[D];哈尔滨工业大学;2009年
7 刘新春;面向MPEG-4的视频分割算法研究[D];中国科学院电子学研究所;2000年
8 朱向军;视频运动对象分割与先进运动估计/运动补偿算法之研究[D];浙江大学;2006年
9 向东;基于H.264框架的运动估计和变换研究[D];华中科技大学;2006年
10 郑兆青;用于H.264视频编码的运动估计VLSI结构研究[D];华中科技大学;2007年
相关硕士学位论文 前10条
1 邹晓春;基于快速块匹配的图象序列运动估计技术研究[D];西北工业大学;2005年
2 吴庆伟;运动估计方法研究与序列图像的相关性分析[D];华中科技大学;2005年
3 叶学兵;视频压缩中运动估计的研究[D];北京化工大学;2005年
4 魏伟;基于可变形块匹配的运动估计与补偿[D];天津大学;2006年
5 田胜军;基于块匹配算法的运动估计研究[D];电子科技大学;2006年
6 王平;基于粒子群的视频运动估计算法研究与优化[D];电子科技大学;2009年
7 陈良琴;视频压缩系统运动估计技术研究[D];福州大学;2005年
8 张益林;运动估计匹配标准的抗噪声研究[D];上海交通大学;2009年
9 龚源泉;视频运动估计与噪声抑制滤波部件的设计[D];浙江大学;2005年
10 丁锐;用于运动估计的高效三步法的硬件设计与仿真[D];湖南大学;2006年
,本文编号:2502041
本文链接:https://www.wllwen.com/kejilunwen/wltx/2502041.html