基于结构化概率模型学习的异构数据压缩编码

发布时间：2018-04-29 07:12

本文选题：结构化概率模型 + 推理学习算法　；参考：《上海交通大学》2014年博士论文

【摘要】：数据采集、网络服务和存储技术的发展逐渐产生大量具有复杂结构的异构数据。具有多种统计特性、复杂分布的异构数据压缩的关键在于捕捉其中复杂的结构依赖性。基于结构化概率模型学习利用概率图模型来表示异构数据的复杂结构，并进行模型推断和学习，以及由推理算法优化学习过程。通过优化学习，结构化概率模型能够根据异构数据的复杂结构，对预测问题进行同步的集合预测，能够产生少于逐个进行预测的信息量，所以它们更适合用于异构结构压缩中。本文首先根据异构数据的结构特点建立广义上下文模型，并在此基础上，将基于结构化概率模型学习应用于异构数据压缩中，如基因测序序列压缩、图像无损编码以及视频帧内编码等。首先，本文拓展了采用已预测子序列后缀的经典上下文，通过组合重排和多方向扩展，生成多个方向上已预测符号的任意组合作为上下文，并构建模型。对于具有多方向上已预测符号有限价组合的组合排列的上下文结构，广义上下文模型引入和建立模型树来表示和选取它们的结构，并使用归一化最大似然函数来估计最优上下文模型组合的结构和参数，及进一步对广义上下文模型类进行上下文筛选来排除冗余模型，最终获得最小描述长度准则下最优模型组合。基于所选最优模型组合，能够推导出用于预测的加权估计概率。广义上下文模型的模型冗余被证明为与数据大小无关。广义上下文模型充分考虑符号间依赖性，得出它们的预测概率，因此其实质上为结构化概率模型。将它应用在Calgary全集中异构数据以及可执行程序中时，基于最大似然估计预测的压缩性有所提升。因此可以预期基于结构化概率模型学习应用于异构数据压缩中会有更好的压缩性能。于是我们将基于结构化概率模型学习应用于三种异构数据：基因、图像和视频中。本文提出了一种基于结构化概率模型学习的基因测序序列压缩方案，通过分层预测结构来优化目标和参考序列的差值以用于小波编码。该方案为目标序列中的每个核苷酸片段找出最匹配的参考片段，通过为包括匹配位移和大小的各种边信息配置寻找具有最小编码代价的片段来获得。这样获得的差值序列适合于进行小波编码，因为其中符号主要为零。同时马尔可夫链被用来表示各片段边信息间依赖性，并在此约束下进行边信息预测。置信传播过程被用来获得这些预测，通过在马尔可夫链各结点间传播最可能预测来结构化地预测各结点对应边信息的边际概率，并随时进行更新。这种基于结构化概率模型学习的基因测序序列压缩方案能够通过目标和参考序列的差值有效利用它们间相关性，并同时利用边信息间的依赖性来预测概率分布减少额外编码开销，因此相对于参照算法能够明显提升编码性能。本文提出了一种基于结构化概率模型学习的图像无损编码方案，能够同时利用基于二维上下文的空间统计相关性进行最优预测，以及通过数据驱动的结构依赖性约束预测误差使之与码字估计真实分布一致。使用最大边际马尔可夫网络，通过局部结构一致性的联合约束来结构化地组合支持向量机，由此对整个相关区域进行最大边际估计。通过训练不同上下文环境下的模型参数，使得训练中各像素真实值与其它可能值之间的区别边际最大，从而得到基于上下文的最优预测。在可分解损失函数下随着充分采样，预测误差能够渐近逼近训练误差。自然测试图像无损编码的结果也验证了其性能。本文提出了一种基于结构化概率模型学习的视频帧内编码方案。基于结构化概率模型学习的预测与高性能视频编码进行集成，作为可选模式进行率失真优化，从而对整个亮度块同时预测得到最优率失真性能。并且使用最大边际马尔可夫网络约束整个亮度块基于最大边际估计的预测，使之符合二维离散余弦变换的真实概率分布，，所以拉普拉斯损失函数被用来衡量训练与预测中的损失。由于有损编码中残差通常符合拉普拉斯分布，在求解最优预测的过程，期望传播算法被用来以函数族逼近该真实分布。同时使用期望传播来进行信息传递与推断能够保证预测的全局最优性，从而避免了置信传播可能使迭代过程停止在局部最优点的问题。由于损失函数可分解，所以充分采样下预测误差渐近等价训练误差。实验结果证实本方案相较于高性能视频编码标准具有更好的率失真性能和重建图像视觉质量。
[Abstract]:The development of data acquisition , network service and storage technology has gradually generated a lot of heterogeneous data with complex structure . The key of heterogeneous data compression with multiple statistical characteristics and complex distribution is to capture the complex structure dependence . Based on the structural probability model learning , the complex structure of heterogeneous data is captured , and the learning process is optimized by inference algorithm . Based on the structural characteristics of heterogeneous data , the structured probability model can be used in heterogeneous structural compression . Based on the structural characteristics of heterogeneous data , the structure probability model can be applied to heterogeneous data compression , such as gene sequencing , image lossless coding and intra - frame coding .

First , this paper extends the classical context of using the suffix of predicted sub - sequence . By combining the rearrangement and the multi - direction extension , we generate arbitrary combination of the predicted symbols in multiple directions as the context and construct the model . The generalized context model is used to estimate the structure and parameters of the combination of the optimal context model by using the normalized maximum likelihood function , and the generalized context model class is used to estimate the optimal model combination . Based on the combination of the selected optimal model , the weighted estimation probability for the prediction can be derived . The model redundancy of the generalized context model is proved to be independent of the data size .

The generalized context model takes full account of inter - symbol dependencies to derive their predicted probabilities , and therefore is essentially a structured probability model . It is expected to improve the compression performance based on maximum likelihood estimation predictions when applied in heterogeneous data and executable programs . Therefore , it is expected that learning applications based on structured probability models will have better compression performance in heterogeneous data compression . We then apply the structured probability model learning to three heterogeneous data : genes , images , and videos .

This paper proposes a gene sequencing sequence compression scheme based on structured probability model learning . The difference between target and reference sequence is optimized by hierarchical prediction structure to obtain the most matching reference fragment . This scheme is used to find out the most matching reference fragment for each nucleotide segment in the target sequence .

This paper presents an image lossless coding scheme based on structured probability model learning , which can be used to predict the correlation of spatial statistics based on two - dimensional context .

This paper presents a video intra - frame coding scheme based on structured probability model learning . Based on the prediction of structured probability model learning and high performance video coding , the optimal rate distortion performance is predicted .

【学位授予单位】：上海交通大学
【学位级别】：博士
【学位授予年份】：2014
【分类号】：TN919.81

【共引文献】