基于FPGA的数字音频内容监测系统的研究与设计
发布时间:2019-03-29 14:46
【摘要】:随着科技的进步和发展,广播电视技术也在不断进步,广播电视节目的数量也增加很多。传统的监测方式受到极大的挑战,因此需要新型自动监测系统出现来适应广播电视技术的发展。同时现在市面上在使用的音频监测系统主要是基于文本库的监测比对方式,,这种监测方式受限于文本库的大小和内容,不能普遍使用。因此,基于内容的音频监测技术随之得到广泛的研究。这要求不同于以前文本库的方式,需要提取能反映音频信号特点的特征参数信息,使得系统能有更强的适应能力。同时基于内容的音频监测技术是使用实时监测的方法,能够快速给出监测结果。 本文研究的是基于内容的音频监测问题。研究的目的是监测是否有播放内容的变化,主要涉及播出节目和广告两个方面。这样既保护了广播电视制作方的知识产权,同时也让老百姓能看到安全放心的广播电视节目。系统使用了FPGA芯片,使得最后做出的系统体积小,而且便于系统更新升级。 本文根据监测系统的要求,详细的分析了语音信号的主要特征参数,主要介绍了语音信号的时域特征参数,最终根据实际使用情况选择了短时平均过零率和短时平均幅度电平两个特征参数共同作为监测系统的特征参数。其中这两个特征参数都是基于语音信号的短时特性,这就要求在特征数据提取之前必须要先对语音信号做加窗分帧预处理。考虑到作为电视伴音监测系统,于是选择了视频信号中的ODD信号作为分帧信号,这样能使监测比对的两路信号做到粗同步的要求。这两个特征参数的数据量也很小,便于远端提取出来的特征数据通过网络传输到本地监测端。在经过很多不同的传输路径之后,两路信号在时间上已经不同步了,所以信号的同步对齐是该系统需要重要解决的问题。基于对延迟信息的分析,提出了搜索跟踪技术来解决信号的对齐问题,即在本地端加固定整数帧和小数帧延迟去逼近远端的特征数据。为了加快搜索跟踪的速度,使用了照射体制,即使用连续多帧本地特征数据跟远端特征数据同时进行比较。最后对系统的各个模块做了仿真验证。
[Abstract]:With the progress and development of science and technology, radio and television technology is also progressing, the number of radio and television programs is also increasing a lot. Traditional monitoring methods are greatly challenged, so a new automatic monitoring system is needed to adapt to the development of radio and television technology. At the same time, the audio monitoring system in use on the market is mainly based on the text library of the monitoring and comparison mode, this monitoring mode is limited by the size and content of the text library, can not be widely used. Therefore, content-based audio monitoring technology has been widely studied. This requires that the feature parameter information which can reflect the characteristics of audio signal should be extracted, which is different from that of the previous text library, so that the system can have stronger adaptability. At the same time, the content-based audio monitoring technology is a real-time monitoring method, which can give the monitoring results quickly. In this paper, the content-based audio monitoring problem is studied. The purpose of the study is to monitor whether there is a change in broadcast content, mainly involving broadcast programs and advertising. This not only protects the intellectual property rights of radio and television producers, but also allows ordinary people to see safe and secure radio and television programs. The FPGA chip is used in the system, which makes the final system small and easy to update and upgrade. According to the requirement of monitoring system, the main characteristic parameters of speech signal are analyzed in detail, and the time domain characteristic parameters of speech signal are mainly introduced. Finally, two characteristic parameters, short-time average zero-crossing rate and short-time average amplitude level, are selected as the characteristic parameters of the monitoring system. These two feature parameters are based on the short-term characteristics of the speech signal, which requires that the speech signal must be pre-processed by windowing and framing before the feature data is extracted. Considering that it is a TV sound monitoring system, the ODD signal in the video signal is chosen as the framing signal, so that the two signals of the monitoring comparison can be coarsely synchronized. The data amount of these two feature parameters is also very small, so it is convenient to transfer the feature data extracted from the remote end to the local monitoring end through the network. After many different transmission paths, the two signals are out of synchronization in time, so the synchronization of the signals is an important problem to be solved in the system. Based on the analysis of delay information, a search-tracking technique is proposed to solve the problem of signal alignment, in which fixed integer frames and decimal frames are added to the local end to approximate the characteristic data of the remote end. In order to accelerate the speed of searching and tracking, an irradiating system is used, even if the continuous multi-frame local feature data is compared with the remote feature data at the same time. Finally, each module of the system is simulated and verified.
【学位授予单位】:电子科技大学
【学位级别】:硕士
【学位授予年份】:2013
【分类号】:TN941.1
本文编号:2449648
[Abstract]:With the progress and development of science and technology, radio and television technology is also progressing, the number of radio and television programs is also increasing a lot. Traditional monitoring methods are greatly challenged, so a new automatic monitoring system is needed to adapt to the development of radio and television technology. At the same time, the audio monitoring system in use on the market is mainly based on the text library of the monitoring and comparison mode, this monitoring mode is limited by the size and content of the text library, can not be widely used. Therefore, content-based audio monitoring technology has been widely studied. This requires that the feature parameter information which can reflect the characteristics of audio signal should be extracted, which is different from that of the previous text library, so that the system can have stronger adaptability. At the same time, the content-based audio monitoring technology is a real-time monitoring method, which can give the monitoring results quickly. In this paper, the content-based audio monitoring problem is studied. The purpose of the study is to monitor whether there is a change in broadcast content, mainly involving broadcast programs and advertising. This not only protects the intellectual property rights of radio and television producers, but also allows ordinary people to see safe and secure radio and television programs. The FPGA chip is used in the system, which makes the final system small and easy to update and upgrade. According to the requirement of monitoring system, the main characteristic parameters of speech signal are analyzed in detail, and the time domain characteristic parameters of speech signal are mainly introduced. Finally, two characteristic parameters, short-time average zero-crossing rate and short-time average amplitude level, are selected as the characteristic parameters of the monitoring system. These two feature parameters are based on the short-term characteristics of the speech signal, which requires that the speech signal must be pre-processed by windowing and framing before the feature data is extracted. Considering that it is a TV sound monitoring system, the ODD signal in the video signal is chosen as the framing signal, so that the two signals of the monitoring comparison can be coarsely synchronized. The data amount of these two feature parameters is also very small, so it is convenient to transfer the feature data extracted from the remote end to the local monitoring end through the network. After many different transmission paths, the two signals are out of synchronization in time, so the synchronization of the signals is an important problem to be solved in the system. Based on the analysis of delay information, a search-tracking technique is proposed to solve the problem of signal alignment, in which fixed integer frames and decimal frames are added to the local end to approximate the characteristic data of the remote end. In order to accelerate the speed of searching and tracking, an irradiating system is used, even if the continuous multi-frame local feature data is compared with the remote feature data at the same time. Finally, each module of the system is simulated and verified.
【学位授予单位】:电子科技大学
【学位级别】:硕士
【学位授予年份】:2013
【分类号】:TN941.1
【参考文献】
相关期刊论文 前9条
1 郭利刚;方土富;;浅析广播电视广告节目监测技术[J];中国传媒科技;2007年01期
2 梁达先;;一种射频视音频中断监测器[J];电视技术;2008年06期
3 崔文冲;;应用于广播电视音频监测系统的技术分析[J];电视技术;2011年22期
4 李涛;电视广告计算机监审管理系统的研制与应用[J];广播与电视技术;2005年01期
5 陈月昌;肖蓉;武爱敏;;数字音频监测系统设计及应用[J];广播与电视技术;2007年05期
6 崔冬青,李治柱,吴亚栋;一种噪声环境下连续语音识别的快速端点检测算法[J];计算机工程与应用;2003年23期
7 黄知义,周宁;基于内容视频检索的关键技术研究[J];现代情报;2005年10期
8 李国辉,李恒峰;基于内容的音频检索:概念和方法[J];小型微型计算机系统;2000年11期
9 马彬;浅谈有线电视播出信号实时监测系统的运用[J];有线电视技术;2005年18期
相关硕士学位论文 前1条
1 周庆华;基于嵌入式Linux说话人识别系统的研究[D];西华大学;2009年
本文编号:2449648
本文链接:https://www.wllwen.com/falvlunwen/zhishichanquanfa/2449648.html