声带振动发音过程机理研究与仿真
发布时间:2018-10-26 17:59
【摘要】:语音作为人类交流的重要方式之一,越来越多地被深入探究和扩展,而对人类发声原理的研究是促使语音技术进一步发展的基础,此研究广泛应用于语音识别、语音合成和相关医学诊断等领域。对发声过程进行建模和仿真,对发声机理的深入研究有重要的作用。对声带振动模型的研究能够形象地描述声带振动特性,通过深入这种研究,有助于对人体发声过程的进一步研究和实际应用。本文从发声的生理过程出发,通过对国内外已有的有关语音发声模型的调研分析,对语音发声模型展开了细致研究和仿真。首先介绍发音系统的各个子系统的生理结构、功能特性,并具体分析了声带振动发音过程的原理。在此理论之上,对双质体激励源模型和多管级联的声道模型进行了分析和建模。先根据声带的分层结构,引出双质体激励源模型,针对发音过程中的声道特性,给出声管级联的声道模型。然后利用电声类比的方法为声道传输特性建模。对这两个模型,使用Java编程进行了计算机编程仿真。使用差分方程来对模型进行了离散化,然后通过仿真程序合成出若干浊音信号,给出合成浊音信号的频谱图、时域波形图,并以表格的形式列出了频域共振峰,在表格中与典型值进行了对比,从主观的听觉、客观的共振峰数值验证了仿真的有效性。另一方面,通过仿真程序进行多个发音参数的调整(包括声带紧张度、肺部气压、声门初始面积、声道长度等),实现了对不同发音特点的语音信号的有效仿真合成。
[Abstract]:As one of the important ways of human communication, speech is more and more deeply explored and expanded. The research on the principle of human phonation is the basis of the further development of speech technology, which is widely used in speech recognition. Speech synthesis and related medical diagnostics. Modeling and simulation of vocal process plays an important role in the study of sound mechanism. The study of vocal cord vibration model can describe the vibration characteristics of vocal cord vividly. The further study will be helpful to the further study and practical application of human vocal process. Based on the physiological process of phonation and through the investigation and analysis of the existing phonetic models at home and abroad, this paper makes a detailed study and simulation of the phonetic models. This paper first introduces the physiological structure and functional characteristics of each subsystem of the pronunciation system, and analyzes the principle of the vocal cord vibration pronunciation process in detail. Based on this theory, the dual body excitation source model and the multi-tube cascade sound channel model are analyzed and modeled. According to the layered structure of the vocal cord, the dual body excitation source model is derived, and the sound channel model of the sound tube cascade is given according to the characteristics of the vocal channel in the process of pronunciation. Then the acoustic analogy is used to model the transmission characteristics of the channel. For these two models, the computer programming simulation is carried out by using Java programming. The model is discretized by difference equation, and then some turbidimetric signals are synthesized by simulation program. The spectrum diagram and time domain waveform of the synthesized turbidimetric signals are given, and the resonance peaks in frequency domain are listed in the form of tables. Compared with the typical values in the table, the validity of the simulation is verified by the subjective auditory and objective resonance peak values. On the other hand, the speech signal with different pronunciation characteristics is effectively synthesized by adjusting several phonetic parameters (including vocal cord tension, lung pressure, glottis initial area, vocal channel length, etc.) through the simulation program.
【学位授予单位】:青岛大学
【学位级别】:硕士
【学位授予年份】:2017
【分类号】:TN912.3
本文编号:2296522
[Abstract]:As one of the important ways of human communication, speech is more and more deeply explored and expanded. The research on the principle of human phonation is the basis of the further development of speech technology, which is widely used in speech recognition. Speech synthesis and related medical diagnostics. Modeling and simulation of vocal process plays an important role in the study of sound mechanism. The study of vocal cord vibration model can describe the vibration characteristics of vocal cord vividly. The further study will be helpful to the further study and practical application of human vocal process. Based on the physiological process of phonation and through the investigation and analysis of the existing phonetic models at home and abroad, this paper makes a detailed study and simulation of the phonetic models. This paper first introduces the physiological structure and functional characteristics of each subsystem of the pronunciation system, and analyzes the principle of the vocal cord vibration pronunciation process in detail. Based on this theory, the dual body excitation source model and the multi-tube cascade sound channel model are analyzed and modeled. According to the layered structure of the vocal cord, the dual body excitation source model is derived, and the sound channel model of the sound tube cascade is given according to the characteristics of the vocal channel in the process of pronunciation. Then the acoustic analogy is used to model the transmission characteristics of the channel. For these two models, the computer programming simulation is carried out by using Java programming. The model is discretized by difference equation, and then some turbidimetric signals are synthesized by simulation program. The spectrum diagram and time domain waveform of the synthesized turbidimetric signals are given, and the resonance peaks in frequency domain are listed in the form of tables. Compared with the typical values in the table, the validity of the simulation is verified by the subjective auditory and objective resonance peak values. On the other hand, the speech signal with different pronunciation characteristics is effectively synthesized by adjusting several phonetic parameters (including vocal cord tension, lung pressure, glottis initial area, vocal channel length, etc.) through the simulation program.
【学位授予单位】:青岛大学
【学位级别】:硕士
【学位授予年份】:2017
【分类号】:TN912.3
【参考文献】
相关期刊论文 前5条
1 康晓莉;曹辉;;声带双弹簧对称振动模型研究[J];陕西师范大学学报(自然科学版);2011年06期
2 蒋景英,虞启琏,邱庆军,徐可欣;声带振动的二质量块-有限元模型[J];生物医学工程学杂志;2005年02期
3 贾雷鸣,李迎新,张伟,黄勇旺;声带振动波形分析方法的研究[J];天津医科大学学报;2001年03期
4 郑义,蒋刚毅,张礼和,,蔡金海;基于声带振动模型和声门波的嘶音研究[J];声学学报;1996年06期
5 张礼和,程启明;嘶音的三质量块声带模型分析方法[J];中国生物医学工程学报;1991年04期
相关博士学位论文 前2条
1 邵骏;声带振动动态特征的定量研究[D];复旦大学;2009年
2 吴永忠;基于物理模型的声音合成技术理论研究[D];合肥工业大学;2003年
相关硕士学位论文 前1条
1 潘欣裕;汉语耳语音特征分析与应用研究[D];苏州大学;2007年
本文编号:2296522
本文链接:https://www.wllwen.com/kejilunwen/xinxigongchenglunwen/2296522.html