鼻辅音感知线索研究
发布时间:2018-10-24 20:27
【摘要】:语音识别系统的性能受许多因素的影响,如不同的说话人、说话方式、环境噪音等。为了提高系统的识别率和稳定性,一种重要的解决方法是寻找更好的、高强健性的基于人耳听觉感知特性的感知线索。基于此,三维深度研究方法(3DDS)被发明,用来探究语音信号在人耳内部的感知线索,并已成功的运用于对摩擦音和爆破音的感知线索识别。本文将这种方法拓展到鼻辅音的感知线索研究。在三个感知实验结果分析的基础上,定义了冗余感知线索和次要感知线索,并找到了/m/的感知线索是大约位于363~1250 Hz的语音部分,/n/的感知线索是大约位于939~2826 Hz的语音部分。
[Abstract]:The performance of speech recognition systems is affected by many factors, such as different speakers, speech styles, environmental noise and so on. In order to improve the recognition rate and stability of the system, an important solution is to find better and more robust cues based on human auditory perception. Based on this, a 3D depth study method (3DDS) was developed to explore the perceptual cues of speech signals in the human ear, and has been successfully applied to recognize the perceptual cues of frictional and explosive sounds. This paper extends this method to the study of perceptual cues of nasal consonants. Based on the analysis of the results of three perceptual experiments, the redundant perceptual cues and the secondary perceptual cues are defined, and the / m / perceptual cues are found to be the phonetic part located at about 363n / 1250 Hz, and the / n/ perception cues are about 939 / 2826 Hz.
【作者单位】: 电子科技大学电子工程学院;伊利诺伊大学厄巴拿香槟分校电子计算机工程系;
【基金】:美国National Institute of Health(Grant No.R21-RDC009277A)
【分类号】:TN912.34
本文编号:2292464
[Abstract]:The performance of speech recognition systems is affected by many factors, such as different speakers, speech styles, environmental noise and so on. In order to improve the recognition rate and stability of the system, an important solution is to find better and more robust cues based on human auditory perception. Based on this, a 3D depth study method (3DDS) was developed to explore the perceptual cues of speech signals in the human ear, and has been successfully applied to recognize the perceptual cues of frictional and explosive sounds. This paper extends this method to the study of perceptual cues of nasal consonants. Based on the analysis of the results of three perceptual experiments, the redundant perceptual cues and the secondary perceptual cues are defined, and the / m / perceptual cues are found to be the phonetic part located at about 363n / 1250 Hz, and the / n/ perception cues are about 939 / 2826 Hz.
【作者单位】: 电子科技大学电子工程学院;伊利诺伊大学厄巴拿香槟分校电子计算机工程系;
【基金】:美国National Institute of Health(Grant No.R21-RDC009277A)
【分类号】:TN912.34
【相似文献】
中国重要报纸全文数据库 前2条
1 贵阳市乌当中学 万朝炯;浅谈前后鼻韵母辨正的教学[N];贵州民族报;2010年
2 何广见;取人名应兼顾语音美[N];语言文字周报;2007年
中国硕士学位论文全文数据库 前1条
1 钱虹;汉藏语系鼻辅音的类型及历史演变[D];安徽师范大学;2011年
,本文编号:2292464
本文链接:https://www.wllwen.com/shoufeilunwen/xxkjbs/2292464.html