文本无关的多说话人确认研究
[Abstract]:In recent years, in the field of biometrics, speaker recognition has attracted more and more attention because of its unique advantages of security, economy and accuracy, and has gradually become an important way of identity verification in people's lives and work. It has broad market prospects. This paper begins with the system framework of speaker verification, and then introduces each part of the system in detail. Then, aiming at the speaker verification under complex conditions, it focuses on feature extraction, speaker segmentation, model building and other technologies. The main research work and innovation of this paper are as follows: 1. Based on the GMM-UBM speaker verification system as the baseline system of this paper, the related factors affecting the performance of the system are studied and analyzed, including Gaussian mixture, training speech length, scoring regularization technology, and verified by experiments. 2. In feature extraction, in order to improve the performance of speaker verification system in noisy environment, this paper proposes a method to improve the performance of the system. A multi-window spectral subtraction MFCC feature with strong noise robustness is proposed. The multi-window spectral subtraction MFCC is an improvement on the existing multi-window spectral MFCC (Multitaper MFCC), which combines the multi-window spectral estimation technique with the spectral subtraction method. The simulation results show that when the test speech contains additive noise, it is better than the multi-window spectral MFCC extraction algorithm. The speaker verification system using multi-window spectral subtraction MFCC achieves good results in EER with equal error rate and minDCF with minimum detection cost function. In order to improve the segmentation speed and accuracy at the same time, this paper first proposes a three-step segmentation strategy to implement the BIC speaker segmentation algorithm. The experimental results show that the improved segmentation algorithm has a great improvement in segmentation speed and accuracy. 4. In the aspect of model building, I-vector speaker modeling technology is explored and studied, especially the extraction process of I-vector and the construction of I-vector based speaker. The speaker recognition system is analyzed and compared with the speaker verification system based on GMM-UBM.
【学位授予单位】:电子科技大学
【学位级别】:硕士
【学位授予年份】:2014
【分类号】:TN912.34
【共引文献】
相关期刊论文 前10条
1 贺前华;王志锋;Alexander I Rudnicky;朱铮宇;李新超;;基于改进PNCC特征和两步区分性训练的录音设备识别方法[J];电子学报;2014年01期
2 黄奋;马皓;邓菁;;说话人识别技术在社保系统中的远程身份认证应用研究[J];电子技术与软件工程;2014年02期
3 马勇;鲍长春;;基于稀疏神经网络的说话人分割[J];北京工业大学学报;2015年05期
4 李晋;郭武;戴礼荣;;联合因子分析算法中基于信号子空间的空间变换方法[J];模式识别与人工智能;2013年08期
5 杨栋;周秀玲;郭平;;基于贝叶斯通用背景模型的图像标注[J];自动化学报;2013年10期
6 祝太锋;;基于动态反馈负载均衡算法的改进[J];湖南农机;2013年11期
7 骆启帆;章坚武;吴震东;;一种基于MFCC与韵律特征的说话人确认方法[J];杭州电子科技大学学报;2013年05期
8 陈丽萍;王尔玉;戴礼荣;宋彦;;基于深层置信网络的说话人信息提取方法[J];模式识别与人工智能;2013年12期
9 廖晓锋;范修斌;姜青山;;基于协方差的高斯混合模型参数学习算法[J];计算机科学;2013年S2期
10 郭心语;何晓丰;宫学庆;张蓉;周傲英;;一种基于曝光量和点击率的用户组优化策略[J];计算机研究与发展;2013年S1期
相关会议论文 前6条
1 骆启帆;章坚武;吴震东;;一种基于MFCC与韵律特征的说话人确认方法[A];浙江省电子学会2013学术年会论文集[C];2013年
2 尹聪;白静;龚[,
本文编号:2206917
本文链接:https://www.wllwen.com/kejilunwen/wltx/2206917.html