针对答题关键信息的汉英口语翻译题自动评分方法的研究
[Abstract]:The automatic score of oral English test has been a hot topic in the field of computer-assisted language learning. At present, most of the existing automatic scoring methods are aimed at reading questions, following reading questions and so on, while the automatic scoring methods for Chinese-English translation questions are relatively rare. When using manual scoring method to score Chinese-English translation questions, the grader mainly focuses on whether or not the key information required in the reference answers is included in the answer questions, in which the key information is generally reflected by the key words. Based on this, this paper simulates the manual scoring method, and studies the related questions about the automatic score of the Chinese-English oral translation questions for the key information of the answer question, including the speech signal preprocessing, the key information recognition and the extraction of the key information integrity feature. The pronunciation fluency calculation and fluency feature extraction, and the design of automatic scoring method combined with the key information integrity feature and fluency feature, etc. The main research work includes: (1) considering the difficulty of constructing accurate continuous speech recognition system in the absence of a large amount of annotated speech data, In this paper, an unsupervised keyword detection method based on dynamic time warping (DTW) algorithm is used to identify the keywords in the answer speech. Firstly, the validity of the method based on SLN-DTW (Segmental local-normalized-DTW) is verified in the TIMIT corpus, and the experimental results show that its performance is superior. Then combining with WordNet to construct the key word recognition library of answer questions, the SLN-DTW keyword detection method is applied to the key words recognition in the answer speech. The experimental results show that, The number of keywords detected by SLN-DTW keyword detection method can be used as an effective feature of the coverage of key information. (2) for the keywords detected, in order to obtain further confidence, A speech recognition method based on convolutional neural network (CNN) is proposed in this paper. Firstly, by constructing a speech recognition model based on CNN and processing speech feature parameters with mean warping algorithm, the recognition experiment is carried out on the Spoken Arabic Digit dataset of UCI machine learning library. The recognition rate of the experiment is better than that of other models on the same data set. The result of recognition experiment on the collected key word data set is better than that of the other two commonly used recognition models. The feasibility of using CNN speech recognition method is proved. (3) the key information integrity features are obtained by using the results of keyword detection and keyword recognition. Together with the fluency feature extracted from the speech level of the original answer question, it constitutes the automatic scoring feature of this paper. The automatic scoring model of this paper is constructed by regression analysis of all the features. The whole correlation between the machine score and the original score is 0.729, which proves that the feature extracted is effective for the machine score, and the performance of the scoring model is tested by the real test data, and the overall correlation between the machine score and the original score is 0.729, which proves that the extracted features are effective. It also verifies the effectiveness of the automatic scoring method for the key information of Chinese-English translation.
【学位授予单位】:广东外语外贸大学
【学位级别】:硕士
【学位授予年份】:2017
【分类号】:H315.9
【参考文献】
相关期刊论文 前10条
1 周飞燕;金林鹏;董军;;卷积神经网络研究综述[J];计算机学报;2017年06期
2 魏扬威;黄萱菁;;结合语言学特征和自编码器的英语作文自动评分[J];计算机系统应用;2017年01期
3 侯靖勇;谢磊;杨鹏;肖雄;梁祥智;徐海华;王磊;吕航;马斌;CHNG EngSiong;李海洲;;基于DTW的语音关键词检出[J];清华大学学报(自然科学版);2017年01期
4 徐海铭;金燕;王磊;;口译水平测评中的语言指标效度研究——以英语专业八级考试中的口译样本为例[J];外语测试与教学;2016年01期
5 张晴晴;刘勇;潘接林;颜永红;;基于卷积神经网络的连续语音识别[J];工程科学学报;2015年09期
6 杨鹏;谢磊;张艳宁;;低资源语言的无监督语音关键词检测技术综述[J];中国图象图形学报;2015年02期
7 杨柳燕;;中国口译学习者汉英交替传译流利度的探索性研究[J];浙江外国语学院学报;2015年01期
8 潘广源;柴伟;乔俊飞;;DBN网络的深度确定方法[J];控制与决策;2015年02期
9 刘建伟;刘媛;罗雄麟;;深度学习研究进展[J];计算机应用研究;2014年07期
10 刘建伟;刘媛;罗雄麟;;玻尔兹曼机研究进展[J];计算机研究与发展;2014年01期
相关博士学位论文 前1条
1 严可;发音质量自动评测技术研究[D];中国科学技术大学;2012年
相关硕士学位论文 前5条
1 李慧慧;基于深度学习的短语音说话人识别研究[D];郑州大学;2016年
2 陈嘉华;基于深度学习的英语语音识别与发音质量评价[D];广东外语外贸大学;2015年
3 梁静;基于深度学习的语音识别研究[D];北京邮电大学;2014年
4 黄文涛;基于神经网络的嵌入式语音识别系统研究[D];广东工业大学;2012年
5 严可;英文朗读题及复述题自动评测技术研究[D];中国科学技术大学;2009年
,本文编号:2374441
本文链接:https://www.wllwen.com/waiyulunwen/yingyulunwen/2374441.html