当前位置:主页 > 科技论文 > 自动化论文 >

基于深度学习的情绪感知系统的研究与设计

发布时间:2018-03-30 14:45

  本文选题:深度学习 切入点:情绪感知 出处:《电子科技大学》2017年硕士论文


【摘要】:情绪感知就是对人的情绪进行识别,是人工智能研究的重要方面。为了提高人机交互体验,让机器更好地理解人的情感,学术界从人的声音,表情,动作等方面展开了研究,其中从语音角度进行的情绪感知是本文的主要内容。深度学习是人工智能领域当前最热的领域,在语音识别,图像识别,自然语言处理方面都取得了显著的成果。而深度学习领域的飞速发展,也产生了一些比较有效的模型方法,如深度信念网络DBN,卷积神经网络CNN,循环神经网络RNN等等,如何利用深度学习方法在语音情绪感知方面提高情绪感知的准确率是一个新的研究问题。本文正是针对上述问题,以如何应用深度学习方法提高情绪感知准确率为研究对象,在对传统语音情绪感知的研究理论进行归纳总结的基础上,同时对深度学习领域的各种模型方法进行详尽的理论分析,使用tensorflow平台建立深度学习模型并且设计基于C/S的iOS移动端的语音情绪感知系统。主要工作如下:1.本文研究了情绪感知的传统方法,分析了传统情绪识别方法优缺点。传统情绪感知传统方法主要是使用手工特征提取,人工种类很多,最常用的是MFCC梅尔倒谱系数,但从语音识别领域近年来的成果来看,效果不如将音频转化为语谱图传入神经网络进行自动特征学习得到的训练结果好,本文在语音情绪感知中也引入了将语音转为语谱图输入,进行自动特征学习的方式。2.本文研究分析了深度学习的主流模型,分析了当前已有文著采用的深度学习方法,进一步提出XNN-SVM模型在语音情绪感知领域进行应用。笔者基于Tensorflow平台使用XNN-SVM模型建立了系统原型,并在此系统原型上进行若干对比实验,证明了模型的改进效果。3.本文设计实现一个基于C/S模式的双端识别语音情绪感知系统,既可以通过手机进行本地识别,同时可以通过服务器进行识别反馈,帮助改进模型。并且采集了300条语音情感数据进行系统测试,验证了该模型的工程实用性。
[Abstract]:Emotion perception is the recognition of human emotion, which is an important aspect of artificial intelligence research. In order to improve the human-computer interaction experience and make the machine understand human emotion better, the academic circles have carried out research from the aspects of human voice, expression, action and so on. The emotion perception from the perspective of speech is the main content of this paper. Deep learning is the hottest field in the field of artificial intelligence, in speech recognition, image recognition, The rapid development of deep learning has produced some effective modeling methods, such as deep belief network (DBN), convolutional neural network (CNN), cyclic neural network (RNN), and so on. It is a new research problem how to improve the accuracy of emotion perception in phonetic emotion perception by using the method of deep learning. In this paper, we focus on how to use the method of deep learning to improve the accuracy of emotion perception. On the basis of summing up the traditional theories of phonological emotion perception, and at the same time making a detailed theoretical analysis of various model methods in the field of in-depth learning. Using tensorflow platform to set up the model of deep learning and to design the voice emotion perception system of iOS mobile side based on C / S. The main work is as follows: 1.This paper studies the traditional methods of emotion perception. The advantages and disadvantages of traditional emotion recognition methods are analyzed. Traditional emotion perception methods mainly use manual feature extraction, there are many artificial types, the most commonly used is MFCC Mel cepstrum, but from the recent achievements in the field of speech recognition, The effect is not as good as the result of automatic feature learning by converting audio frequency into speech spectrum afferent neural network. In this paper, we also introduce speech into speech spectrum input in speech emotion perception. 2. This paper studies and analyzes the mainstream model of deep learning, and analyzes the methods of depth learning that have been used in current works. Furthermore, the application of XNN-SVM model in the field of phonological emotion perception is proposed. Based on the Tensorflow platform, the author uses XNN-SVM model to build the prototype of the system, and carries out some comparative experiments on the prototype of the system. 3. This paper designs and implements a two-terminal speech emotion sensing system based on C / S mode, which can be recognized locally by mobile phone, and can be recognized and feedback by the server at the same time. The model is improved and 300 speech emotion data are collected for system test, which verifies the engineering practicability of the model.
【学位授予单位】:电子科技大学
【学位级别】:硕士
【学位授予年份】:2017
【分类号】:TP18

【参考文献】

相关期刊论文 前5条

1 刘雨青;刘艳芳;;基于时空域转换的音频信号分析与识别[J];数码设计;2016年02期

2 邵兵;杜鹏飞;;基于卷积神经网络的语音情感识别方法[J];科技创新导报;2016年06期

3 韩文静;李海峰;阮华斌;马琳;;语音情感识别研究进展综述[J];软件学报;2014年01期

4 董建彬;马艳玲;;Mel频率倒谱系数的提取与改进[J];科技信息(科学教研);2008年15期

5 赵蕤,王作英;用于语音识别的基于频谱调整的信道自适应方法[J];清华大学学报(自然科学版);2005年04期

相关硕士学位论文 前1条

1 丁倩;基于语音信息的多特征情绪识别算法研究[D];山东大学;2015年



本文编号:1686373

资料下载
论文发表

本文链接:https://www.wllwen.com/kejilunwen/zidonghuakongzhilunwen/1686373.html


Copyright(c)文论论文网All Rights Reserved | 网站地图 |

版权申明:资料由用户8bacf***提供,本站仅收录摘要或目录,作者需要删除请E-mail邮箱bigeng88@qq.com