卷积神经网络特征重要性分析及增强特征选择模型

发布时间：2018-06-21 14:30

本文选题：卷积神经网络 + 特征重要性分析　；参考：《软件学报》2017年11期

【摘要】：卷积神经网络等深度神经网络凭借着其强大的表达能力、突出的分类性能,已在不同领域内得到了广泛应用.当面对高维特征时,深度神经网络通常被认为具有较好的鲁棒性,能够隐含地对特征进行选择,但由于网络参数巨大,如果数据量达不到足够的规模,则会导致学习不充分,因而可能无法达到最优的特征选择.而神经网络的黑箱特性使得无法观测神经网络选择了哪些特征,也无法评估其特征选择的能力.为此,以卷积神经网络为例,首先研究如何显式地表达神经网络中的特征重要性,提出了基于感受野的特征贡献度分析方法;其次,将神经网络特征选择与传统特征评价方法进行对比分析发现,在非海量样本的情况下,传统特征评价方法对高重要性特征和噪声特征的识别能力反而能够超过神经网络.因此,进一步地提出了卷积神经网络增强特征选择模型,将传统特征评价方法对特征重要性的理解结合到神经网络的学习过程中,以辅助深度神经网络进行特征选择.在基于文本的社交媒体用户属性建模任务下进行了对比实验,结果验证了该模型的有效性.
[Abstract]:Convolutional neural networks such as depth neural networks have been widely used in different fields because of their strong expressive ability and outstanding classification performance. When faced with high dimensional features, deep neural networks are generally considered to be robust and can implicitly select features. However, because of the huge network parameters, if the amount of data does not reach enough scale, it will lead to inadequate learning. As a result, it may not be possible to achieve optimal feature selection. The black box characteristics of the neural network make it impossible to observe which features the neural network has selected and to evaluate its ability of feature selection. For this reason, taking convolutional neural networks as an example, this paper studies how to express the importance of features in neural networks explicitly, and puts forward a method of feature contribution analysis based on receptive field. By comparing neural network feature selection with traditional feature evaluation method, it is found that in the case of non-massive samples, the recognition ability of traditional feature evaluation method for high-importance features and noise features is higher than that of neural network. Therefore, an enhanced feature selection model based on convolutional neural networks is proposed. The traditional feature evaluation method is combined with the understanding of the importance of features in the learning process of the neural network, and the feature selection is carried out with the help of the depth neural network. The effectiveness of the model is verified by a comparative experiment under the task of text-based social media user attribute modeling.
【作者单位】：清华大学智能技术与系统国家重点实验室;清华大学计算机科学与技术系;
【分类号】：TP183

【相似文献】