智能建模中冗余问题的分析与处理及其应用
发布时间:2018-08-22 13:58
【摘要】:随着现代科学技术飞速发展,石油化工产业不断进步和发展,石油化工产品质量不断提高、种类不断多元化。当面临新颖的生产工艺、不断复杂化的过程系统时,其过程机理难以获得,机理模型难以建立。因此,基于数据的建模方法得到了越来越多的关注。基于数据的建模方法不用了解化工过程机理,却十分依赖样本数据的质量和模型的结构。而在这两方面往往会出现冗余问题。A)挑选的输入变量可能与因变量无关,且相互之间可能存在冗余。工业过程中,通过先验知识或已知部分机理知识确定足够多的变量个数,而它们之间通常存在复杂的交互关系。如果将它们都作为模型输入,就会直接增加模型输入结构的复杂度,而且冗余问题可能间接传递到模型输出,严重影响其性能。B)模型结构中的冗余问题。模型的性能与其结构的优劣息息相关,而且结构的复杂度决定了模型的计算效率。为此,本文针对这两类智能建模中的冗余问题,首先通过多元统计方法中的主成分分析,并结合互信息分析法,研究和探讨了如何消除输入变量之间的冗余问题、如何发现与因变量无关的输入变量。其次,通过偏互信息法和提出的基于偏互信息的聚类方法,消除神经网络隐含层输出之间存在的冗余问题,优化神经网络结构。本文的主要研究成果如下: (1)针对输入变量之间可能存在复杂的冗余问题,结合径向基神经网络建模,提出了基于主成分-互信息分析的径向基神经网络模型(Principal Component Analysis-Mutual Information-Radial Basis Function Neural Network, PCA-MI-RBFNN)。首先通过主成分分析,把含有冗余问题的原始输入变量转换为新变量—主成分。主成分之间互不相关,并按照样本方差从大到小构建。模型要描述的是输入和输出变量之间的关系,因此按方差最大化来选取主成分变量作为模型输入,忽略了输入与输出变量之间的相关信息。于是结合互信息分析法,可以准确分析各主成分与输出变量之间的相关性,挑选出最佳的主成分作为模型输入。经过标准测试建模数据和精对苯二甲酸生产过程中氧化单元粗对苯二甲酸中对羧基苯甲醛(4-carboxybenzaldehyde,4-CBA)含量软测量建模的测试,结果表明消除输入冗余后,PCA-MI-RBFNN模型具有良好的鲁棒和预测性能。 (2)针对挑选的输入变量可能与因变量无关,且输入变量之间可能存在复杂的冗余问题,结合相关向量机建模,提出了基于互信息-主成分-互信息分析的相关向量机模型(Mutual Information-Principal Component Analysis-Mutual Information-Relevance Vector Machine, MI-PCA-MI-RVM)。针对化工过程中高维的原始输入变量,其中有些变量与因变量毫不相关,若将这些变量直接用于建模,则会导致模型不准确;有些变量虽然与因变量相关,但相互之间存在冗余问题,若将这些变量直接用于建模,则会间接降低模型性能。因此提出对原始样本数据的粗筛选方式。MI-PCA-MI-RVM方法首先通过互信息分析,获得所有输入与输出变量之间的互信息量,并根据互信息量的概率密度分布,确定区分无关变量与相关变量的阈值,剔除无关输入变量。然后,针对剩余的输入变量,通过主成分-互信息分析挑选出与模型输出最相关的主成分作为模型输入。通过对二甲苯氧化反应中的4-CBA含量软测量模型的测试,结果表明剔除无关输入变量以及消除输入冗余后,MI-PCA-MI-RVM模型具有良好的鲁棒和预测性能。 (3)针对径向基神经网络的结构优化问题,提出了基于偏互信息-最小二乘的隐含层单元挑选及其网络权值和阈值更新(Partial Mutual Information-Least Square Regression-Radial Basis Function Neural Network, PMI-LSR-RBFNN)。PMI-LSR-RBFNN方法首先通过改进的互信息分析法-偏互信息方法,挑选出合适隐含层单元,这些被挑选出的单元不仅与相互之间冗余最小,而且与输出变量的相关性最大。然后,通过最小二乘,对隐含层输出与输出层输出直接进行线性回归,更新了权值与阈值,建立RBFNN模型。在英威达氧化过程燃烧副反应建模中,与基于K均值、模糊C均值、K中心点和减法聚类的改进径向基网络相比,PMI-LSR-RBFNN网络结构简洁且模型性能更佳。通过Sammon非线性映射分析,由偏互信息分析挑选出的隐含层单元在空间位置上并不是均匀分布,但表现出更佳的模型性能。同时,基于建立的模型,进行各主要操作变量的灵敏度分析,其结果符合燃烧副反应过程的已知先验知识。 (4)针对多层前馈神经网络的结构优化问题,提出了基于最小冗余最大相关-偏互信息聚类方法和最小二乘的隐含层单元挑选及其网络权值和阈值更新方法(MinimalRedundancy Maximal Relevance-Partial Mutual Information Clustering-Least Square Regression-Multi Layer Feed Forward Network, mPMIc-LSR-MLFN)当变量维数增加时偏互信息会消耗大量计算时间,且容易失去估计精度,因此提出了一种新颖的最小冗余最大相关-偏互信息聚类方法。由最小冗余最大相关性分析挑选出合适的隐含层单元作为初始聚类中心;再通过偏互信息量的计算,对所有隐含层单元进行聚类,并在各类中迭代更新该类中心,直到所有中心不再变化,从而寻找到最佳隐含层单元。最后通过最小二乘法线性回归更新输出层输入与隐含层输出之间的权值和阈值。在石脑油干点软测量模型的应用中,与基于K均值、减法等聚类的MLFN和三类改进极限学习机(OP-、OS-、B-ELM)相比,mPMIc-LSR-MLFN模型结构最为简洁,预测性能最为出色。
[Abstract]:With the rapid development of modern science and technology, the petrochemical industry has been progressing and developing, the quality of petrochemical products has been improved and the kinds of petrochemical products have been diversified. Data-based modeling relies heavily on the quality of sample data and the structure of the model without understanding the mechanism of chemical processes. In both cases, redundancy often occurs. A) The input variables selected may be independent of dependent variables and may be redundant with each other. If they are used as model inputs, the complexity of model input structure will be increased directly, and the redundancy problem may be transmitted to model output indirectly, seriously affecting its performance. The performance of the model is closely related to its structure, and the complexity of the structure determines the computational efficiency of the model. To solve the redundancy problem in these two kinds of intelligent modeling, this paper firstly studies and discusses how to eliminate the redundancy between input variables through the principal component analysis of multivariate statistical methods and the mutual information analysis method. Secondly, through partial mutual information method and the proposed clustering method based on partial mutual information, the redundancy problem between hidden layer outputs of neural networks is eliminated and the structure of neural networks is optimized.
(1) In order to solve the complex redundancy problem between input variables, a principal component analysis-Mutual Information-Radial Basis Function Neural Network (PCA-MI-RBFNN) model based on principal component analysis and mutual information analysis is proposed. The model describes the relationship between input and output variables, so the principal component variable is selected as the input of the model according to the variance maximization, ignoring the relationship between input and output variables. Then combined with the mutual information analysis method, the correlation between the principal components and the output variables can be analyzed accurately, and the best principal components can be selected as the model input. The test results of CBA content soft sensor modeling show that PCA-MI-RBFNN model has good robustness and predictive performance after eliminating input redundancy.
(2) Considering that the selected input variables may be independent of dependent variables and that there may be complex redundancy between input variables, a Mutual Information-Principal Component Analysis-Mutual Information-Relevance Vector Ma is proposed based on mutual information-principal component-mutual information analysis. Chine, MI-PCA-MI-RVM. For high-dimensional raw input variables in chemical processes, some of them have nothing to do with dependent variables. If these variables are directly used for modeling, the model will be inaccurate; some variables are related to dependent variables, but there are redundancy problems between them. If these variables are directly used for modeling, the model will be inter-sessional. MI-PCA-MI-RVM first obtains the mutual information between all input and output variables by mutual information analysis, and then determines the threshold to distinguish independent variables from related variables according to the probability density distribution of mutual information. For the remaining input variables, the principal component which is most relevant to the output of the model is selected as the model input by principal component-mutual information analysis. The results of the soft-sensing model of 4-CBA content in xylene oxidation show that the MI-PCA-MI-RVM model has good Lu after removing the independent input variables and eliminating input redundancy. Rod and prediction performance.
(3) To optimize the structure of radial basis function neural networks (RBFNN), the hidden layer cell selection based on partial mutual information-least squares (PMI-LSR-RBFNN) and its network weights and thresholds update (PMI-LSR-RBFNN) are proposed. Then, the RBFNN model is established by linear regression between the output of the hidden layer and the output of the output layer directly through the least square method, updating the weights and thresholds. The PMI-LSR-RBFNN network has a simpler structure and better performance than the improved radial basis function network based on K-means, fuzzy C-means, K-center and subtractive clustering in the modeling of combustion side reactions in INVIDA oxidation process. At the same time, the sensitivity analysis of the main operating variables based on the model is carried out, and the results accord with the known prior knowledge of the combustion side reaction process.
(4) To optimize the structure of multi-layer feed-forward neural networks, a minimum redundancy maximum correlation-partial mutual information clustering method and a least squares hidden layer unit selection method, as well as a minimal Redundancy Maximal Relevance-Partial Mutual Information Clustering-Least Square Regression-Mul method are proposed. Ti Layer Feed Forward Network (mPMIc-LSR-MLFN) Partial Mutual Information (PMI) consumes a lot of computation time when the dimension of variables increases, and it is easy to lose the estimation accuracy. Therefore, a novel clustering method of minimum redundancy maximum correlation-biased information is proposed. Clustering centers; then, all hidden layer units are clustered by calculating the partial information, and the centers are updated iteratively until all centers are not changed, so as to find the best hidden layer units. Finally, the weights and thresholds between the input of output layer and the output of hidden layer are updated by the least squares linear regression. In the application of soft sensor model of naphtha dry point, compared with MLFN based on K-means, subtraction and other clustering and three kinds of improved extreme learning machines (OP-, OS-, B-ELM), the mPMIc-LSR-MLFN model has the most concise structure and the best prediction performance.
【学位授予单位】:华东理工大学
【学位级别】:博士
【学位授予年份】:2015
【分类号】:TE65;TP183
[Abstract]:With the rapid development of modern science and technology, the petrochemical industry has been progressing and developing, the quality of petrochemical products has been improved and the kinds of petrochemical products have been diversified. Data-based modeling relies heavily on the quality of sample data and the structure of the model without understanding the mechanism of chemical processes. In both cases, redundancy often occurs. A) The input variables selected may be independent of dependent variables and may be redundant with each other. If they are used as model inputs, the complexity of model input structure will be increased directly, and the redundancy problem may be transmitted to model output indirectly, seriously affecting its performance. The performance of the model is closely related to its structure, and the complexity of the structure determines the computational efficiency of the model. To solve the redundancy problem in these two kinds of intelligent modeling, this paper firstly studies and discusses how to eliminate the redundancy between input variables through the principal component analysis of multivariate statistical methods and the mutual information analysis method. Secondly, through partial mutual information method and the proposed clustering method based on partial mutual information, the redundancy problem between hidden layer outputs of neural networks is eliminated and the structure of neural networks is optimized.
(1) In order to solve the complex redundancy problem between input variables, a principal component analysis-Mutual Information-Radial Basis Function Neural Network (PCA-MI-RBFNN) model based on principal component analysis and mutual information analysis is proposed. The model describes the relationship between input and output variables, so the principal component variable is selected as the input of the model according to the variance maximization, ignoring the relationship between input and output variables. Then combined with the mutual information analysis method, the correlation between the principal components and the output variables can be analyzed accurately, and the best principal components can be selected as the model input. The test results of CBA content soft sensor modeling show that PCA-MI-RBFNN model has good robustness and predictive performance after eliminating input redundancy.
(2) Considering that the selected input variables may be independent of dependent variables and that there may be complex redundancy between input variables, a Mutual Information-Principal Component Analysis-Mutual Information-Relevance Vector Ma is proposed based on mutual information-principal component-mutual information analysis. Chine, MI-PCA-MI-RVM. For high-dimensional raw input variables in chemical processes, some of them have nothing to do with dependent variables. If these variables are directly used for modeling, the model will be inaccurate; some variables are related to dependent variables, but there are redundancy problems between them. If these variables are directly used for modeling, the model will be inter-sessional. MI-PCA-MI-RVM first obtains the mutual information between all input and output variables by mutual information analysis, and then determines the threshold to distinguish independent variables from related variables according to the probability density distribution of mutual information. For the remaining input variables, the principal component which is most relevant to the output of the model is selected as the model input by principal component-mutual information analysis. The results of the soft-sensing model of 4-CBA content in xylene oxidation show that the MI-PCA-MI-RVM model has good Lu after removing the independent input variables and eliminating input redundancy. Rod and prediction performance.
(3) To optimize the structure of radial basis function neural networks (RBFNN), the hidden layer cell selection based on partial mutual information-least squares (PMI-LSR-RBFNN) and its network weights and thresholds update (PMI-LSR-RBFNN) are proposed. Then, the RBFNN model is established by linear regression between the output of the hidden layer and the output of the output layer directly through the least square method, updating the weights and thresholds. The PMI-LSR-RBFNN network has a simpler structure and better performance than the improved radial basis function network based on K-means, fuzzy C-means, K-center and subtractive clustering in the modeling of combustion side reactions in INVIDA oxidation process. At the same time, the sensitivity analysis of the main operating variables based on the model is carried out, and the results accord with the known prior knowledge of the combustion side reaction process.
(4) To optimize the structure of multi-layer feed-forward neural networks, a minimum redundancy maximum correlation-partial mutual information clustering method and a least squares hidden layer unit selection method, as well as a minimal Redundancy Maximal Relevance-Partial Mutual Information Clustering-Least Square Regression-Mul method are proposed. Ti Layer Feed Forward Network (mPMIc-LSR-MLFN) Partial Mutual Information (PMI) consumes a lot of computation time when the dimension of variables increases, and it is easy to lose the estimation accuracy. Therefore, a novel clustering method of minimum redundancy maximum correlation-biased information is proposed. Clustering centers; then, all hidden layer units are clustered by calculating the partial information, and the centers are updated iteratively until all centers are not changed, so as to find the best hidden layer units. Finally, the weights and thresholds between the input of output layer and the output of hidden layer are updated by the least squares linear regression. In the application of soft sensor model of naphtha dry point, compared with MLFN based on K-means, subtraction and other clustering and three kinds of improved extreme learning machines (OP-, OS-, B-ELM), the mPMIc-LSR-MLFN model has the most concise structure and the best prediction performance.
【学位授予单位】:华东理工大学
【学位级别】:博士
【学位授予年份】:2015
【分类号】:TE65;TP183
【参考文献】
相关期刊论文 前10条
1 张尧;席云华;胡金磊;夏成军;林凌雪;;基于PCA和RBF神经网络的中长期负荷预测方法[J];电气应用;2008年02期
2 段青;赵建国;马艳;;优化组合核函数相关向量机电力负荷预测模型[J];电机与控制学报;2010年06期
3 颜学峰,陈德钊,胡上序;用自适应偏最小二乘回归为药物定量构效关系建模[J];分析化学;2002年05期
4 史文利;高天宝;王树恩;;基于主成分分析与聚类分析的城市化水平综合评价[J];工业工程;2008年03期
5 陈牢,
本文编号:2197309
本文链接:https://www.wllwen.com/kejilunwen/shiyounenyuanlunwen/2197309.html