数据挖掘在NDRG2转录调控和信号转导研究中的应用
[Abstract]:In recent years, the application of high-throughput technology in biomedical research has led to the accumulation of a large number of biological data, including gene and protein sequences, DNA microarrays and biomedical images. In our research, we used data mining technology to extract effective information from HepG2 cell expression profiles chip for the first time, and innovatively predicted the transcriptional regulatory factors of NDRG2 and the phase of NDRG2 and Dusp6. Interaction molecules.
Because the expression of NDRG2 in tumor cells is decreased, in order to understand the biological effect of NDRG2 on tumor cells, we detected the gene expression of HepG2 cells with expression profiling chip, and analyzed the enrichment of the chip data. GO biological process analysis showed that the expression of genes involved in G protein signal transduction increased. Five of these genes were identified by qRT-PCR, while the genes involved in M phase were decreased, which was consistent with the analysis of cell cycle. Signal pathway analysis showed that the expression of genes related to blood cell differentiation and cell adhesion increased significantly, and the expression of genes related to protein GPI modification, protein degradation and cell secretion decreased. We found that NDRG2 could increase the phosphorylation level of p38 through the analysis of motifs and experiments. Through enrichment analysis, we successfully extracted effective information from the chip data and provided a molecular basis for understanding the mechanism of NDRG2 in tumor cells.
In order to understand the expression pattern of NDRG2 under different conditions, we used ARACNE algorithm and phantom scan to predict the transcription factors that regulate the expression of NDRG2. Through phantom scan, we found that there were 129 binding sites of transcription factors in the promoter region of NDRG2. Finally, we obtained 53 candidate transcription factors that might regulate the expression of NDRG2 gene. Among these transcription factors, KLF4 was chosen to induce colon cancer cells to differentiate. Functional analysis of these transcription factors showed that they were mainly related to cell differentiation and organogenesis. The transport and localization of substances in cells are consistent with previous studies.
Finally, in order to find the interaction molecules of NDRG2, we searched for the homologues of NDRG2 in Arabidopsis and identified NDL1, NDL2 and NDL3 as the homologous proteins of NDRG2. These NDL proteins have been reported to interact with AGB1 and RGS1, suggesting that NDRG2 can interact with the homologues of these two molecules in humans. AGB1 is a G protein trimer. In humans, five AGB1 homologues, GNB1-GNB5.RGS1, are GTP enzyme activators that interact with the alpha subunit of G-protein trimer, activate its activity to hydrolyze GTP and thus inhibit the G-protein signaling pathway. Although there is only one RGS protein in Arabidopsis, there are about 20 in humans. RGS protein. Among these proteins, RGS5 is the most similar to RGS1 in Arabidopsis and is therefore used for further experimental verification. Through immunoprecipitation and his-pulldown experiments, we found that RGS5 can interact with NDRG2. To establish a method for predicting a protein-protein interaction molecule, we first combined the bases. Interacting molecules of Dusp6 were predicted based on the expression profile data and protein sequence characteristics. We used MINDy algorithm to find the regulatory proteins of Dusp6, and used Pred_PPI to improve the accuracy of prediction.
In conclusion, by combining data mining with experimental verification, we have successfully developed a strategy that can be used to study the transcriptional regulation and signal transduction involved in a particular gene.
【学位授予单位】:第四军医大学
【学位级别】:博士
【学位授予年份】:2012
【分类号】:R363
【共引文献】
相关期刊论文 前10条
1 张建标;王传玉;申文康;李孝丰;;考虑索赔额的NCD系统平稳分布研究[J];安徽工程科技学院学报(自然科学版);2008年01期
2 曲静;;基于马尔科夫链的西安春季首场透雨预测方法研究[J];安徽农业科学;2011年24期
3 吴蓓;;马氏链预测模型的代数处理方法[J];安庆师范学院学报(自然科学版);2010年01期
4 程韦;苏志同;;数据挖掘技术在现代信息管理系统中的研究与分析[J];北京工业职业技术学院学报;2008年04期
5 罗万春;易东;刘恩;龚利红;;整数非线性规划模型在乳腺癌转移相关基因表达调控网络建立中的应用[J];北京生物医学工程;2008年02期
6 马猛;钮俊清;宁岩;郑浩然;王煦法;;聚类和关联规则挖掘在基因表达数据分析中的应用研究[J];北京生物医学工程;2008年04期
7 孟海洋;薛红;郭培源;曹利红;;网上超市购物系统的设计与实现[J];北京工商大学学报(自然科学版);2008年06期
8 张梅荣;;基于方阵乘幂的马尔可夫链问题研究[J];北京印刷学院学报;2009年06期
9 张梅荣;;吸收态马尔可夫链及其在高校学生学业管理模型中的应用[J];北京印刷学院学报;2011年04期
10 付海燕;薛国珍;;我国图书进出口量化对比及结构演化分析[J];出版发行研究;2011年06期
相关会议论文 前9条
1 黄素英;陈华;;胃癌转移相关的肿瘤标志性基因筛选研究[A];第十届中国科协年会论文集(三)[C];2008年
2 朱春江;陆宇e
本文编号:2184742
本文链接:https://www.wllwen.com/xiyixuelunwen/2184742.html