Codon Usage Patterns in Morus notabilis and Development of a Codon Analysis Tool

Published: 2018-08-06 11:23
[Abstract]: A codon is a set of three consecutive nucleotides on mRNA, and the genetic code is the rule by which nucleotides encode amino acids during protein synthesis. Deciphering the genetic code linked protein information to nucleic acid information and was a milestone in molecular biology. Some amino acids are encoded by several synonymous codons, so genes that encode the same protein can have entirely different codon usage patterns. Biased use of synonymous codons affects gene expression at both the transcriptional and translational levels. During co-translational protein folding, biased codon usage alters protein conformation and thereby leads to functional divergence. In addition, mRNA stability and the occurrence of cancer and other diseases are related to codon usage patterns. With the arrival of the post-genomic era, massive amounts of genomic and transcriptomic data can be obtained from public databases. How to analyze and exploit existing data to guide experimental research is currently one of the main topics of bioinformatics. The genome sequencing of Morus notabilis, a member of the family Moraceae in the order Rosales, provides abundant gene data for research on other Rosales plants. Mulberry is highly adaptable, widely distributed around the world, and of high food and medicinal value. This study presents a fairly comprehensive bioinformatic analysis of codon usage in M. notabilis. The main results are as follows:

1. Codon analysis of the M. notabilis chloroplast genome. Analysis of the M. notabilis chloroplast genome showed that selection pressure acts on the codon usage of its genes. A study of the chloroplasts of fifteen Rosales species showed that the nucleotide composition of chloroplast genes is very similar among Rosales plants. The similarity of codon usage patterns between species is not fully consistent with their phylogenetic relationships, which indicates that selection pressure acting on some genes in some species causes closely related species to show considerable differences in codon usage. Codon usage in the chloroplast genes of Rosales species is generally subject to selection, and M. notabilis has experienced the weakest natural selection by comparison.

2. Codon analysis of the M. notabilis nuclear genome. Analysis of codon usage patterns in the nuclear genome identified the high-frequency codons and optimal codons of M. notabilis. Combined with transcriptome data, the analysis showed that the proportion of optimal codons rises with gene expression level, and this trend is especially pronounced in housekeeping genes. An analysis of potential factors influencing codon usage showed that base composition, gene length, expression level, and the molecular weight of the encoded amino acids all affect the biased use of synonymous codons. Multiple correlation analysis showed that highly expressed genes in M. notabilis are short, have strong codon bias and high GC content, and prefer amino acids with low molecular weight and lower synthesis cost. Analysis of genes whose observed codon bias differed substantially from the predicted value showed that genes involved in nucleotide metabolism, energy metabolism, transcription, translation, and similar processes are more likely to be under selection and therefore show stronger codon bias, which suggests a link between gene function and codon usage pattern. Mutual information analysis showed that only genes involved in processes such as pyrimidine metabolism, purine metabolism, and RNA polymerase contain a large number of codons with function-specific usage patterns, while genes with other functions have no particularly distinctive codon usage pattern.

In housekeeping genes, codon bias corresponds well with gene expression level. A similar phenomenon has been reported in human genes in recent years, together with the proposal that codons in housekeeping genes may be under stronger translational selection. Gene expression level affects codon bias, and in M. notabilis the expression level of housekeeping genes is clearly higher than that of tissue-specific genes and other genes. Comparing housekeeping genes with other genes of similar expression level still gave similar results, which indicates that codons in housekeeping genes are under clear selection.

Codon context bias was also observed in M. notabilis. Interestingly, this codon context bias is influenced by gene expression level. Bias exists not only between adjacent codons: the position of a codon within a gene also affects the biased use of synonymous codons. The region near the start of a gene prefers codons with C at the degenerate site and avoids codons with U at the degenerate site. Codons with C at the degenerate site show a fairly consistent usage trend across genes, whereas the position-based bias of other codons is less consistent. Despite this position-based codon bias, it does not appear to produce a significant difference between the folding energy of M. notabilis genes and that of a synonymous-codon null model.

3. Development of Codons Box, a codon analysis program with a user interface. The functions used most often in this study were collected and implemented in Java as Codons Box, a codon analysis tool with a friendly user interface. It removes some of the repetitive work from subsequent codon analyses, so that researchers can concentrate on deeper and more customized codon studies. Codons Box is available at https://github.com/cqwenyan/Codons Box and is intended to support research in related fields.

The study of codon usage in M. notabilis provides reference material for related analyses in other plants and basic data for mulberry transgenic and breeding research. The tool developed here offers a simple and efficient approach to codon analysis and facilitates customized codon studies in other species.
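The abstract does not state which statistics were used to identify high-frequency codons or to describe base composition at degenerate sites. As an illustration only, the following Python sketch computes two measures that are common in codon usage studies: relative synonymous codon usage (RSCU) and the GC fraction at third codon positions (GC3). The choice of these two measures, the function names, and the toy sequence are assumptions made for this example; this is not the Codons Box implementation, which is written in Java.

```python
from collections import Counter

# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def count_codons(cds):
    """Count in-frame codons of a coding sequence (hypothetical helper)."""
    cds = cds.upper().replace("U", "T")
    counts = Counter()
    for i in range(0, len(cds) - len(cds) % 3, 3):
        codon = cds[i:i + 3]
        if codon in CODON_TABLE:  # skip codons with ambiguous bases
            counts[codon] += 1
    return counts


def rscu(counts):
    """RSCU = observed count / mean count within the synonymous family.

    Stop codons and single-codon amino acids (Met, Trp) are skipped,
    as are amino acids that do not occur in the input.
    """
    families = {}
    for codon, aa in CODON_TABLE.items():
        families.setdefault(aa, []).append(codon)
    result = {}
    for aa, family in families.items():
        if aa == "*" or len(family) == 1:
            continue
        total = sum(counts.get(c, 0) for c in family)
        if total == 0:
            continue
        for c in family:
            result[c] = counts.get(c, 0) * len(family) / total
    return result


def gc3(counts):
    """Fraction of counted codons with G or C at the third position."""
    total = sum(counts.values())
    gc = sum(n for codon, n in counts.items() if codon[2] in "GC")
    return gc / total if total else 0.0


if __name__ == "__main__":
    toy = count_codons("ATGGCTGCTGCCGCATAA")  # toy sequence, not thesis data
    print(rscu(toy)["GCT"], round(gc3(toy), 3))
```

An RSCU value above 1 means a codon is used more often than expected under equal usage within its synonymous family. In practice the counts would be pooled over all coding sequences of a genome, or over a set of highly expressed genes when looking for optimal codons.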
[Degree-granting institution]: Southwest University
[Degree level]: Master's
[Year degree conferred]: 2017
[Classification numbers]: S888.2; Q943.2



