当前位置:主页 > 科技论文 > 基因论文 >

正向遗传学研究中的候选基因识别

发布时间:2019-03-07 22:08
【摘要】:随着测序技术的发展,高通量测序的精准度不断提高,成本逐年下降,相关的应用亦越来越广,本文以高通量测序技术为基础,利用测序数据,提出鉴定家系遗传病的候选基因的方法。在实际操作上需面临流程参数设定,显隐性假设,以及候选基因是抑制子或激活子的假设等问题。对研究者而言如何选择更优的参数,针对各种遗传假设快速合理地得到分析结果,是充满挑战性的。为了更好地解决这一问题,基于二代测序技术的基因识别软件GIPS应运而生,他为研究者提供了四种指导性参数,并成功地应用于水稻测序数据上。本文统筹考虑了家系遗传病中各种可能出现的情况,在第一代GIPS的基础上增加了新的家系过滤算法,ANNOVAR注释软件接口,曼哈顿图谱绘制功能,以及底层的背景突变率算法的重定义和公式改进。进一步增加该软件的实用性,可扩展性,生物学可靠性以及精准度。并为提高运行效率,推出了快速获得候选基因列表的新命令。针对第二代GIPS软件的优化。首先,讨论了在家系遗传病中可能存在的各种情况,如待研究病症的显隐性,候选基因是抑制子或是激活子。并根据每种情况,利用不同SNP在基因座上的基因型,进行家系筛选。为了使得用户更直观地了解基因组上所有基因与表型的相关程度,绘制出基因组上每一个基因与表型相关联的显著性的曼哈顿图谱。亦可更直观的反应出通过阈值筛选的候选基因数目。通过改进背景突变率的算法,我们得到更为精确的评估标准,文中亦对前后两种背景突变率作了详细的比较,为了更多的从生物学角度改进该算法,我们还可以综合不同类型的打分软件,从不同角度(例如序列保守程度,理化性质等),指导SNP的筛选。为此新一代的GIPS在原来的基础上增加了新的打分软件的接口,以上这些新增功能均成功应用在水稻数据和耳聋等家系遗传病数据上,并已发表文章于SCI 期刊 Plant Physiology。网址:http://www.plantphysiol.org/content/170/4/1929
[Abstract]:With the development of sequencing technology, the precision of high-throughput sequencing has been improved, the cost of high-throughput sequencing has been reduced year by year, and the related applications have also become more and more extensive. This paper is based on high-throughput sequencing technology and uses sequencing data. A method for identifying candidate genes for hereditary diseases in families was proposed. In practice, some problems such as process parameter setting, recessive hypothesis and hypothesis that candidate genes are suppressors or activators are needed. It is challenging for researchers to choose better parameters and get the analysis results quickly and reasonably according to a variety of genetic assumptions. In order to solve this problem, GIPS, a gene recognition software based on second-generation sequencing technology, emerged as the times require. He provides four guiding parameters for researchers and has been successfully applied to rice sequencing data. In this paper, a new family filtering algorithm, ANNOVAR annotation software interface and Manhattan map drawing function are added on the basis of the first generation of GIPS, considering all kinds of possible conditions in family genetic diseases. And the redefinition and formula improvement of the underlying background mutation rate algorithm. Further increase the practicability, expansibility, biological reliability and precision of the software. In order to improve the operation efficiency, a new command was proposed to quickly obtain candidate gene list. Optimization of the second generation GIPS software. First of all, the possible conditions of familial genetic diseases are discussed, such as the recessive of the disease to be studied, and the candidate genes are suppressors or activators. According to each situation, different genotypes of SNP at the locus were used to screen the families. In order to make users understand the correlation degree of all genes and phenotypes in the genome more intuitively, the Manhattan map of the correlation between each gene and phenotype in the genome is plotted. It can also more intuitively reflect the number of candidate genes screened by threshold. By improving the algorithm of background mutation rate, we get more accurate evaluation criteria. We also make a detailed comparison between the two kinds of background mutation rate, in order to improve the algorithm more from the biological point of view. We can also integrate different scoring software to guide the screening of SNP from different angles (such as conservative degree of sequence, physical and chemical properties, etc.). To this end, the new generation of GIPS has added a new interface to the scoring software. These new functions have been successfully applied to rice data and deafness data in pedigrees, and have been published in the SCI journal Plant Physiology.. Web site: http://www.plantphysiol.org/content/170/4/1929
【学位授予单位】:浙江大学
【学位级别】:硕士
【学位授予年份】:2017
【分类号】:Q78

【参考文献】

相关硕士学位论文 前1条

1 朱忠旭;基于表型测序的功能基因识别[D];浙江大学;2015年



本文编号:2436496

资料下载
论文发表

本文链接:https://www.wllwen.com/kejilunwen/jiyingongcheng/2436496.html


Copyright(c)文论论文网All Rights Reserved | 网站地图 |

版权申明:资料由用户c4d98***提供,本站仅收录摘要或目录,作者需要删除请E-mail邮箱bigeng88@qq.com