全基因组关联研究中的两阶段设计与分析
[Abstract]:Whole genome association, an important tool for finding the susceptible genes of complex diseases, has helped scientists successfully find a number of genetic variants associated with a variety of human diseases (single nucleotide polymorphisms). Compared to one stage design (all cases and control samples are sequenced in all loci), two of them are reasonably constructed. Phase design (the first phase selected a part of the case - control sample to sequence all the sites, select a small portion of the most significant loci to enter the second stage and sequence on the remaining samples according to the results of the association test), which can greatly reduce the workload and cost of sequencing and thus become the whole genome. A commonly used method in association studies. Repeated analysis of data separately examined at each stage often loses the effectiveness of the test. Some scholars have proposed a combined analysis strategy of combining two stages of test statistics to improve the statistical efficiency. The existing joint analysis methods are based on a hypothesis known. The model is used to construct the test statistics, but the genetic model that is subordinate to the single nucleotide polymorphic loci in the actual disease is usually unknown, that is, the genetic model is uncertain. If the assumed genetic model is incorrect, it may lead to unrobust performance.
This paper focuses on the robust unit point joint analysis method in the two stage design of whole genome research, including the following three sub topics. (1) we propose a robust test (MERT) and MAX based on the measurement of the secondary allele more than 5%, based on the measurement of the maximum and minimum efficiency of the two robust test series. 3 test (recessive, dominant, the maximum of the absolute value of the trend test statistics calculated under the genetic model) - the joint analysis method, obtains the large sample asymptotic distribution of the MERT joint analysis test statistics and gives a efficient and feasible parameter Bootstrap method for calculating the p value and the work effect of the joint analysis method of the MAX3. A large number of simulated studies on MAX3 joint analysis, MERT joint analysis and repeated analysis, joint analysis and repeated analysis based on additive model trend test statistics, and comparison of statistical efficacy based on the combined analysis method and repeated analysis method based on allele test statistics, and numerical results. The effectiveness of the combined analysis was generally higher than repeated analysis and the MAX3 combined analysis had the best performance. An analysis of the actual data of a study of type 2 diabetes was carried out. A new risk single nucleotide polymorphisms were reported by the p value calculated by the MAx3 combined analysis. (2) the frequency of secondary alleles was less than 5%. In rare variations, we propose a Beta test based repeated analysis method and a joint analysis method. The theoretical proof that the p value of the Beta test is asymptotically obeying the standard uniform distribution is given. The first class error rate and efficiency of the repeated and joint analysis are compared by simulation. The results show that the two methods can control the first type of error well. The combined analysis was more effective than repeated analysis. The two methods proposed in this study were used to analyze the actual data of rheumatoid arthritis. It was confirmed that the single nucleotide polymorphisms were significantly associated with rheumatoid arthritis.
(3) based on the asymptotic Bias factor, we propose a robust two stage Bias analysis method, and define the detection probability to evaluate the asymptotic Bias factor ranking method. By comparing the maximum asymptotical Bias factor combined analysis method, the genetic model average asymptotic Bias factor joint analysis method can be added. The results show that the maximum asymptotic Bias factor combined analysis method has the most robust performance. The analysis of a group of actual data shows that the maximum asymptotic Bias factor sorting method can effectively detect the single nucleotide polymorphic loci of the recessive or dominant model and the single nucleotide polymorphic loci of the hidden or dominant model. The association between diseases.
The full text is divided into six chapters. The first chapter is introduction, introduces some basic concepts and research background. The second chapter is preparatory knowledge, introduces some common statistics and test methods in the study of whole genome association; the third chapter discusses the two stage design and analysis of common genetic variation; the fourth chapter studies the two phase design of rare genetic variation. Chapter 5 discusses two-stage design and analysis based on asymptotic Bayesian factors; Chapter 6 is a summary and outlook for future work.
【学位授予单位】:云南大学
【学位级别】:博士
【学位授予年份】:2012
【分类号】:R346
【相似文献】
相关期刊论文 前10条
1 区宝娇;不合理用药分析[J];新医学;1989年09期
2 Varro E.Tyler;张治针;;药用植物的研究[J];江西中医学院学报;1991年02期
3 聂波;刘勇;徐青;梁鑫淼;肖培根;;地参反相高效液相色谱分析方法的建立[J];世界科学技术-中医药现代化;2006年01期
4 张秋菊;崔世勇;;水中痕量铁分析进展[J];中国卫生检验杂志;2007年07期
5 潘俊杰;郑琴;杨明;;三七中三七总皂苷的提取、分离纯化及分析方法的研究进展[J];世界科学技术-中医药现代化;2007年06期
6 何涛;张岚;鄂学礼;;饮用水中二氧化氯及其消毒副产物分析方法研究进展[J];国外医学(卫生学分册);2008年02期
7 吴剑威;杨美华;高微微;赵润怀;;镰刀菌毒素分析方法研究进展[J];中草药;2008年04期
8 郑勤云;朱智碧;;癌症患者麻醉药品用药调查与分析[J];中国药业;2008年12期
9 刘福艳;李军;谢元超;刘福强;;中成药中非法添加化学药品的现状与分析检测对策[J];中国药事;2008年12期
10 欧灿纯;;2008年我院门诊第二类精神药品的使用与分析[J];广西医学;2009年10期
相关会议论文 前10条
1 于辉;刘洋;;应急物资的两阶段局内分配策略[A];经济全球化与系统工程——中国系统工程学会第16届学术年会论文集[C];2010年
2 郑青山;杨娟;;不同实验数据合并分析方法[A];定量药理研究方法学培训班讲义[C];2010年
3 马蔚;;锰的分析方法进展[A];新世纪预防医学面临的挑战——中华预防医学会首届学术年会论文摘要集[C];2002年
4 顾昌明;;关于复配农药分析方法的探讨[A];江苏省农药学术研讨会论文集[C];1997年
5 陆启亮;翟永梅;;两阶段MPA法的改进研究[A];上海防灾救灾研究所20周年庆典会议研究短文集[C];2009年
6 石锋;;语音格局的分析方法[A];第六届全国现代语音学学术会议论文集(上)[C];2003年
7 邵国建;苏静波;;区间可靠性分析方法及在地下隧道结构计算中的应用[A];庆祝中国力学学会成立50周年暨中国力学学会学术大会’2007论文摘要集(下)[C];2007年
8 叶钟;;汽轮机调速系统的一些设计思想和分析方法[A];中国动力工程学会成立四十周年文集[C];2002年
9 孙严荣;闻章辉;乔志;范胜槐;;利用NaOAC—EDTA—NaOH煮沸浸提比色法估测中性土壤有机质含量[A];江苏土壤肥料科学与农业环境[C];2004年
10 张雪莲;蔡莲珍;仇士华;;生物体中~(13)C、~(15)N的分析方法[A];第三届全国现代生物物理技术学术讨论会论文摘要汇编[C];2000年
相关重要报纸文章 前10条
1 赵俊豪;质量经济效益分析方法种种[N];中国质量报;2008年
2 晓王;市场发生了转折 分析方法也会变[N];黄山日报;2006年
3 meiying88;股票分析贵在专一而不在多[N];上海证券报;2007年
4 何泳涛;初入市者应重视技术分析[N];期货日报;2008年
5 柴宁;处处留心皆学问[N];期货日报;2007年
6 大时代投资 胡红霞;权证投资之分时K线技巧[N];证券日报;2005年
7 黄永忠;现行短期偿债能力分析方法的缺陷[N];中国财经报;2002年
8 ;石油开采废水回注应达到《碎屑盐油藏注水水质推荐指标及分析方法》规定的标准[N];中国环境报;2005年
9 郑武;瓦楞纸箱设备电气故障检查和分析方法[N];中国包装报;2006年
10 曹健美;企业会计报表分析方法刍议[N];中国城乡金融报;2002年
相关博士学位论文 前10条
1 潘东东;全基因组关联研究中的两阶段设计与分析[D];云南大学;2012年
2 蒋爱华;泛(火用)分析方法及其应用研究[D];中南大学;2011年
3 任曼;环境与生物样品中PCDD/Fs和DL-PCBs的分析方法与环境行为初步研究[D];中国科学院研究生院(广州地球化学研究所);2006年
4 陈福南;高效液相色谱—化学发光分析研究[D];西南大学;2008年
5 李剑;大气中羰基化合物PFPH/GC/MS分析方法的建立及其应用[D];上海大学;2009年
6 王玉t,
本文编号:2164377
本文链接:https://www.wllwen.com/xiyixuelunwen/2164377.html