Constrained Queries for Order-Preserving Submatrices Based on Digital Signatures and Tries
Published: 2018-11-02 19:16
[Abstract]: The rapid development of gene chip technology has led biologists to accumulate large volumes of gene expression data collected under different experimental conditions. Gene chip data analysis has proved important for understanding gene function, gene regulation, and molecular life processes. The order-preserving submatrix (OPSM) is an effective model in gene chip data analysis: it identifies clusters that show the same expression trend across a subset of genes and a subset of experimental conditions. When analyzing the mechanisms of gene expression, OPSM retrieval saves biologists time and effort. Current OPSM queries rely mainly on keyword-based retrieval, which gives analysts little control over the results. The ad hoc parameter settings that analysts are able to choose often deviate from their domain knowledge, so the retrieved results can be far from what they actually want. To address these problems, two classes of OPSM indexing and constrained query methods based on digital signatures and Tries are proposed. Extensive experiments on real data show that the proposed methods are effective and scalable.
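As an illustration of the OPSM model described in the abstract, the following is a minimal sketch only. It assumes that an OPSM is a set of rows (genes) whose values induce the same ordering over a chosen set of columns (conditions), and it uses a plain trie keyed on that column ordering to answer a simple prefix constraint. The names `column_order`, `is_opsm`, `OrderTrie`, and `with_prefix` are hypothetical. The digital-signature component is not modeled, and this is not the indexing method proposed in the paper.

```python
def column_order(row, cols):
    """Return the selected columns sorted by this row's expression values.

    Ties keep the order given in `cols` (sorted() is stable); how the paper
    treats ties is not stated in the abstract, so this is an assumption.
    """
    return tuple(sorted(cols, key=lambda c: row[c]))


def is_opsm(matrix, rows, cols):
    """True if every selected row induces the same ordering of `cols`."""
    return len({column_order(matrix[r], cols) for r in rows}) == 1


class OrderTrie:
    """Hypothetical trie whose paths are column orderings of stored OPSMs."""

    def __init__(self):
        self.children = {}
        self.ids = []  # ids of OPSMs whose column ordering ends at this node

    def insert(self, order, opsm_id):
        node = self
        for c in order:
            node = node.children.setdefault(c, OrderTrie())
        node.ids.append(opsm_id)

    def with_prefix(self, prefix):
        """Return ids of all stored OPSMs whose ordering starts with `prefix`."""
        node = self
        for c in prefix:
            node = node.children.get(c)
            if node is None:
                return []
        found, stack = [], [node]
        while stack:
            n = stack.pop()
            found.extend(n.ids)
            stack.extend(n.children.values())
        return sorted(found)


# Toy expression matrix: rows are genes, columns are conditions.
matrix = [
    [1.0, 5.0, 3.0, 9.0],
    [2.0, 8.0, 4.0, 1.0],
    [7.0, 2.0, 6.0, 3.0],
]

trie = OrderTrie()
trie.insert(column_order(matrix[0], [0, 1, 2]), "A")  # ordering (0, 2, 1)
trie.insert((0, 1), "B")
```

In this toy matrix, rows 0 and 1 rank columns 0, 1, 2 identically, so they form an OPSM on those columns, while adding row 2 breaks the shared ordering. A prefix query such as `trie.with_prefix((0, 2))` then restricts the results to OPSMs whose ordering begins with conditions 0 and 2.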
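[Affiliation]: School of Computer Science, Northwestern Polytechnical University
[Funding]: National Basic Research Program of China (973 Program) (2012CB316203); National Natural Science Foundation of China (61033007, 61272121, 61332014, 61572367, 61472321, 61502390); National High Technology Research and Development Program of China (863 Program) (2015AA015307); Fundamental Research Funds for the Central Universities (3102015JSJ0011); Graduate Starting Seed Fund of Northwestern Polytechnical University (Z2012128)
[Classification Number]: TN918.91; TP311.13
Article ID: 2306712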
Article link: https://www.wllwen.com/kejilunwen/xinxigongchenglunwen/2306712.html