基于RDF元数据查询和存储的研究
发布时间:2018-11-28 16:28
【摘要】:在网络信息资源剧增的今天,如何从海量且杂乱无章的Web数据中查找有价值的信息已经成为一个重要难题。语义网通过对当前的万维网进行扩展允许基于语义web信息的表示和处理,为Web信息提供形式化的含义,使跨应用、团体和企业的数据共享与重用成为了可能。 RDF作为语义网广泛的数据结构,实现了Web上信息资源的语义描述,是语义Web的基础,人们对它的研究已经成为了热点之一。随着RDF的应用范围的不断扩大,传统的数据库管理系统目前已经不能满足人们日益增长的需求,因此对RDF元数据查询和存储的研究越来越重要,本文就是在这种背景之下对RDF元数据的查询和存储进行了一些相关的研究。 主要完成的工作主要有: 首先本文对RDF元数据查询和存储的现状、概念背景进行了介绍,包括语义网的相关概念标准、语义网的七层体系结构、元数据、XML、RDF、本体技术等等,为下文RDF的查询和存储的研究奠定了基础。 其次对现有的存储和查询技术进行了简要概括,分析了当前经典的RDF数据查询和存储的技术,并重点对W3C推荐的查询语言SPARQL进行了分析。 接着分析了查询效率低下原因是现存的存储技术大多存在自身连接的问题,于是本文参考垂直分块思想,在现有的三级索引技术之上,增加索引结构来解决三级索引结构的局限性问题。改进后的索引结构使得在查询语句可以不同情况下进行不同处理,进而提高查询效率。本文还在查询时使用了能够选择最优的计算顺序的动态规划算法对查询进行优化,使得查询时可以选择更好的连接顺序,进一步提高了查询效率。 最后在改进的存储方案和查询优化基础上搭建了原型系统,并通过原型系统对提出的改进存储方案和查询优化进行实验验证,实验结果表明本文提出的方法确实能够明显的提高查询的效率。
[Abstract]:Nowadays, with the rapid increase of network information resources, how to find valuable information from massive and unorganized Web data has become an important problem. By extending the current Web to allow the representation and processing of semantic web information, the semantic Web provides a formal meaning for Web information, making it possible to share and reuse data across applications, groups and enterprises. As an extensive data structure of semantic web, RDF has realized the semantic description of information resources on Web, which is the basis of semantic Web. With the continuous expansion of the scope of RDF application, the traditional database management system can no longer meet the increasing needs of people, so the research of RDF metadata query and storage is becoming more and more important. In this paper, we do some research on RDF metadata query and storage under this background. The main work is as follows: firstly, this paper introduces the current situation of RDF metadata query and storage, and the concept background, including the semantic Web related concept standards, semantic Web seven-tier architecture, metadata, XML,RDF,. Ontology technology and so on, for the following RDF query and storage research laid the foundation. Secondly, the existing storage and query technologies are briefly summarized, the current classic RDF data query and storage techniques are analyzed, and the W3C recommended query language SPARQL is emphatically analyzed. Then, the paper analyzes the reason for the inefficiency of query is that most of the existing storage technologies have the problem of joining themselves, so this paper refers to the idea of vertical partitioning, and based on the existing three-level index technology, Add index structure to solve the limitation problem of tertiary index structure. The improved index structure enables the query statements to be processed differently under different circumstances, thus improving the query efficiency. This paper also uses the dynamic programming algorithm which can select the optimal computing order to optimize the query, so that the query can choose a better join order, and further improve the query efficiency. Finally, a prototype system is built on the basis of the improved storage scheme and query optimization, and the proposed improved storage scheme and query optimization are verified by the prototype system. Experimental results show that the proposed method can obviously improve the efficiency of the query.
【学位授予单位】:广西师范大学
【学位级别】:硕士
【学位授予年份】:2013
【分类号】:TP391.1;TP333
本文编号:2363513
[Abstract]:Nowadays, with the rapid increase of network information resources, how to find valuable information from massive and unorganized Web data has become an important problem. By extending the current Web to allow the representation and processing of semantic web information, the semantic Web provides a formal meaning for Web information, making it possible to share and reuse data across applications, groups and enterprises. As an extensive data structure of semantic web, RDF has realized the semantic description of information resources on Web, which is the basis of semantic Web. With the continuous expansion of the scope of RDF application, the traditional database management system can no longer meet the increasing needs of people, so the research of RDF metadata query and storage is becoming more and more important. In this paper, we do some research on RDF metadata query and storage under this background. The main work is as follows: firstly, this paper introduces the current situation of RDF metadata query and storage, and the concept background, including the semantic Web related concept standards, semantic Web seven-tier architecture, metadata, XML,RDF,. Ontology technology and so on, for the following RDF query and storage research laid the foundation. Secondly, the existing storage and query technologies are briefly summarized, the current classic RDF data query and storage techniques are analyzed, and the W3C recommended query language SPARQL is emphatically analyzed. Then, the paper analyzes the reason for the inefficiency of query is that most of the existing storage technologies have the problem of joining themselves, so this paper refers to the idea of vertical partitioning, and based on the existing three-level index technology, Add index structure to solve the limitation problem of tertiary index structure. The improved index structure enables the query statements to be processed differently under different circumstances, thus improving the query efficiency. This paper also uses the dynamic programming algorithm which can select the optimal computing order to optimize the query, so that the query can choose a better join order, and further improve the query efficiency. Finally, a prototype system is built on the basis of the improved storage scheme and query optimization, and the proposed improved storage scheme and query optimization are verified by the prototype system. Experimental results show that the proposed method can obviously improve the efficiency of the query.
【学位授予单位】:广西师范大学
【学位级别】:硕士
【学位授予年份】:2013
【分类号】:TP391.1;TP333
【参考文献】
相关期刊论文 前1条
1 邓志鸿,唐世渭,张铭,杨冬青,陈捷;Ontology研究综述[J];北京大学学报(自然科学版);2002年05期
,本文编号:2363513
本文链接:https://www.wllwen.com/kejilunwen/jisuanjikexuelunwen/2363513.html