Research on an Ontology-Based Vertical Search Engine Model
Published: 2018-07-24 17:11
[Abstract]: The Internet is developing rapidly, and the volume and variety of information resources it holds are growing quickly. As the world becomes increasingly information-driven and digital, search engines have become an important means of obtaining online information. General-purpose search engines, however, increasingly struggle to find the pages a user needs quickly and accurately, which has made the emergence of vertical search engines inevitable.
Compared with general-purpose search engines, vertical search engines can offer more specialized, more precise, and deeper retrieval services. The key technologies of the two are nonetheless very similar; the main difference is whether structured extraction is performed during web page information extraction to produce structured data. So although vertical search engines improve retrieval results to some extent, they still rely on keyword matching and cannot satisfy users' need for semantic retrieval. As ontology technology has been applied more widely across fields, and to serve users with specific needs in particular domains, research on ontology-based vertical search engines has emerged.
This thesis introduces and studies the theory, design ideas, and implementation techniques of ontologies and vertical search engines, with the aim of improving the recall and precision of search by combining an ontology model with a vertical search engine. The main work is the construction of a domain ontology and the design and implementation of a simple model.
Building on this theoretical study, the ontology construction tool Protégé 4.0 was used to build a cinema domain ontology, and a model of an ontology-based vertical search engine was analyzed and designed. The overall design is modular: the vertical search engine is divided into an information crawling subsystem, an information preprocessing subsystem, an indexing subsystem, and a retrieval subsystem, each relatively independent of the others.
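The four-subsystem modular design described in the abstract can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not taken from the thesis: the function names, the `key: value` page format, the in-memory "crawl", and the toy `ONTOLOGY` dictionary standing in for a domain ontology built in Protégé. The retrieval step expands query terms with related ontology terms, which is one common way an ontology can raise recall; the thesis may combine the two differently.

```python
from collections import defaultdict
from dataclasses import dataclass

# Hypothetical toy ontology: term -> related terms (e.g. synonyms or subclasses).
ONTOLOGY = {"film": {"movie"}, "cinema": {"theater"}}


@dataclass
class Page:
    url: str
    raw: str


def crawl(seed_pages):
    """Crawling subsystem (stub): reads pages from an in-memory dict, not the web."""
    return [Page(url, raw) for url, raw in seed_pages.items()]


def preprocess(page):
    """Preprocessing subsystem: structured extraction from assumed 'key: value' lines."""
    record = {"url": page.url}
    for line in page.raw.splitlines():
        if ":" in line:
            key, value = line.split(":", 1)
            record[key.strip().lower()] = value.strip()
    return record


def build_index(records):
    """Indexing subsystem: inverted index from token to the set of record ids."""
    index = defaultdict(set)
    for doc_id, record in enumerate(records):
        for key, value in record.items():
            if key == "url":
                continue
            for token in value.lower().split():
                index[token].add(doc_id)
    return index


def search(query, index, records, ontology=ONTOLOGY):
    """Retrieval subsystem: expand query terms via the ontology, then look them up."""
    terms = set(query.lower().split())
    for term in list(terms):
        terms |= ontology.get(term, set())
    hits = set()
    for term in terms:
        hits |= index.get(term, set())
    return [records[i]["url"] for i in sorted(hits)]


def run_pipeline(seed_pages, query):
    """Wire the four independent subsystems together in order."""
    pages = crawl(seed_pages)
    records = [preprocess(p) for p in pages]
    index = build_index(records)
    return search(query, index, records)
```

Because each subsystem only consumes the previous one's output, any of them could be swapped out (for example, a real crawler in place of `crawl`) without touching the others, which is the point of the modular design.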
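[Degree-granting institution]: Northeast Normal University
[Degree level]: Master's
[Year degree conferred]: 2013
[Classification number]: G254.336
Document ID: 2142073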
Link to this article: https://www.wllwen.com/kejilunwen/sousuoyinqinglunwen/2142073.html