当前位置:主页 > 科技论文 > 搜索引擎论文 >

基于Lucene的全文检索系统的设计与实现

发布时间:2018-05-11 22:29

  本文选题:Lucene + 全文搜索 ; 参考:《厦门大学》2014年硕士论文


【摘要】:二十世纪九十年代开始,计算机技术和互联网技术获得了巨大的发展,随着计算机以及互联网技术的大规模普及应用,人们所接触到的信息量也呈现指数级的增长,信息量的增大迫使人们必须想出各种方法来快速获得所需要的有用信息,为此,人们发明了各式各样的信息查找技术,但是,如何才能快速高效地完成信息的存储以及查找操作呢,这是非常值得国内外读者去研究的课题。 当前,搜索引擎已经成为信息网络化时代最主流的技术之一,作为搜索引擎核心的技术,全文检索(Full-text Retrieval)是指使用自然语言进行检索,基于全文索引并以文本数据为主要处理对象的检索技术。全文检索与普通的数据库检索设计不太一致,前者需要处理包括结构化数据以及非结构化数据,而后者只能处理结构化数据,所以,比起普通的数据库检索,全文检索具有更强大的功能,更容易满足用户的需求。 论文主要是探讨艺术学院办公系统的全文检索模块,全文检索的基本要求就是能够实现对公文内容,通知公告,内部新闻等文本信息进行内容检索。系统基于J2EE体系架构进行开发,采用SSH2项目开发技术架构,使用MYSQL数据库系统。 本文先论述相关技术,从搜索引擎的原理、组成、数据结构、工作流程等方面做深入细致地研究分析,然后根据项目的实际需求,以Lucene工具库为基础,设计并且实现一个基于全文检索的站内搜索引擎系统,为用户提供更为方便的搜索功能。
[Abstract]:Since the 1990s, computer technology and Internet technology have gained tremendous development. With the large-scale popularization and application of computer and Internet technology, the amount of information that people come into contact with has also increased exponentially. The increasing amount of information has forced people to come up with ways to get the useful information they need quickly. For this reason, people have invented various information lookup techniques, but, How to quickly and efficiently complete the information storage and search operation, this is a very worthy of domestic and foreign readers to study the subject. At present, search engine has become one of the most popular technologies in the era of information networking. As the core technology of search engine, Full-text Retrieval (Full-text Retrieval) refers to the use of natural language for retrieval. Retrieval technology based on full-text index and taking text data as main processing object. Full-text retrieval is not exactly the same as the common database retrieval design, which involves both structured and unstructured data, while the latter can only handle structured data, so, compared to ordinary database retrieval, Full-text retrieval has more powerful functions and is easier to meet the needs of users. This paper mainly discusses the full-text retrieval module of the office system of the College of Art. The basic requirement of full-text retrieval is to achieve the content retrieval of official document content, notice announcement, internal news and other text information. The system is developed on the basis of J2EE architecture, SSH2 project development technology framework and MYSQL database system. This article first discusses the related technology, from the search engine principle, the constitution, the data structure, the work flow and so on aspect makes the thorough detailed research and analysis, then according to the project actual demand, takes the Lucene tool library as the foundation, A web search engine system based on full-text search is designed and implemented to provide users with more convenient search functions.
【学位授予单位】:厦门大学
【学位级别】:硕士
【学位授予年份】:2014
【分类号】:TP391.3

【参考文献】

相关期刊论文 前3条

1 刘宁;陆荣国;缪万胜;;MVC体系架构从模式到框架的持续抽象进化[J];计算机工程;2008年04期

2 曹强;;基于Lucene的Web站点站内全文检索系统的设计与实现[J];图书情报工作;2007年09期

3 曹大有;王瑜;;基于MyEclipse的Hibernate持久层框架的开发过程[J];计算机系统应用;2007年12期



本文编号:1875905

资料下载
论文发表

本文链接:https://www.wllwen.com/kejilunwen/sousuoyinqinglunwen/1875905.html


Copyright(c)文论论文网All Rights Reserved | 网站地图 |

版权申明:资料由用户c4320***提供,本站仅收录摘要或目录,作者需要删除请E-mail邮箱bigeng88@qq.com