当前位置:主页 > 科技论文 > 搜索引擎论文 >

分布式搜索引擎的设计与实现

发布时间:2018-03-26 19:23

  本文选题:分布式 切入点:搜索引擎 出处:《华东师范大学》2008年硕士论文


【摘要】: 目前,网络上存在大量的资源共享服务器,这些服务器一般存储了一定量的资源,并以web服务的方式供用户和其它服务器访问。但是随着服务器分布越来越广泛,信息量也会越来越丰富,并且不同服务器之间信息组织形式也趋向多样化,用户难以快速、准确的检索到自己需要的资源,因此设计一个良好的分布式搜索引擎将是搜索引擎能否面相未来的关键因素。 在本文中,我们首先结合当前分布式搜索引擎的研究现状,深入介绍了ajax、xml等相关技术,并对分布式搜索引擎的开发可行性和应用前景进行了研究分析。根掘这些分析结果对系统进行了概要设计,并将其分为十个功能模块--客户端验证功能模块、系统检索代理功能模块、资源预览功能模块、(XML型)服务器资源检索功能模块、(SQL型)本地服务器资源检索功能模块、高级搜索功能处理用户检索信息功能模块、检索精度(任意关键字检索)功能模块、后台管理(登陆实现)功能模块、后台管理添加资源信息功能模块、后台管理服务器注册与注销功能模块。在详细设计过程中介绍了每一个模块的功能,优点以及相关算法。本文最后详细介绍了系统的使用与测试过程。 总体上,本文论述了一种分布式搜索引擎的设计方法。经验证,所实现的分布式搜索引擎具有良好的可用性,解决了因当前服务器信息量逐渐增多,信息组织形式多样化而导致的用户难以快速、准确的检索到自己需要的资源的问题。
[Abstract]:At present, there are a large number of resource sharing servers on the network, which generally store a certain amount of resources and are accessed by users and other servers in the form of web services. The amount of information will become more and more abundant, and the forms of information organization between different servers will also tend to be diversified. It is difficult for users to quickly and accurately retrieve the resources they need. Therefore, the design of a good distributed search engine will be a key factor for the future of search engines. In this paper, we first introduce the relevant technologies, such as ajaxer XML, in combination with the current research status of distributed search engine. The feasibility and application prospect of distributed search engine are studied and analyzed. System retrieval agent function module, resource preview function module / XML) server resource retrieval function module / SQL) local server resource retrieval function module, advanced search function processing user retrieval information function module, Retrieval accuracy (any keyword retrieval) function module, background management (login implementation) function module, background management add resource information function module, In the process of detailed design, the functions, advantages and related algorithms of each module are introduced. Finally, the use and testing process of the system are introduced in detail. As a whole, this paper discusses a design method of distributed search engine. It is proved that the distributed search engine has good usability, which solves the problem that the amount of server information is increasing gradually. The diversity of information organization results in the problem that users can not retrieve the resources they need quickly and accurately.
【学位授予单位】:华东师范大学
【学位级别】:硕士
【学位授予年份】:2008
【分类号】:TP391.3

【引证文献】

相关期刊论文 前1条

1 王俊生;施运梅;张仰森;;基于Hadoop的分布式搜索引擎关键技术[J];北京信息科技大学学报(自然科学版);2011年04期

相关硕士学位论文 前1条

1 龚秋艳;并行网络爬虫设计与实现[D];华东师范大学;2010年



本文编号:1669260

资料下载
论文发表

本文链接:https://www.wllwen.com/kejilunwen/sousuoyinqinglunwen/1669260.html


Copyright(c)文论论文网All Rights Reserved | 网站地图 |

版权申明:资料由用户c7f25***提供,本站仅收录摘要或目录,作者需要删除请E-mail邮箱bigeng88@qq.com