基于P2P全文检索系统的设计与实现
发布时间:2018-05-28 00:38
本文选题:C/S + P2P ; 参考:《吉林大学》2013年硕士论文
【摘要】:P2P技术作为一种新兴的网络模型,占据了互联网业务总量的百分之六十以上,被人们称为宽带互联网应用的“杀手级”技术。P2P技术与传统的客户端/服务器模型相对比,在消除服务器瓶颈以及网络资源利用率等方面优势较为明显。 JXTA是Sun微系统对等网络(P2P)的标准,供P2P程序所需的基础服务。该技术致力于创建一个通用的平台,以简单而有效的方式构建特定的对等式和分布式服务与应用。使得开发者不需要过多考虑如何解决对等计算的技术问题,而可以专注于如何实现与完善可扩展、互操作性强且具有高可用性的高层应用。 Lucene是一个开源的项目,提供了相对完整的文本检索的功能,Apache开发此项目的目的就在于为程序开发者设计一个即容易理解,又功能完备的检索工具。以此为基础,开发人员可以快速实现全文检索的功能。 本文首先介绍了搜索引擎的发展趋势、关键技术,在此基础上对传统搜索引擎面临的挑战以及基于P2P的搜索引擎的优势进行了深入的分析。接着本文对P2P搜索引擎的关键技术——JXTA技术,,Lucene检索工具进行了研究,包括JXTA的基本概念及协议规范,然后对Lucene技术做了研究,包括其特点,以及一些实用类进行详尽的研究介绍。本文接着介绍了请求路由算法的基本思想,研究论述了基于k-高频词主题相关搜索路由算法。本文建立了基于P2P的全文检索引擎系统原型,描述系统工作流程,详细介绍各模块的具体设计。本文在对相关只是进行详尽研究的基础上编程实现基于P2P的文献检索系统各模块功能,并对系统功能进行了测试。最后本文对本课题的研究进行了总结,并对未来的研究做了简单的展望。
[Abstract]:As a new network model, P2P technology accounts for more than 60% of the total amount of Internet services. It is called "killer level" technology of broadband Internet application. P2P technology is compared with the traditional client / server model. In the elimination of server bottlenecks and network resource utilization and other aspects of the advantages are obvious. JXTA is the standard of Sun Peer-to-Peer Network (P2P), which provides the basic services for P2P programs. The technology is dedicated to creating a common platform for building specific peer-to-peer and distributed services and applications in a simple and efficient manner. So that developers do not need to think too much about how to solve the technical problems of peer-to-peer computing, but can focus on how to implement and improve the extensible, interoperability and high availability of high-level applications. Lucene is an open source project that provides a relatively complete function of text retrieval. On this basis, developers can quickly achieve full-text retrieval function. This paper first introduces the development trend and key technologies of search engines, and then analyzes the challenges faced by traditional search engines and the advantages of P2P based search engines. Then, this paper studies the key technology of P2P search engine, JXTA technology and Lucene retrieval tool, including the basic concept and protocol specification of JXTA, and then makes a research on Lucene technology, including its characteristics. As well as some practical classes to carry on the detailed research introduction. Then this paper introduces the basic idea of request routing algorithm and discusses the search routing algorithm based on k- high frequency word topic correlation. In this paper, the prototype of P2P based full-text search engine is established, the workflow of the system is described, and the specific design of each module is introduced in detail. Based on the detailed study of the correlation, this paper implements the function of each module of the document retrieval system based on P2P, and tests the function of the system. Finally, this paper summarizes the research of this topic, and makes a simple prospect for the future research.
【学位授予单位】:吉林大学
【学位级别】:硕士
【学位授予年份】:2013
【分类号】:TP391.3
【参考文献】
相关期刊论文 前10条
1 程立考;李绍静;;对等网络的研究与应用[J];电脑与信息技术;2006年04期
2 罗峰;;基于网络编码的P2P网络系统研究[J];电视技术;2007年02期
3 石友康;;P2P技术业务模式与安全问题探讨[J];电信网技术;2007年03期
4 曾楚轩;;P2P应用技术发展浅析[J];电信网技术;2007年03期
5 孙力;陈兰;袁媛;;基于节点兴趣的非结构化P2P搜索机制[J];计算机工程;2009年23期
6 戴明坚;张大方;;书面汉语自动分词技术与实现[J];计算技术与自动化;1990年03期
7 盛明超;张代远;;纯P2P在私网中的应用[J];计算机时代;2008年05期
8 庄雷;常玉存;董西广;;一种P2P文件共享系统中的激励机制[J];计算机应用研究;2009年01期
9 叶剑虹;孙世新;张运生;周益民;;基于P2P的自组织网络路由算法研究[J];计算机应用研究;2009年01期
10 黄昌宁,张小凤;自然语言处理技术的三个里程碑[J];外语教学与研究;2002年03期
本文编号:1944503
本文链接:https://www.wllwen.com/kejilunwen/sousuoyinqinglunwen/1944503.html