面向手机信息的垂直搜索引擎
发布时间:2018-01-18 15:37
本文关键词:面向手机信息的垂直搜索引擎 出处:《西安工业大学》2012年硕士论文 论文类型:学位论文
更多相关文章: 垂直搜索引擎 Heritrix Lucene HTMLParser Rose Jade
【摘要】:随着Internet的发展,互联网上的信息越来越丰富。互联网中包括了人们生活中方方面面的信息,面对如此丰富多彩、错综复杂的信息,人们如何得到自己想要的信息呢? 搜索引擎的出现大大地提高了人们的检索能力,但是,随着信息量的不断增长,通用搜索引擎的局限性逐渐显现。主要表现为检索结果准确率低,相关度低,广告信息泛滥等。为了解决通用搜索引擎的不足,从2006年开始,垂直搜索引擎开始兴起,并取得了蓬勃的发展。 垂直搜索引擎是针对某一个行业的专业搜索引擎。它的特点就是“专、精、深”,具有很强的行业色彩,与通用搜索引擎相比垂直搜索引擎则显得更加专注、具体和深入。 本文针对用户检索手机信息的需求,首先介绍了垂直搜索引擎的相关知识。然后,提出了面向手机信息的垂直搜索引擎的系统结构,并介绍了每个模块的功能、知识背景和具体实现。最后,对设计的搜索引擎进行了测试,对本文的研究内容进行了总结,并提出了下一步的工作。本文的主要研究内容包括。 (1)搜索引擎相关知识。 (2)面向手机信息的垂直搜索引擎的系统结构及其实现。 (3)网络爬虫框架Heritrix。 (4)全文检索工具Lucene。 (5)网页分析工具HTMLparser。 (6)Web开发框架Rose, ORM框架Jade。
[Abstract]:With the development of Internet, the information on the Internet is more and more abundant. The Internet includes all aspects of information in people's lives, facing such colorful and intricate information. How do people get the information they want? The appearance of search engine has greatly improved the retrieval ability of people, but with the increasing of information, the limitations of general search engine gradually appear, mainly for the low accuracy of retrieval results and low correlation. In order to solve the deficiency of general search engine, since 2006, vertical search engine has started to rise, and has made vigorous development. Vertical search engine is a professional search engine for a certain industry. Its characteristics are "specialized, fine, deep", with a strong industry color, compared with the general search engine vertical search engine is more focused. Concrete and in-depth. Aiming at the demand of users to retrieve mobile phone information, this paper first introduces the related knowledge of vertical search engine. Then, it puts forward the system structure of vertical search engine for mobile phone information. And introduced the function of each module, knowledge background and specific implementation. Finally, the designed search engine is tested, and the research content of this paper is summarized. The main contents of this paper are as follows. Knowledge of search engines. The system structure and implementation of vertical search engine for mobile phone information. Heritrix. Lucene. HTML parser. A web development framework named Rose, a ORM framework named Jade.
【学位授予单位】:西安工业大学
【学位级别】:硕士
【学位授予年份】:2012
【分类号】:TP391.3
【参考文献】
相关博士学位论文 前1条
1 刘东飞;智能双语搜索方法及搜索引擎的研究[D];武汉理工大学;2009年
,本文编号:1441510
本文链接:https://www.wllwen.com/kejilunwen/sousuoyinqinglunwen/1441510.html