基于Lucene的电话号码智能搜索算法研究及系统实现
发布时间:2019-02-15 01:23
【摘要】:电话发明至今的100多年历史中,其相关技术发展迅速,使在人们通话交流时不断得到更好的体验。在移动通信终端普及的今日,人们通话交流变得更加方便快捷。据调查显示,2014年中国电话普及率(包括移动电话)为112.26部每百人。因为通话交流有沟通更有效,反应时间更快等特点,电话深入人们生活的各个方面,并已经成为人们获取服务信息的一个重要来源。传统的黄页模式已经不能满足移动互联网时代的号码搜索需求,如何使用户获得想要拨打的电话号码成为提升用户电话沟通效率的重要问题。用户搜索商业电话号码往往是对商户提供的服务进行咨询,进而开展下一步的服务,而商户信息在互联网时代呈现爆炸式的增长,如何利用好这些信息,提供更精准的电话号码搜索服务是实现电话号码搜索系统的关键。网页搜索引擎使用的经典搜索排序算法通常基于词频与位置的相关性分析或页面间链接关系的分析来对搜索结果进行排序,在搜索电话号码时这样的搜索排序精度还不够。本文通过对号码搜索的需求进行深入分析,设计了适用于电话号码搜索的排序算法,实现了针对电话号码搜索需求的垂直搜索引擎,为智能终端用户提供更精准电话号码的搜索服务。本文首先阐述了论文的研究背景,介绍了垂直搜索引擎的相关知识,重点分析了全文检索技术。其次对Lucene进行了详细分析,从源码层面分析了索引建立,全文搜索与结果评分等搜索引擎核心实现。接着对PageRank等经典搜索排序算法进行分析后,结合电话号码搜索的需求分析,提出了一种电话号码智能排序算法。基于以上三点,最终设计实现了电话号码垂直搜索系统,实验结果表明,应用号码智能搜索排序算法后的号码垂直搜索系统更准确地提供给用户想要拨打的号码。
[Abstract]:In the more than 100 years since the invention of telephone, the related technology has developed rapidly, which makes people get better experience when they talk to each other. With the popularity of mobile communication terminals, communication becomes more convenient and faster. China's telephone penetration rate, including mobile phones, was 112.26 per 100 people in 2014, according to the survey. Because telephone communication has the characteristics of more effective communication and faster reaction time, telephone has become an important source for people to obtain service information. The traditional yellow page mode can not meet the needs of number search in the era of mobile Internet. How to make users get the number they want to dial has become an important problem to improve the efficiency of telephone communication. Customer search for business phone numbers is often to consult the services provided by merchants, and then to carry out the next step of the service, and business information in the Internet age explosive growth, how to make good use of this information, To provide more accurate telephone number search service is the key to realize the telephone number search system. The classical search sorting algorithms used in web search engines usually sort the search results based on the correlation analysis of word frequency and location or the analysis of the link relationship between pages, but the precision of search sorting is not enough when searching telephone numbers. By analyzing the requirement of the number search, this paper designs a sort algorithm suitable for the telephone number search, and realizes the vertical search engine aiming at the demand of the telephone number search. For intelligent end users to provide more accurate telephone number search services. This paper firstly introduces the research background of the thesis, introduces the knowledge of vertical search engine, and analyzes the technology of full-text search. Secondly, the Lucene is analyzed in detail, and the core implementation of the search engine, such as index building, full-text search and result scoring, is analyzed from the source level. Then, after analyzing the classical search sorting algorithm such as PageRank and combining with the demand analysis of telephone number search, an intelligent sorting algorithm of telephone number is proposed. Based on the above three points, a telephone number vertical search system is designed and implemented. The experimental results show that the number vertical search system based on the intelligent number search sorting algorithm is more accurate to provide the number users want to dial.
【学位授予单位】:北京邮电大学
【学位级别】:硕士
【学位授予年份】:2016
【分类号】:TP391.3
[Abstract]:In the more than 100 years since the invention of telephone, the related technology has developed rapidly, which makes people get better experience when they talk to each other. With the popularity of mobile communication terminals, communication becomes more convenient and faster. China's telephone penetration rate, including mobile phones, was 112.26 per 100 people in 2014, according to the survey. Because telephone communication has the characteristics of more effective communication and faster reaction time, telephone has become an important source for people to obtain service information. The traditional yellow page mode can not meet the needs of number search in the era of mobile Internet. How to make users get the number they want to dial has become an important problem to improve the efficiency of telephone communication. Customer search for business phone numbers is often to consult the services provided by merchants, and then to carry out the next step of the service, and business information in the Internet age explosive growth, how to make good use of this information, To provide more accurate telephone number search service is the key to realize the telephone number search system. The classical search sorting algorithms used in web search engines usually sort the search results based on the correlation analysis of word frequency and location or the analysis of the link relationship between pages, but the precision of search sorting is not enough when searching telephone numbers. By analyzing the requirement of the number search, this paper designs a sort algorithm suitable for the telephone number search, and realizes the vertical search engine aiming at the demand of the telephone number search. For intelligent end users to provide more accurate telephone number search services. This paper firstly introduces the research background of the thesis, introduces the knowledge of vertical search engine, and analyzes the technology of full-text search. Secondly, the Lucene is analyzed in detail, and the core implementation of the search engine, such as index building, full-text search and result scoring, is analyzed from the source level. Then, after analyzing the classical search sorting algorithm such as PageRank and combining with the demand analysis of telephone number search, an intelligent sorting algorithm of telephone number is proposed. Based on the above three points, a telephone number vertical search system is designed and implemented. The experimental results show that the number vertical search system based on the intelligent number search sorting algorithm is more accurate to provide the number users want to dial.
【学位授予单位】:北京邮电大学
【学位级别】:硕士
【学位授予年份】:2016
【分类号】:TP391.3
【参考文献】
相关期刊论文 前10条
1 田晓辉;;面向垂直的搜索引擎的设计[J];福建电脑;2014年11期
2 赵悦阳;崔雷;;HITS算法在文本聚类结果类别描述中的应用尝试[J];情报理论与实践;2013年03期
3 张翼飞;;Heaps定律在中英文文本中的统计验证与分析[J];中国外资;2011年10期
4 王少康;董科军;阎保平;;使用特征文本密度的网页正文提取[J];计算机工程与应用;2010年20期
5 吴文昭;;搜索引擎页面排序融合算法[J];计算机工程与设计;2010年08期
6 索红光;孙鑫;;针对中文检索的Lucene改进策略[J];计算机应用与软件;2009年06期
7 李s,
本文编号:2422812
本文链接:https://www.wllwen.com/kejilunwen/ruanjiangongchenglunwen/2422812.html