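Design of a Personalized Push Search Engine in a Cloud Environment
Published: 2018-02-21 04:47
Keywords: personalization; push search; cloud computing; personalized search; topic-relevant search algorithm. Source: Beijing University of Posts and Telecommunications, master's thesis, 2012. Document type: dissertation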
[Abstract]: With the development and spread of Internet technology, a vast amount of information is delivered through websites to serve every area of the economy, society, and daily life. From 2001 to 2011, however, the volume of data on the Internet grew from 10,000 PB to 100 million PB, and quickly finding the information a user needs in this sea of data has become a pressing need for all Internet users. A few students from Stanford University made a major contribution here: the arrival of the Google search engine rapidly changed how people use the Internet. As the Internet has continued to develop, and distributed cloud computing in particular has matured, the traditional all-encompassing search engine can no longer meet users' needs. Driven by this practical demand, personalized push search services based on cloud computing have emerged. Push search is a specialized search engine aimed at a particular need or a particular group of users. It subdivides and extends the traditional search engine by classifying and refining the category information in the web page repository; in other words, it brings industry-level specialization to search together with precise targeting and segmentation of users. A push service, for instance, is one in which the search engine records and analyzes the user's online behavior and builds a multidimensional learning model. Based on this user model, when the user connects to the Internet, the push search engine can filter the information the user needs directly out of the mass of available information. Everything the user accesses on the Internet is therefore customized to that user's personal model and supplied by the push search engine.
This work stems from a cooperative project with a telecom operator and mainly accomplishes the following:
(1) It analyzes the current state of search engines, push search engines in particular, and of cloud computing, sets out the advantages and prospects of the related technologies, and introduces the working principle and workflow of the system.
(2) Following the development trend of the mobile Internet in the telecom industry, it improves the design approach to information search and, to meet the mobile Internet's higher requirements for the accuracy and validity of information, introduces a base keyword lexicon and a basic expansion of it.
(3) Drawing on the large storage and computing capacity of a cloud computing architecture, it designs and implements a full-text search engine system based on web page data, providing web page word-segmentation statistics, a personalized user model, web page de-homogenization (removal of near-identical pages), and related functions.
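The abstract describes the push mechanism only at a high level: record user behavior, build a user model, then filter pages against it. As a minimal sketch of that idea, and not the thesis's actual method, the Python below assumes the user model is a plain term-frequency profile built from already-segmented pages and that filtering uses cosine similarity with a fixed threshold. The function names `build_profile`, `cosine`, and `push_filter`, and the threshold value, are hypothetical choices made for illustration.

```python
import math
from collections import Counter


def build_profile(visited_pages):
    """Aggregate term counts over the token lists of pages the user visited.

    Assumption: pages are already word-segmented into lists of tokens.
    """
    profile = Counter()
    for tokens in visited_pages:
        profile.update(tokens)
    return profile


def cosine(a, b):
    """Cosine similarity between two term-count mappings (0.0 if either is empty)."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def push_filter(profile, candidates, threshold=0.2):
    """Return ids of candidate pages whose similarity to the profile meets the
    threshold, ordered from most to least similar.

    `candidates` maps a page id to its token list; the threshold is an
    illustrative value, not one taken from the thesis.
    """
    scored = [
        (cosine(profile, Counter(tokens)), page_id)
        for page_id, tokens in candidates.items()
    ]
    return [page_id for score, page_id in sorted(scored, reverse=True)
            if score >= threshold]


# Hypothetical usage: two visited pages form the profile, two pages are candidates.
history = [["cloud", "search", "engine"], ["search", "index", "cloud"]]
profile = build_profile(history)
candidates = {
    "a": ["cloud", "search", "ranking"],
    "b": ["football", "score"],
}
pushed = push_filter(profile, candidates)
```

A real system along the lines the abstract describes would add more dimensions to the model (for example time or location) and would run the scoring on distributed infrastructure. The sketch keeps a single dimension so that the filtering step stays visible.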
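[Degree-granting institution]: Beijing University of Posts and Telecommunications
[Degree level]: Master's
[Year degree awarded]: 2012
[Classification number]: TP391.3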
Article link: https://www.wllwen.com/kejilunwen/sousuoyinqinglunwen/1521037.html