基于关联规则算法的旅游推荐研究
发布时间:2018-09-08 10:45
【摘要】:随着移动互联网的快速发展和普及应用,传统的旅游信息服务已经不能满足大众的需求,旅游产业作为国家十二五规划的重点产业之一,“智慧旅游”的概念应运而生,它往往体现在旅游服务、管理和营销的智慧。然而这三方面最被关心的就是旅游信息服务的智慧,如何做到给用户智慧的旅游服务,这是我们将要面临的挑战。如今,大数据也是热点问题,旅游数据符合大数据的特点,大量的旅游数据杂乱无章、数据巨大,如何处理和利用好这些数据,传统的方法已经不能够解决问题,必须采用新的技术平台和方法。目前通常采用的方法是利用新的云计算和物联网技术,来处理旅游业中的旅游服务问题,将旅游产业中的旅游服务问题、数据处理问题和资源整合问题在新的大数据环境下利用数据挖掘的技术手段来为旅游管理者提供指导,并为大众服务提供一种全新的旅游形态和方法。在大数据时代,Hadoop平台作为优秀的分布式处理平台,给大数据的处理和存储提供了可能性,利用它特有的编程模型和数据存储方式,可以将传统的旅游信息服务移植到云计算环境下进行,通过编写相应的数据挖掘算法,可以从大量的旅游数据中获取有效的信息,针对用户提供更加满意、高效的服务。本文从知名旅游网站百度旅游和马蜂窝网站中获取游客的属性数据,利用改进的数据挖掘算法,在对数据分析挖掘处理之后,针对具体的游客提供旅游信息服务。本文具体的研究工作如下:(1)研究Hadoop的工作原理,深入学习MapReduce的编程模型和运行流程和HDFS工作原理和存储原理,针对问题,编写对应的MapReduce JOB任务程序,为后续的算法改进打下了基础。(2)深入学习传统的关联规则算法,并且分析了算法的优缺点,在现有算法的基础上,提出了基于项合并剪枝的关联规则算法和基于频繁闭项集邻接图的关联规则算法,前者解决了在挖掘过程中的重复挖掘问题,后者减少冗余规则的生成问题,最终的实验结果表明,改进后的算法可以保证挖掘精准度的同时,提高算法的运行和挖掘效率。(3)从知名旅游网站百度旅游和马蜂窝网站获取游客的旅游数据,研究了数据的获取和处理方法,针对具体问题对数据进行了清洗和预处理工作,转化为可以处理的事务数据,为后续的实验做好基础。(4)最后,我们利用(3)中获取到的游客的旅游数据,应用改进后的算法进行挖掘分析,在游客的景点推荐和吃住行混合推荐方面给出了一种可行的方法,符合具体的实际情况。
[Abstract]:With the rapid development and popularization of the mobile Internet, the traditional tourism information service has not been able to meet the needs of the public. As one of the key industries in the 12th Five-Year Plan, the concept of "intelligent tourism" has emerged as the times require. It is often reflected in travel services, management and marketing wisdom. However, these three aspects are most concerned about the wisdom of tourism information services, how to provide intelligent travel services to users, which is the challenge we will face. Nowadays, big data is also a hot issue. Tourism data conform to the characteristics of big data. A large amount of tourism data is messy and huge. How to deal with and make good use of these data, the traditional methods can no longer solve the problem. New technology platforms and methods must be adopted. The current approach is to use new cloud computing and Internet of things technologies to deal with tourism services in the tourism industry, and tourism services in the tourism industry. Data processing problem and Resource Integration problem in the new environment of big data, the technology of data mining is used to provide guidance for tourism managers, and to provide a new form and method of tourism for mass service. In big data's time, Hadoop platform, as an excellent distributed processing platform, provided the possibility for the processing and storage of big data, and made use of its unique programming model and data storage method. Traditional tourism information services can be transplanted to cloud computing environment. By writing the corresponding data mining algorithm, we can obtain effective information from a large number of travel data, and provide more satisfied and efficient services for users. In this paper, the attribute data of tourists are obtained from well-known tourist websites Baidu Travel and Horse Honeycomb, and the improved data mining algorithm is used to provide tourist information services for specific tourists after processing the data analysis and mining. The specific research work of this paper is as follows: (1) study the working principle of Hadoop, deeply study the programming model and running flow of MapReduce, the working principle and storage principle of HDFS, and write the corresponding MapReduce JOB task program for the problem. It lays a foundation for further improvement of the algorithm. (2) deeply study the traditional association rules algorithm, and analyze the advantages and disadvantages of the algorithm, on the basis of the existing algorithms, An association rule algorithm based on item merging and pruning is proposed, and an association rule algorithm based on frequent closed itemsets adjacent graph is proposed. The former solves the problem of repeated mining in mining process, and the latter reduces the generation of redundant rules. The final experimental results show that the improved algorithm can ensure the accuracy of mining and improve the efficiency and efficiency of the algorithm. (3) the travel data of tourists are obtained from well-known tourism websites Baidu Travel and Horse Honeycomb. The methods of data acquisition and processing are studied. The data is cleaned and preprocessed for specific problems, which can be converted into transactional data. (4) finally, Using the tourist data obtained in (3), we apply the improved algorithm to mining and analysis, and give a feasible method in the tourist attraction recommendation and the mixed recommendation of eating, lodging and traveling, which is in line with the actual situation.
【学位授予单位】:陕西师范大学
【学位级别】:硕士
【学位授予年份】:2015
【分类号】:TP311.13;TP391.3
本文编号:2230292
[Abstract]:With the rapid development and popularization of the mobile Internet, the traditional tourism information service has not been able to meet the needs of the public. As one of the key industries in the 12th Five-Year Plan, the concept of "intelligent tourism" has emerged as the times require. It is often reflected in travel services, management and marketing wisdom. However, these three aspects are most concerned about the wisdom of tourism information services, how to provide intelligent travel services to users, which is the challenge we will face. Nowadays, big data is also a hot issue. Tourism data conform to the characteristics of big data. A large amount of tourism data is messy and huge. How to deal with and make good use of these data, the traditional methods can no longer solve the problem. New technology platforms and methods must be adopted. The current approach is to use new cloud computing and Internet of things technologies to deal with tourism services in the tourism industry, and tourism services in the tourism industry. Data processing problem and Resource Integration problem in the new environment of big data, the technology of data mining is used to provide guidance for tourism managers, and to provide a new form and method of tourism for mass service. In big data's time, Hadoop platform, as an excellent distributed processing platform, provided the possibility for the processing and storage of big data, and made use of its unique programming model and data storage method. Traditional tourism information services can be transplanted to cloud computing environment. By writing the corresponding data mining algorithm, we can obtain effective information from a large number of travel data, and provide more satisfied and efficient services for users. In this paper, the attribute data of tourists are obtained from well-known tourist websites Baidu Travel and Horse Honeycomb, and the improved data mining algorithm is used to provide tourist information services for specific tourists after processing the data analysis and mining. The specific research work of this paper is as follows: (1) study the working principle of Hadoop, deeply study the programming model and running flow of MapReduce, the working principle and storage principle of HDFS, and write the corresponding MapReduce JOB task program for the problem. It lays a foundation for further improvement of the algorithm. (2) deeply study the traditional association rules algorithm, and analyze the advantages and disadvantages of the algorithm, on the basis of the existing algorithms, An association rule algorithm based on item merging and pruning is proposed, and an association rule algorithm based on frequent closed itemsets adjacent graph is proposed. The former solves the problem of repeated mining in mining process, and the latter reduces the generation of redundant rules. The final experimental results show that the improved algorithm can ensure the accuracy of mining and improve the efficiency and efficiency of the algorithm. (3) the travel data of tourists are obtained from well-known tourism websites Baidu Travel and Horse Honeycomb. The methods of data acquisition and processing are studied. The data is cleaned and preprocessed for specific problems, which can be converted into transactional data. (4) finally, Using the tourist data obtained in (3), we apply the improved algorithm to mining and analysis, and give a feasible method in the tourist attraction recommendation and the mixed recommendation of eating, lodging and traveling, which is in line with the actual situation.
【学位授予单位】:陕西师范大学
【学位级别】:硕士
【学位授予年份】:2015
【分类号】:TP311.13;TP391.3
【参考文献】
相关期刊论文 前3条
1 王端祥;;贵州旅游电子商务现状及发展对策研究[J];贵州师范学院学报;2010年05期
2 石培华;冯凌;;新时期中国旅游研究十大创新方向展望[J];北京第二外国语学院学报;2010年05期
3 何华春,石军南;基于GIS系统的旅游区研究——以武功山风景旅游区为例[J];经济地理;2003年06期
,本文编号:2230292
本文链接:https://www.wllwen.com/guanlilunwen/yingxiaoguanlilunwen/2230292.html