当前位置:主页 > 科技论文 > 搜索引擎论文 >

机票票价预测系统设计与实现

发布时间:2018-09-05 11:38
【摘要】:随着社会的高速发展,人们的生活水平不断提高。在人们日常的商务活动或者相关的日常出行中,飞机因为其便捷速度,使得乘坐飞机出行已经成为越来越多用户的第一选择。同时,互联网业务的高速发展,也使得人们无论何时何地,只要在联网的情况下可以方便的从互联网在预定到出行的飞机票。很多的飞机乘客可能认为飞机票越早所能得到的票价优惠最高,其实真的是这样么。实际上在很多时候,也许是越靠近飞机起飞的时候,机票价格越低。 为了帮助客户在平时的生产生活中可以以最低的价格预定到需要乘坐的航班,我们开发了这套机票预测系统。这套系统的原理是,目前提供机票预定的网站很多,除了相关的航空公司官网之外,几乎所有的和旅行相关的网站都提供了这项业务。那么我们通过网络爬虫,定期的从这些网站上获取相关的机票价格,对这些机票的价格进行统计分析,通过这样巨大的数据量以及之前的机票价格情况,来预测之后的相关机票价格情况,之后将这些情况返回给用户,帮助用户对购买机票价格的时间进行选择。 由于我们只针对于机票价格这样一个特定的领域,所以,类似于百度、Google这类通用搜索引擎并不能很好的满足我们的要求,我们需要更加专业,更加具有针对性精确性的搜索方案,这就是垂直搜索技术。同时,由于我们这个系统的设计的数据量巨大,我们采用了当前主流的HBase分布式数据库,作为后台的支持数据库,为我们提供数据处理功能。在论文系统的开发之前,作为系统的主要开发人员,首先对本系统需要满足的用户需求以及最后需要具有的必要功能进行了系统的分析,有明确的系统目标。之后对本系统进行了体系结构设计,功能结构设计,数据库设计,详细设计等。本系统使用Netbeans集成开发环境,采用Java作为开发语言,应用B/S架构,对各模块功能进行了逐步的编码实现。 通过对此机票预测系统的实现,对当今的大数据处理模式也有了进一步的体会与认识,通过实践,实现了之前的目标。满足了开发初衷,可以为用户之后的订票时间提供帮助,具有较大的现实意义。
[Abstract]:With the rapid development of society, people's living standard is improving constantly. In people's daily business activities or related daily travel, airplane travel has become the first choice for more and more users because of its convenient speed. At the same time, with the rapid development of Internet services, people can easily book to travel air tickets from the Internet whenever and wherever they are connected. Many airline passengers may think that the earlier the plane ticket, the higher the fare concession. Is that really the case?. In fact, in many cases, perhaps the closer the plane takes off, the lower the ticket price. In order to help customers to book the required flights at the lowest price in their daily life, we have developed the ticket prediction system. The rationale for the system is that there are many websites that currently offer airline reservations, and nearly all travel-related websites offer the service, except for the airline's official website. Well, through the web crawler, we regularly obtain the relevant ticket prices from these websites, and make a statistical analysis of the price of these tickets, through such a huge amount of data and the price situation of the previous tickets, To predict the relevant ticket prices, and then return them to the user to help them choose when to buy the ticket price. Since we only focus on a specific area of ticket pricing, general search engines such as Baidu and Google are not good enough to meet our requirements. We need to be more professional. More targeted and accurate search scheme, this is the vertical search technology. At the same time, because of the huge amount of data designed by our system, we use the current mainstream HBase distributed database as the backstage support database to provide us with data processing functions. Before the development of the system, as the main developer of the system, the system needs to meet the needs of users and the final need to have the necessary functions for a system analysis, with clear system goals. Then the system architecture design, functional structure design, database design, detailed design and so on. The system uses Netbeans integrated development environment, Java as the development language, and the B / S architecture. The functions of each module are coded step by step. Through the realization of this air ticket prediction system, the author also has a further understanding and understanding of big data's handling mode. Through practice, the former goal has been realized. It satisfies the original intention of the development and can provide help for the booking time after the user. It is of great practical significance.
【学位授予单位】:大连理工大学
【学位级别】:硕士
【学位授予年份】:2013
【分类号】:TP311.52

【参考文献】

相关期刊论文 前10条

1 张斌;周尔宁;;基于Nutch的分布式纺织垂直搜索引擎研究[J];电脑知识与技术;2009年21期

2 杨丽萍;;网页正文提取技术的分析与研究[J];计算机光盘软件与应用;2012年22期

3 张健沛,刘洋,杨静,代坤;搜索引擎结果聚类算法研究[J];计算机工程;2004年05期

4 强士卿;程光;;基于流的哈希函数比较分析研究[J];南京师范大学学报(工程技术版);2008年04期

5 严良达;;基于Lucene搜索引擎的设计与实现[J];宁波职业技术学院学报;2009年02期

6 李晓明,凤旺森;两种对URL的散列效果很好的函数[J];软件学报;2004年02期

7 林子雨;赖永炫;林琛;谢怡;邹权;;云数据库研究[J];软件学报;2012年05期

8 丁振国;吴宝贵;辛友强;;基于Bloom Filter的大规模网页去重策略研究[J];现代图书情报技术;2008年03期

9 孙皓;董守斌;;基于标签密度的自适应正文提取方法[J];郑州大学学报(理学版);2009年01期

10 张俊;李鲁群;周熔;;基于Lucene的搜索引擎的研究与应用[J];计算机技术与发展;2013年06期



本文编号:2224154

资料下载
论文发表

本文链接:https://www.wllwen.com/kejilunwen/sousuoyinqinglunwen/2224154.html


Copyright(c)文论论文网All Rights Reserved | 网站地图 |

版权申明:资料由用户39fe7***提供,本站仅收录摘要或目录,作者需要删除请E-mail邮箱bigeng88@qq.com