Research and Implementation of a Website Abnormal Change Monitoring System
[Abstract]: With the development of Internet technology and the arrival of the big data era, websites have become a platform for information dissemination and integrated applications for government agencies, enterprises and public institutions, cultural media, scientific research institutions, and financial and securities institutions. Website usage grows year by year, and web page content is large and complex. Website owners are responsible for ensuring the security, authority, and accuracy of website information and for providing correct information and services to the public. However, security threats to websites are becoming increasingly serious, and illegal intrusion and tampering occur frequently. Real-time monitoring and tamper-proofing of websites has become an active research topic in information security, so designing and developing a system that monitors abnormal changes to websites is of great significance to website security. This thesis proposes such a system. It first studies the characteristics of abnormal web page changes and surveys the principles and techniques of existing website content security software; after comparing their advantages and disadvantages, it adopts a monitoring system for abnormal website changes based on the Hadoop platform. The main functions of the system are website file data acquisition, abnormal change detection, and monitoring and alarming. Specifically, the system crawls large volumes of complete website file data, stores the data in HDFS and applies preliminary filtering, then detects the specific content that changed, judges whether each change is legitimate, and manages the abnormal data.
The system uses HDFS, the distributed file system provided by the Hadoop platform, together with the MapReduce distributed computing model to process large volumes of website file data. Crawled website file data is stored in HDFS, and index-based storage speeds up data lookup. Abnormal changes are detected with the MD5 message digest algorithm and an improved text comparison algorithm based on graph theory, and the MapReduce computing model is used to make the detection fast and accurate. To judge illegal links, URLs are resolved to IP addresses and passed through a matching filter. To judge illegal words, Chinese word segmentation is combined with the naive Bayes classification algorithm from data mining to filter out abnormal information. Through system design, implementation, and testing, the system largely meets the functional and performance requirements for monitoring abnormal website changes. In use, the system also ran stably and efficiently without errors.
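The two-stage detection described above (a cheap MD5 digest check first, then a text comparison on the files whose digests differ) can be sketched in Python as follows. This is only an illustration under stated assumptions, not the thesis's implementation: the function names and the in-memory snapshot dictionaries are hypothetical, the standard-library difflib matcher stands in for the improved graph-theory-based comparison algorithm (which the abstract does not specify), and the HDFS storage and MapReduce parallelism are left out.

```python
import difflib
import hashlib


def md5_digest(content: bytes) -> str:
    """Return the hex MD5 digest of a file's raw bytes."""
    return hashlib.md5(content).hexdigest()


def changed_paths(baseline: dict, current: dict) -> list:
    """Stage 1: report paths that were added, removed, or whose digest differs.

    Both arguments are hypothetical snapshots mapping a URL path to the
    crawled file bytes; a real deployment would read them from storage.
    """
    changed = []
    for path in sorted(set(baseline) | set(current)):
        if path not in baseline or path not in current:
            changed.append(path)  # file added or removed
        elif md5_digest(baseline[path]) != md5_digest(current[path]):
            changed.append(path)  # content modified
    return changed


def line_changes(old_text: str, new_text: str):
    """Stage 2: return (removed_lines, added_lines) for one changed file.

    Assumption: difflib.SequenceMatcher is a stand-in for the thesis's
    improved text comparison algorithm, which is not reproduced here.
    """
    old_lines = old_text.splitlines()
    new_lines = new_text.splitlines()
    matcher = difflib.SequenceMatcher(a=old_lines, b=new_lines, autojunk=False)
    removed, added = [], []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag in ("replace", "delete"):
            removed.extend(old_lines[i1:i2])
        if tag in ("replace", "insert"):
            added.extend(new_lines[j1:j2])
    return removed, added


if __name__ == "__main__":
    baseline = {
        "/index.html": b"<h1>Home</h1>\n<p>Welcome</p>\n",
        "/about.html": b"<p>About</p>\n",
    }
    current = {
        "/index.html": b"<h1>Home</h1>\n<p>Hacked</p>\n",
        "/about.html": b"<p>About</p>\n",
    }
    for path in changed_paths(baseline, current):
        old = baseline.get(path, b"").decode("utf-8")
        new = current.get(path, b"").decode("utf-8")
        print(path, line_changes(old, new))
```

Comparing digests first keeps the more expensive line-level comparison limited to files that actually changed, which is presumably why the two techniques are paired. The legitimacy judgment of each change (illegal links and illegal words) would be a separate step applied to the added lines.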
[Degree-granting institution]: Liaoning University
[Degree level]: Master's
[Year degree conferred]: 2017
[Classification number]: TP393.092