HBase数据库的安全索引研究
[Abstract]:With the arrival of big data era, the traditional relational database can not meet the needs of massive data processing in terms of access rate and so on. Therefore, NoSQL database emerges as the times require. Compared with relational database, NoSQL database has shortcomings in security. HBASE distributed database is a kind of NoSQL database with perfect security mechanism, but the data files in HBase database are stored in plain text. This forms a certain security loophole. It is very meaningful to encrypt the data files in HBase database and construct the secure secondary index of ciphertext for HBase database, which makes the HBase ciphertext database have the ability of conditional query. According to the different data types of column data in HBase distributed database, numerical ciphertext retrieval and text ciphertext retrieval can be studied respectively. For encrypted text retrieval of numerical data in HBase distributed database, a search strategy based on improved secure R-tree is proposed. When constructing the R-tree secure index for the data of the data table of HBase, the dynamic characteristics of the R-tree are used to store the RowKey row in the proper location of the index table according to the set encryption policy, according to the column value needed to build the index of the data table. At the same time, the index table IndexTable corresponding to the data table DataTable of HBase is stored in HBase to ensure the use and maintenance of the index table. This method can well support the column equivalent retrieval and range retrieval. A secure inverted index-based retrieval strategy for Chinese text-based data retrieval in HBase distributed database is proposed in this paper. When constructing text-based ciphertext index, we combine the coprocessor framework of HBase, which is commonly used in building secondary index, and Lucene, an open source search engine toolkit, to encrypt word segmentation after standardizing the Column Family/Q ualifer-qualified column in HBase as text content. At the same time, the data integrity and availability are ensured by storing the relevant files in the Lucene database in the HBase database. This method can well support the equivalent query and substring query for columns. The prototype system of ciphertext retrieval based on HBase is designed and implemented, which includes user management module, encryption and decryption module, key management module, ciphertext index generation and retrieval module, and so on. The security index of HBASE makes full use of the existing fast retrieval ability of HBase, and the current research results still have some limitations on the support of the diversity of ciphertext data query, and the security index of HBASE has made full use of the fast retrieval ability of the existing HBASE. This direction still has great research space and significance.
【学位授予单位】:华中科技大学
【学位级别】:硕士
【学位授予年份】:2016
【分类号】:TP311.13;TP309
【参考文献】
相关期刊论文 前10条
1 李文成;赵逢禹;;企业云存储数据的加密与密文全文检索研究[J];小型微型计算机系统;2013年02期
2 曹夕;许力;陈兰香;;云存储系统中数据完整性验证协议[J];计算机应用;2012年01期
3 邹燕飞;于成尊;赵亮;;基于Lucene的文本搜索引擎的设计和实现[J];计算机与现代化;2011年09期
4 卢冬海;何先波;;浅析NoSQL数据库[J];中国西部科技;2011年02期
5 郑榕增;林世平;;基于Lucene的中文倒排索引技术的研究[J];计算机技术与发展;2010年03期
6 田秀彦;李忠;罗士美;邓力强;;基于Struts+Hibernate+Spring的整合架构及其在Web开发中的应用[J];计算机与现代化;2008年11期
7 王静疆;;搜索引擎评价指标体系比较研究[J];图书情报工作;2008年10期
8 冯朝胜;秦志光;袁丁;;数据库加密系统密钥管理模块的设计[J];电子科技大学学报;2007年05期
9 胡立春;武友新;张烨;姜晓东;;LDAP环境下的统一用户管理系统的研究与实现[J];计算机工程与设计;2007年04期
10 管建和;甘剑峰;;基于Lucene全文检索引擎的应用研究与实现[J];计算机工程与设计;2007年02期
相关硕士学位论文 前6条
1 付文静;基于HBase的大数据存储查询技术研究[D];电子科技大学;2015年
2 蔡平;基于Hadoop的NoSQL数据库安全研究[D];上海交通大学;2013年
3 关尔昱;基于数据库外层加密的数据库加密系统的设计与实现[D];北京邮电大学;2010年
4 宋赛;密文全文检索系统的安全索引结构研究[D];华中科技大学;2009年
5 刘可;数据库加密系统研究[D];合肥工业大学;2006年
6 余登峰;基于R树的空间数据索引技术研究与实现[D];中国地质大学;2006年
,本文编号:2440504
本文链接:https://www.wllwen.com/kejilunwen/ruanjiangongchenglunwen/2440504.html