个人行为数据的可视化查询方法与应用

发布时间:2018-12-31 12:31
【摘要】:个人行为数据正在改变人们的生活和工作方式。个人行为数据体量大、格式多样,来源广泛,从中挖掘有价值的信息极为困难。可视化查询是探索式数据分析的重要工具,在个人行为数据分析方面,具有先天优势。然而个人行为数据的可视化查询是存储和计算双密集型应用,存在多方面的挑战:缺少成熟的可视化查询模型;缺乏有效的数据组织方法;数据查询困难;缺少灵活的交互方式;需要新颖直观的可视化设计。本文总结相关工作并提出可视化查询认知模型,将可视化技术与数据查询技术相结合,对个人行为数据开展探索式分析。本文从社交媒体空间的社交行为分析、局部路段的行车行为分析以及城市空间的出行行为分析三个方面由简到繁地介绍可视化查询方法的具体应用。大量的实验、案例以及领域专家的反馈验证了可视化查询在分析个人行为数据上的有效性。具体而言,本文主要研究内容和贡献概括为以下几点:提出语义感知的社交行为可视化查询方法。本文利用社交行为时间上的语义相关性抽取数据中隐含的话题和命名实体集合,并对二者分类以增强语义。在此基础上,构建层次语义图模型,将社交数据呈现的复杂图结构简化为层次结构,同时为可视化查询提供多重索引。本文设计了基于多重分类导航的交互界面便于查询者分析社交行为,通过分裂环形图编码的多重语义数据。对推特数据的案例分析和用户调查表明,分裂环形图视觉特征明显,有利于发现社交行为。提出基于可视化查询的微观行车行为分析方法。本文实现了基于出租车轨迹数据的可视化查询系统,分析真实的行车行为,由此推断道路交通状况。为了支持可视化查询,本文还设计了笔划式交互工具以及协同式多视图。特别地,本文提出了一种新颖的基于道路的查询模型,方便领域专家交互式执行分析任务。查询模型的核心是双向链接哈希索引TripHash,该索引能保证大规模轨迹数据的实时查询。对超过30GB的真实轨迹数据的案例分析表明,本文的可视化查询方法能有效帮助领域专家发现局部路段的交通状况。-提出面向多源数据的可视化查询平台,分析城市人群出行模式。平台基于统一的数据抽象,消除不同数据源之间的差异。对于数据存储,本文讨论了四种数据划分方法便于分布式存储海量多源数据。为支持多源出行数据的可视化查询,本文设计了一种自适应的交互界面,允许任意选择时空区间。本文提供了两类交互查询工具用于输入时空查询条件,并且实现了展现出行人群静态和动态空间分布的可视化方法。为了保证查询的实时性,本文基于MapReduce计算框架对查询过程进行了优化。结合出租车轨迹数据和手机通话记录,本文分析了城市人群的出行规律,如聚散模式。
[Abstract]:Personal behavior data are changing the way people live and work. It is difficult to mine valuable information from personal behavior data because of its large volume, diverse format and wide range of sources. Visual query is an important tool for exploratory data analysis, which has inherent advantages in personal behavior data analysis. However, visual query of personal behavior data is a dual intensive application of storage and computation, which has many challenges: lack of mature visual query model, lack of effective methods of data organization, difficulty of data query, lack of visual query model. Lack of flexible interaction; need novel visual design. This paper summarizes the related work and puts forward a cognitive model of visual query, which combines visualization technology with data query technology to carry out exploratory analysis of personal behavior data. This paper introduces the concrete application of visual query method from three aspects: social behavior analysis of social media space, traffic behavior analysis of local section and travel behavior analysis of urban space. A large number of experiments, cases and feedback from domain experts have demonstrated the effectiveness of visual queries in the analysis of personal behavior data. Specifically, the main contents and contributions of this paper are summarized as follows: a semantic aware visual query method for social behavior is proposed. In this paper, the semantic correlation of social behavior time is used to extract the implicit topics and named entity sets from the data, and classify them to enhance the semantics. On this basis, a hierarchical semantic graph model is constructed, which simplifies the complex graph structure presented by social data as a hierarchical structure and provides multiple indexes for visual queries. In this paper, an interactive interface based on multiple classification navigation is designed to facilitate the query to analyze social behavior, and the multiple semantic data encoded by split loop graph is used. Case studies and user surveys of Twitter data show that the split loop image has obvious visual features and is conducive to discovering social behavior. This paper presents a method of microscopic traffic behavior analysis based on visual query. In this paper, a visual query system based on taxi track data is implemented, and the real traffic behavior is analyzed. In order to support visual query, a stroke interactive tool and collaborative multi-view are designed. In particular, a novel road-based query model is proposed, which is convenient for domain experts to interactively perform analytical tasks. The core of the query model is the bidirectional link hashing index (TripHash,), which can guarantee the real-time query of large scale trajectory data. A case study of real track data over 30GB shows that the visual query method in this paper can effectively help domain experts to find the traffic situation of local sections.-A visual query platform for multi-source data is proposed. Analyze the travel mode of urban population. The platform is based on unified data abstraction to eliminate differences between different data sources. For data storage, this paper discusses four data partitioning methods for distributed storage of massive multi-source data. In order to support the visual query of multi-source travel data, an adaptive interactive interface is designed to allow the arbitrary selection of space-time intervals. In this paper, two kinds of interactive query tools are provided to input spatio-temporal query conditions, and a visualization method to show the static and dynamic spatial distribution of travel population is implemented. In order to ensure the real-time query, this paper optimizes the query process based on MapReduce computing framework. Based on the taxi track data and mobile phone call records, this paper analyzes the travel patterns of urban population, such as aggregation pattern.
【学位授予单位】:浙江大学
【学位级别】:博士
【学位授予年份】:2016
【分类号】:TP311.13

【相似文献】

相关期刊论文 前7条

1 柳锦森;高飞;过家春;李卫国;;基于MapInfo的测绘标志普查信息管理系统的实现[J];测绘与空间地理信息;2008年06期

2 黄波,林珲;GeoSQL:一种可视化空间扩展SQL查询语言[J];武汉测绘科技大学学报;1999年03期

3 胡金莲,陈扬友;工程数据库的可视化查询技术[J];长江科学院院报;1997年03期

4 周林;刘海行;宋转玲;贾贞;;基于WebGIS技术的PANDA断面海洋主题数据库系统可视化查询的设计与实现[J];海洋科学;2009年05期

5 李开伟;方源敏;;基于MapObjects的区域环境规划系统的设计与实现[J];地矿测绘;2006年04期

6 张景;;矿产地数据库可视化查询系统[J];计算机应用与软件;2011年08期

7 ;[J];;年期

相关重要报纸文章 前1条

1 王冲;河南厅全力打造电子会商中心[N];中国国土资源报;2011年

相关博士学位论文 前2条

1 汪飞;个人行为数据的可视化查询方法与应用[D];浙江大学;2016年

2 钱新林;面向自发地理信息的空间数据表达与管理方法研究[D];武汉大学;2011年

相关硕士学位论文 前2条

1 方锦秀;基于GIS的可视化查询系统研究[D];武汉大学;2005年

2 张景;矿产地数据库可视化查询系统[D];合肥工业大学;2007年



本文编号:2396584

资料下载
论文发表

本文链接:https://www.wllwen.com/shoufeilunwen/xxkjbs/2396584.html


Copyright(c)文论论文网All Rights Reserved | 网站地图 |

版权申明:资料由用户cafdc***提供,本站仅收录摘要或目录,作者需要删除请E-mail邮箱bigeng88@qq.com