数据智能可视化系统中图形透视表配置的生成与推荐
发布时间:2019-04-13 16:08
【摘要】:大数据时代中,每一条数据都蕴含巨大的价值,但是很少有企业意识到可以通过数据可视化将这些数据转换为实际的经济价值,而这很大程度上是因为当前的许多数据可视化系统表达能力较弱,同时也缺乏一些智能性,使决策者无法及时作出正确的决策。本文设计并实现了一个数据智能可视化系统,解决了两个最核心的技术难题:1)具有高可扩展性的图形透视表配置的生成;2)如何给用户推荐更有价值的图形透视表配置,而这又包括"下一步怎么看"和"应该怎么开始看"这两个重要问题。针对图形透视表配置的生成,本文提出了新的表代数算子,并用其生成透视结构配置,然后基于改进的图形语言来生成图形设计,并归纳了图形透视表配置的显式信息与隐式信息。而在智能推荐的问题上,本文总结并设计了基于数据特征的标记类型推导规则;提出了数据特征组合的三原则,并结合先验知识设计了启发式算法解决了单字段配置问题;设计了基于优先原则的多字段图表类型优先级算法,提出了基于图形语言的图形透视表配置推荐算法。本文设计实现的系统已经作为商业智能可视化平台网易有数的一个模块,应用在网易多款产品中,为运营决策提供了能够进行数据探索的智能可视化工具。
[Abstract]:In the era of big data, every piece of data contained great value, but few enterprises realized that they could convert the data into actual economic value through data visualization. This is largely due to the weak expression ability of many current data visualization systems and the lack of some intelligence, which makes the decision-makers unable to make the correct decision in time. In this paper, a data intelligent visualization system is designed and implemented, which solves two core technical problems: 1) the generation of high expansibility graphics perspective table configuration; 2) how to recommend more valuable graphical PivotTable configuration to users, and this includes two important questions: "what to think next" and "how to start looking". In this paper, a new table algebra operator is proposed, which is used to generate the perspective structure configuration, and then the graphic design is generated based on the improved graphic language, and a new table algebra operator is proposed for the generation of the configuration of the graphic PivotTable. The explicit information and implicit information of graphic PivotTable configuration are summarized. On the issue of intelligent recommendation, this paper summarizes and designs the derivation rules of tag types based on data features, puts forward three principles of data feature combination, and designs a heuristic algorithm based on prior knowledge to solve the problem of single-field configuration. This paper designs a multi-field chart type priority algorithm based on the priority principle, and proposes a graphic PivotTable configuration recommendation algorithm based on graphics language. The system designed and implemented in this paper has been used as a module of NetEase, a business intelligence visualization platform, which has been used in many NetEase products, and provides intelligent visualization tools for operational decision-making that can carry on data exploration.
【学位授予单位】:浙江大学
【学位级别】:硕士
【学位授予年份】:2017
【分类号】:TP311.52
本文编号:2457730
[Abstract]:In the era of big data, every piece of data contained great value, but few enterprises realized that they could convert the data into actual economic value through data visualization. This is largely due to the weak expression ability of many current data visualization systems and the lack of some intelligence, which makes the decision-makers unable to make the correct decision in time. In this paper, a data intelligent visualization system is designed and implemented, which solves two core technical problems: 1) the generation of high expansibility graphics perspective table configuration; 2) how to recommend more valuable graphical PivotTable configuration to users, and this includes two important questions: "what to think next" and "how to start looking". In this paper, a new table algebra operator is proposed, which is used to generate the perspective structure configuration, and then the graphic design is generated based on the improved graphic language, and a new table algebra operator is proposed for the generation of the configuration of the graphic PivotTable. The explicit information and implicit information of graphic PivotTable configuration are summarized. On the issue of intelligent recommendation, this paper summarizes and designs the derivation rules of tag types based on data features, puts forward three principles of data feature combination, and designs a heuristic algorithm based on prior knowledge to solve the problem of single-field configuration. This paper designs a multi-field chart type priority algorithm based on the priority principle, and proposes a graphic PivotTable configuration recommendation algorithm based on graphics language. The system designed and implemented in this paper has been used as a module of NetEase, a business intelligence visualization platform, which has been used in many NetEase products, and provides intelligent visualization tools for operational decision-making that can carry on data exploration.
【学位授予单位】:浙江大学
【学位级别】:硕士
【学位授予年份】:2017
【分类号】:TP311.52
【参考文献】
相关期刊论文 前1条
1 戴国忠;陈为;洪文学;刘世霞;屈华民;袁晓如;张加万;张康;;信息可视化和可视分析:挑战与机遇——北戴河信息可视化战略研讨会总结报告[J];中国科学:信息科学;2013年01期
,本文编号:2457730
本文链接:https://www.wllwen.com/shoufeilunwen/xixikjs/2457730.html