Research on Methods for Obtaining Association Relationships Based on Data Lineage in Dataspaces
[Abstract]: With the continuous development of information technology, data has become increasingly massive, diverse, and unstructured. Traditional database technology cannot manage such complex data effectively, and a new data management model, the dataspace, has emerged in response. A dataspace not only supports many heterogeneous data sources, such as documents and the Web, but is also integrative and evolutionary, emphasizing the relevance and evolution of data. Patent literature contains abundant structured and unstructured information. This thesis uses massive patent data to analyze the potential technological relationships between patents and to discover novel patents. Because patent literature often lacks citations and an author's citation motivation is difficult to judge, citation relationships cannot be used directly as an index of the technological relevance between patents. To solve this problem, a comprehensive semantic similarity model between patents is constructed to evaluate their technical association. First, from the structured information, a same-author relationship matrix (WA) is constructed from patent authors and a same-classification relationship matrix (WC) is constructed from IPC patent classification numbers. Next, a patent text similarity matrix (WS) is constructed from textual information such as the specification. Finally, the comprehensive semantic similarity model is obtained by multi-dimensional fusion of these matrices. Temporal factors are then introduced and combined with the comprehensive semantic similarity model to construct the patent lineage association network. Based on the patent data lineage, the evolution paths of related technologies are analyzed to evaluate patent value and explore novel patents.
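The fusion and network-construction steps can be illustrated with a minimal sketch. The abstract does not state how the three matrices are fused or how the temporal factor is applied, so everything specific here is an assumption made for illustration: the linear weighting, the weights (0.2, 0.3, 0.5), the similarity threshold, the rule that an edge points from the later-filed patent to the earlier one, and the names `fuse_similarity` and `build_lineage_edges`.

```python
from datetime import date


def fuse_similarity(w_a, w_c, w_s, weights=(0.2, 0.3, 0.5)):
    """Fuse three n x n matrices (lists of lists) into one similarity matrix.

    Assumption: a simple weighted sum; the thesis may use another fusion rule.
    """
    a, c, s = weights
    n = len(w_s)
    return [
        [a * w_a[i][j] + c * w_c[i][j] + s * w_s[i][j] for j in range(n)]
        for i in range(n)
    ]


def build_lineage_edges(sim, filing_dates, threshold=0.5):
    """Return directed edges (later, earlier, weight).

    Assumption: patent i potentially cites patent j when i was filed after j
    and their fused similarity reaches the threshold.
    """
    edges = []
    n = len(sim)
    for i in range(n):
        for j in range(n):
            if i != j and filing_dates[i] > filing_dates[j] and sim[i][j] >= threshold:
                edges.append((i, j, sim[i][j]))
    return edges


# Toy data for three patents (hypothetical values).
w_a = [[1, 1, 0], [1, 1, 0], [0, 0, 1]]                    # same-author matrix
w_c = [[1, 1, 0], [1, 1, 1], [0, 1, 1]]                    # same-IPC-class matrix
w_s = [[1.0, 0.8, 0.1], [0.8, 1.0, 0.6], [0.1, 0.6, 1.0]]  # text similarity
dates = [date(2010, 1, 1), date(2012, 6, 1), date(2014, 3, 1)]

sim = fuse_similarity(w_a, w_c, w_s)
edges = build_lineage_edges(sim, dates)
for later, earlier, weight in edges:
    print(f"patent {later} -> patent {earlier} (similarity {weight:.2f})")
```

In this toy data, patent 1 links back to patent 0 and patent 2 links back to patent 1, while the weak similarity between patents 2 and 0 leaves only an indirect path between them.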
First, using the potential direct and indirect citation relationships between patents in the patent lineage association network, patent value is evaluated by jointly considering two factors: the exponential decay of patent value over time, and the contribution to a patent's value made by the patents linked to it through potential direct or indirect citations. Because a newly added patent affects the values of the patents already in the network, a dynamic updating algorithm for patent value is proposed to avoid time-consuming repeated computation. When there is a potential technical correlation between the newly added patent and an original patent at T1, the value of the patent is the sum of the value transfer degrees of all adjacent nodes, which improves the computational efficiency of the algorithm. Finally, experiments on a patent data set verify, through comparison and analysis of the results, the accuracy of the comprehensive patent semantic similarity model and the efficiency of the dynamic patent value updating algorithm.
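The idea of updating values incrementally rather than recomputing the whole network can be sketched as follows. The abstract gives no formulas, so this is only one plausible reading: the decay rate, the damping factor, the linear transfer rule, and the names `base_value`, `full_values`, and `add_patent` are all assumptions made for illustration, not the thesis's algorithm.

```python
import math


def base_value(age_years, decay=0.1):
    """Assumed intrinsic value: decays exponentially with patent age."""
    return math.exp(-decay * age_years)


def full_values(ages, edges, damping=0.5):
    """Recompute every patent value from scratch.

    edges: (citing, cited, weight) tuples; the citing patent is always newer,
    so the network is acyclic. Assumed rule: a patent's value is its base
    value plus a damped, weighted share of each citing patent's value.
    """
    citers = {p: [] for p in ages}
    for citing, cited, w in edges:
        citers[cited].append((citing, w))
    values = {}

    def value(p):
        if p not in values:
            values[p] = base_value(ages[p]) + sum(
                damping * w * value(q) for q, w in citers[p]
            )
        return values[p]

    for p in ages:
        value(p)
    return values


def add_patent(values, cites, new_id, new_age, new_edges, damping=0.5):
    """Insert a new patent and push only the value changes it causes.

    cites: patent -> list of (cited, weight), updated in place.
    new_edges: (cited, weight) pairs for the new patent.
    """
    values[new_id] = base_value(new_age)
    cites[new_id] = list(new_edges)
    stack = [(cited, damping * w * values[new_id]) for cited, w in new_edges]
    while stack:
        p, delta = stack.pop()
        values[p] += delta  # only patents reachable from the new one change
        stack.extend((older, damping * w * delta) for older, w in cites.get(p, []))
    return values


# Toy network (hypothetical): B potentially cites A; then C arrives and cites B.
ages = {"A": 6, "B": 4}
values = full_values(ages, [("B", "A", 0.9)])
cites = {"A": [], "B": [("A", 0.9)]}
add_patent(values, cites, "C", 0, [("B", 0.6)])

# Cross-check against a full recomputation of the enlarged network.
expected = full_values({"A": 6, "B": 4, "C": 0},
                       [("B", "A", 0.9), ("C", "B", 0.6)])
```

Because the assumed transfer rule is linear, pushing the increment along the citation paths gives the same values as a full recomputation while touching only the patents reachable from the new one.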
[Degree-granting institution]: Harbin Engineering University
[Degree level]: Master's
[Year degree granted]: 2016
[Classification number]: TP391.1
Article No.: 2223553
Article link: https://www.wllwen.com/kejilunwen/ruanjiangongchenglunwen/2223553.html