当前位置:主页 > 科技论文 > 搜索引擎论文 >

Stanbol系统及其在国外应用概述

发布时间:2018-09-13 07:47
【摘要】:web中大量新闻网页、博客、电子邮件等非结构化信息中蕴含着大量的知识,对其进行处理以自动获得知识具有重要意义。目前,一些基于信息抽取等技术抽取简单关联关系的知识获取应用系统存在明显的局限性,本文引入Apache Stanbol——Apache下的一种从非结构化信息中自动获取知识的开源项目,它是一个为语义内容管理设计的模块化的软件集和可重用组件,旨在将传统内容管理系统(CMS)拓展为支持语义服务的语义内容管理系统(SCMS),在此基础上,为改善搜索引擎关于内容的搜索、分类,实体消歧及语义化查询等带来帮助。
[Abstract]:A large number of news pages, blogs, emails and other unstructured information in web contain a lot of knowledge, so it is of great significance to deal with them in order to acquire knowledge automatically. At present, some knowledge acquisition systems based on information extraction and other technologies have obvious limitations. This paper introduces an open source project based on Apache Stanbol--Apache, which automatically acquires knowledge from unstructured information. It is a modular software set and reusable component designed for semantic content management. It aims to extend traditional content management system (CMS) to semantic content management system (SCMS),) which supports semantic services. In order to improve search engine on content search, classification, entity disambiguation and semantic query bring help.
【作者单位】: 中国科学技术信息研究所;
【分类号】:TP391.1


本文编号:2240517

资料下载
论文发表

本文链接:https://www.wllwen.com/kejilunwen/sousuoyinqinglunwen/2240517.html


Copyright(c)文论论文网All Rights Reserved | 网站地图 |

版权申明:资料由用户ea3b3***提供,本站仅收录摘要或目录,作者需要删除请E-mail邮箱bigeng88@qq.com