Identifying and Extracting Relational Information from Web Lists

Published: 2018-03-01 12:22

Keywords: Web lists; relational information; identification; extraction　Source: Computer Science (《计算机科学》), 2004, No. 06　Paper type: journal article

[Abstract]: A large amount of relational information exists in all kinds of Web lists, yet it is hard to find with current search engines. This paper proposes a method, based on semantic and data features, for identifying and extracting relational information from Web lists. We first build a model that describes the desired relational information, then look for lists on the Web and estimate whether they contain that information; when the estimate is sufficiently large, the relational information is extracted from the list.
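The abstract gives only the outline of the pipeline (model, estimate, threshold, extract), so the following is a minimal sketch under stated assumptions rather than the authors' method. The `Attribute` class, the equal 0.5/0.5 weighting of semantic and data features, the greedy column assignment, and the 0.6 threshold are all hypothetical choices made for illustration.

```python
import re
from dataclasses import dataclass


@dataclass
class Attribute:
    """One attribute of the desired relation (hypothetical model format)."""
    name: str
    keywords: tuple  # semantic feature: words expected in a column header
    pattern: str     # data feature: regex the column values should match


def score_list(headers, rows, model):
    """Estimate how well a list matches the model.

    Returns (score in [0, 1], {attribute name: column index}).
    The scoring formula is an assumption, not taken from the paper.
    """
    mapping, used, total = {}, set(), 0.0
    for attr in model:
        best_col, best = None, 0.0
        for col, header in enumerate(headers):
            if col in used:  # greedy: each column serves one attribute
                continue
            semantic = 1.0 if any(k in header.lower() for k in attr.keywords) else 0.0
            values = [row[col] for row in rows if col < len(row)]
            hits = sum(1 for v in values if re.fullmatch(attr.pattern, v.strip()))
            data = hits / len(values) if values else 0.0
            score = 0.5 * semantic + 0.5 * data
            if score > best:
                best_col, best = col, score
        if best_col is not None:
            mapping[attr.name] = best_col
            used.add(best_col)
        total += best
    return (total / len(model) if model else 0.0), mapping


def extract(headers, rows, model, threshold=0.6):
    """Extract tuples only when the estimate is large enough."""
    score, mapping = score_list(headers, rows, model)
    if score < threshold or len(mapping) < len(model):
        return []
    return [
        {name: row[col].strip() for name, col in mapping.items()}
        for row in rows
        if all(col < len(row) for col in mapping.values())
    ]


# Hypothetical model: a (title, price) relation.
BOOK_MODEL = [
    Attribute("title", ("title", "book"), r".+"),
    Attribute("price", ("price", "cost"), r"\$?\d+(\.\d{2})?"),
]
```

Calling `extract(["Book Title", "Price"], [["Dune", "$9.99"]], BOOK_MODEL)` would treat the list as a match and return one record per row, while a list whose headers and values fit neither attribute falls below the threshold and yields nothing. Splitting the estimate from the extraction step mirrors the order described in the abstract.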
[Author affiliation]: Department of Computer Science, National Huaqiao University (国立华侨大学)
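[Funding]: Key Project of the State Planning Commission; Special Departmental Project of the Overseas Chinese Affairs Office of the State Council (ZX2000); Natural Science Foundation of Fujian Province (A0210017)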
[Classification number]: TP393.092
