当前位置:主页 > 文艺论文 > 汉语言论文 >

面向本体的汉语专有叙词词间关系细化研究

发布时间:2018-11-21 18:15
【摘要】:网络时代的到来,,无结构和半结构化数据大量出现,从而加剧了信息检索工作的困难,为此人们提出了构建本体这一方案,旨在从语义和知识层次上描述信息系统,组织归纳海量的信息数据,为用户提供精确而高效的网络检索服务。叙词表己经汇集了众多领域专家的知识,包括了各学科领域中比较完整的词汇。因此,利用现有叙词表,将其转化为相应的本体,是本体构建的一条捷径。 在从汉语叙词表向本体转换的过程中,现有叙词表中的词间关系与本体推理所需要的更为细化的概念间关系还相差甚远,因此,就需要细化修订传统叙词表中叙词词间关系就显得尤为重要。 传统叙词表通常由普通叙词和专有叙词组成,专有叙词是指表达单独概念的某一特定事物的专有名称主题词,它包括以下范围:地理名称、国家名称、时代名称、人名、机关团体名称、产品型号、事物专有名称等等。目前,面向本体构建的汉语叙词其词间关系研究主要集中在普通叙词词间关系研究,对专有叙词词间关系研究很少涉及。 本文在本课题组原有普通叙词词间关系研究成果的基础上,通过比较、归纳等方法对传统叙词表专有叙词词间关系进行了分析,将汉语专有叙词的词间关系化分为相同关系、上下位关系和相关关系3个层次,包括相同关系、整部关系、属种关系、实例关系、限定关系、事物与来源关系、交叉关系、并列关系、事物与时间关系、事物与空间关系、亲朋关系、人物(组织)与事物关系、文献与覆盖范围关系、学科(领域)与事物关系、组织与成员关系、人物与职业(称谓)关系等17个细分关系,完善了叙词词间关系体系(含专有叙词)。同时,本文还针对在研究过程中出现的同形异义词现象,以及叙词词间关系泛化和细化问题进行了分析讨论,旨在为今后本体构建工作中相关标准的制定提供重要参考。
[Abstract]:With the advent of the network era, unstructured and semi-structured data appear in large numbers, which aggravate the difficulty of information retrieval. For this reason, people put forward the plan of constructing ontology, which aims to describe information system from semantic and knowledge levels. Organization induces massive information data to provide users with accurate and efficient network retrieval services. The thesaurus has brought together the knowledge of experts in many fields, including relatively complete vocabulary in various disciplines. Therefore, it is a shortcut to construct ontology by transforming it into the corresponding ontology by using the existing thesaurus. In the process of transforming from Chinese thesaurus to ontology, the relationship between the words in the existing thesaurus and the more detailed concepts needed for ontology reasoning are still far from each other, so, It is particularly important to refine and revise the relationship between the words in the traditional thesaurus. The traditional thesaurus is usually composed of ordinary and exclusive thesaurus, which refers to the exclusive title of a particular thing that expresses a separate concept. It includes the following areas: geographical name, country name, time name, person name, Organization name, product type, exclusive name, etc. At present, the research on the relationship between the words of the ontology-oriented Chinese thesaurus is mainly focused on the study of the relationship between the common words, but the study on the relationship between the exclusive words is seldom involved. On the basis of the original research results of the relationship between common thesaurus in our group, this paper analyzes the relationship between the exclusive words of the traditional thesaurus by means of comparison, induction and other methods, and divides the relationship between the words of the Chinese exclusive thesaurus into the same relationship. There are three levels of relationship: the same relation, the whole relation, the genus relation, the instance relation, the limited relation, the thing and the source relation, the cross relation, the parallel relation, the thing and time relation, the thing and space relation, the relation between object and space, the relationship between object and space. Relationships between family and friends, relationships between people (organizations) and things, relationships between literature and coverage, relationships between disciplines (fields) and things, relationships between organizations and members, relationships between people and occupations (appellations), The system of relationship between thesaurus (including exclusive thesaurus) has been improved. At the same time, this paper also analyzes and discusses the phenomenon of homomorphic words and the generalization and refinement of the relations between thesaurus, in order to provide an important reference for the establishment of relevant standards in ontology construction in the future.
【学位授予单位】:河北大学
【学位级别】:硕士
【学位授予年份】:2012
【分类号】:H136

【参考文献】

相关期刊论文 前10条

1 邓志鸿,唐世渭,张铭,杨冬青,陈捷;Ontology研究综述[J];北京大学学报(自然科学版);2002年05期

2 程华道;编制专业分面叙词表的三点实践把握[J];湖北公安高等专科学校学报;1998年04期

3 孙鑫;;Ontology及其在知识组织中的应用[J];经济研究导刊;2011年27期

4 张华平,刘群;基于角色标注的中国人名自动识别研究[J];计算机学报;2004年01期

5 仓定兰;;基于叙词表的领域本体半自动构建的研究和实现[J];科学技术与工程;2009年24期

6 陈海霞;;关于CNMARC格式个人名称主题字段的标引[J];农业图书情报学刊;2005年12期

7 蔡柏生 ,黄居仁 ,曾淑娟 ,林贞仪 ,陈克健 ,庄元s

本文编号:2347860


资料下载
论文发表

本文链接:https://www.wllwen.com/wenyilunwen/hanyulw/2347860.html


Copyright(c)文论论文网All Rights Reserved | 网站地图 |

版权申明:资料由用户f2145***提供,本站仅收录摘要或目录,作者需要删除请E-mail邮箱bigeng88@qq.com