A New Concept in International Dictionary Modernization Technology: Datafication of Lexicographic Corpora
Published: 2018-03-23 07:23
Topic: corpus. Focus: datafication. Source: 《辞书研究》, 2012, No. 02. Paper type: journal article
[Abstract]: In dictionary modernization technology, the Chinese lexicographic community still devotes most of its effort to building and using corpora. International research, however, has shifted its focus to the deep processing of corpus material and the construction of databases, having recognized that combing through massive corpora to find what is useful costs dictionary compilers a great deal of time and labor. Drawing on international experience with modern dictionary technology, this paper sets out a new concept for dictionary modernization: the datafication of lexicographic corpora. That is, new results from linguistic research and data mining techniques are applied to extract from massive corpora the various kinds of usable language data that a dictionary requires, turning the corpus into a lexical/dictionary database and thereby greatly improving the efficiency of both corpus use and dictionary compilation.
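[Affiliation]: Research Center for Foreign Linguistics and Applied Linguistics / Center for Lexicographical Studies, Guangdong University of Foreign Studies
[Funding]: Supported by the Science and Technology Commission of Shanghai Municipality, grant no. 08dz1501100
[Classification number]: H06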
Article ID: 1652451
Article link: https://www.wllwen.com/wenyilunwen/yuyanxuelw/1652451.html