汉语缩略语及其词频分析
发布时间:2018-09-03 15:29
【摘要】:本文是从现代汉语缩略语的定义和使用范围分类出发,在大规模文本语料库的运用基础上对缩略语词长与词频的关系、缩略语发生机制、缩略语缩略规律三个方面的问题进行验证研究。 本文的研究工作分为五个部分: 第一部分是对缩略语近年来研究的回顾以及对现存问题的思考,为本文的研究奠定基础,介绍论文研究的目的、意义以及论文的结构。 第二部分是对缩略语的多角度研究。缩略语的使用存在一个语言环境的界限,相同语音及词汇形式的缩略语在不同的语言环境中可能会存在完全不同的意义。为了避免在使用缩略语时发生词语混淆、意义指向不明确的情况,有必要对缩略语进行分类。 第三部分首先对词频分析及其研究状况进行简述和阐释,再从词频角度分析缩略语的发生机制及构成规律。这其中包括计算机分词统计效果和人工分词统计效果的比对以及大规模语料库的应用,最后在对大规模语料库进行缩略语词频统计的基础上总结出缩略语的缩略规律,以求对缩略语的形成及构成有更好的认识。 第四部分是在以上词频分析的基础上探讨缩略语的产生机制和缩略规律,以及验证缩略语的缩略过程与霍夫曼编码理论之间的关系。缩略语之所以会发生缩略是受语言信息传递省力原则的影响,本部分的研究结果将说明,缩略语词长是受使用频率影响的,并且这种缩略过程与信息论中霍夫曼编码理论是一致的。 第五部分是结语,概括本文所进行的研究及统计工作,提出对本文不足之处的认识。
[Abstract]:This paper is based on the definition and classification of the scope of use of modern Chinese acronyms, on the basis of the use of large-scale text corpus, the relationship between the length of acronyms and word frequency, and the mechanism of acronym generation. The acronyms are verified in three aspects: the rules of acronyms. The research work of this paper is divided into five parts: the first part is a review of the research of acronyms in recent years and reflections on the existing problems, which lays a foundation for the study of this paper, and introduces the purpose of the research. Meaning and structure of the paper. The second part is the study of acronyms from various angles. The use of acronyms has a linguistic boundary, and abbreviations of the same phonetic and lexical forms may have completely different meanings in different language environments. In order to avoid the confusion in the use of acronyms and the ambiguity of the meaning, it is necessary to classify the abbreviations. In the third part, the analysis of word frequency and its research status are briefly described and explained, and then the mechanism and formation of acronyms are analyzed from the angle of word frequency. This includes the comparison between the statistical effect of computer segmentation and artificial segmentation, and the application of large-scale corpus. Finally, the acronyms are summed up on the basis of the acronym frequency statistics of large-scale corpus. In order to have a better understanding of the formation and formation of acronyms. The fourth part discusses the mechanism of acronyms and the rules of acronyms on the basis of the frequency analysis above, and verifies the relationship between the acronyms and Hoffman's coding theory. The acronyms are influenced by the principle of saving effort in the transmission of language information. The results of this part will show that the length of acronyms is influenced by the frequency of use. And the acronym process is consistent with Hoffman's coding theory in information theory. The fifth part is the conclusion, summarizes the research and statistical work carried out in this paper, and puts forward the understanding of the deficiency of this paper.
【学位授予单位】:安徽大学
【学位级别】:硕士
【学位授予年份】:2012
【分类号】:H136
[Abstract]:This paper is based on the definition and classification of the scope of use of modern Chinese acronyms, on the basis of the use of large-scale text corpus, the relationship between the length of acronyms and word frequency, and the mechanism of acronym generation. The acronyms are verified in three aspects: the rules of acronyms. The research work of this paper is divided into five parts: the first part is a review of the research of acronyms in recent years and reflections on the existing problems, which lays a foundation for the study of this paper, and introduces the purpose of the research. Meaning and structure of the paper. The second part is the study of acronyms from various angles. The use of acronyms has a linguistic boundary, and abbreviations of the same phonetic and lexical forms may have completely different meanings in different language environments. In order to avoid the confusion in the use of acronyms and the ambiguity of the meaning, it is necessary to classify the abbreviations. In the third part, the analysis of word frequency and its research status are briefly described and explained, and then the mechanism and formation of acronyms are analyzed from the angle of word frequency. This includes the comparison between the statistical effect of computer segmentation and artificial segmentation, and the application of large-scale corpus. Finally, the acronyms are summed up on the basis of the acronym frequency statistics of large-scale corpus. In order to have a better understanding of the formation and formation of acronyms. The fourth part discusses the mechanism of acronyms and the rules of acronyms on the basis of the frequency analysis above, and verifies the relationship between the acronyms and Hoffman's coding theory. The acronyms are influenced by the principle of saving effort in the transmission of language information. The results of this part will show that the length of acronyms is influenced by the frequency of use. And the acronym process is consistent with Hoffman's coding theory in information theory. The fifth part is the conclusion, summarizes the research and statistical work carried out in this paper, and puts forward the understanding of the deficiency of this paper.
【学位授予单位】:安徽大学
【学位级别】:硕士
【学位授予年份】:2012
【分类号】:H136
【参考文献】
相关期刊论文 前10条
1 王化鹏;论现代汉语词的双音节化及其发展规律[J];北方论丛;2000年06期
2 刘洪波;;词频统计的发展[J];图书与情报;1991年02期
3 曾庆娜;;试论缩略语的规范问题[J];呼伦贝尔学院学报;2008年05期
4 吴翠芹;缩略语及其与原词语的关系[J];广西社会科学;2005年03期
5 刘云;;汉语词汇统计研究述评[J];汉语学习;2009年01期
6 郑阳寿;语言缩略语和言语缩略语[J];汉字文化;2001年02期
7 郭国权;;关于缩略语的界定及相关问题的探讨[J];佳木斯大学社会科学学报;2009年06期
8 刘晓静;赵雪;张晓磊;;现代汉语缩略语结构及语法特点探析[J];佳木斯大学社会科学学报;2010年01期
9 田,
本文编号:2220396
本文链接:https://www.wllwen.com/wenyilunwen/hanyulw/2220396.html