基于Android的联机手写维吾尔文识别研究

发布时间:2018-07-24 09:02
【摘要】:维吾尔文是新疆维吾尔自治区少数名族的主要交流文字,为了便利当地人们交流,开展有关维吾尔文文字处理技术的研究是很有必要。在当今社会,常用电子信息设备都已经进入了平常百姓家,特别是手机移动终端设备,已经成为了人们日常生活之中不可缺少的通信工具,开展有关手机移动终端上的维吾尔文文字信息处理技术已显得迫切需要。目前,针对维吾尔文字的识别技术主要有印刷体识别和联机手写文字识别。在印刷体文字识别方面,已经取得了较大发展,但有关联机手写文字识别方面的研究还比较少并且大部分研究都是以字母为基本识别单位的。以字母为基本识别单位,其优点就是需要建立的字符模型少,识别效率快。但其缺点也较为明显,首先要面临的就是字母切分困难问题。因为维吾尔文属于黏着性语言文字,其大部分字母都可以粘连书写,具有天然的草书特点,并且字母变体多,如何切分出正确的字母一直都是研究难点。再就是针对基本字母的识别无法满足人们在触屏手机上进行单词输入的实际应用需求。为了解决此问题,本文主要针对基于Android移动终端上的手写维吾尔文字识别方法进行了研究。首先研究了在Android手机上针对维吾尔文单词进行采样的方法,其次,以采集的样本信息为基础,对维吾尔文单词的预处理和特征提取方法进行了研究。同时为了满足可以识别维吾尔文整词的需求,本文中根据维吾尔文单词都是由多个连体段组成的特点,选取以连体段为基本单位进行特征的提取,然后通过拼接连体段特征构成一个单词的特征向量。最后,通过本文中提出的特征压缩方法将已提取的单词特征向量转化为离散隐马尔可夫模型可以读取的观察值序列,接下来使用隐马尔科夫模型完成对单词的建模和识别测试。本文主要研究了在安卓移动设备上对手写维吾尔文单词进行识别的方法。通过在手机上完成维吾尔文单词采样后,前期研究工作主要是在电脑端完成的,后期再将识别程序移植到手机设备,初步在安卓移动终端上完成对手写维吾尔文单词的识别。
[Abstract]:Uygur language is the main communication character of a few ethnic groups in Xinjiang Uygur Autonomous region. In order to facilitate the exchange of local people, it is necessary to carry out research on Uygur character processing technology. In today's society, common electronic information devices have entered the homes of ordinary people, especially mobile terminal devices, and have become an indispensable communication tool in people's daily lives. It is urgent to develop Uighur character information processing technology on mobile terminal. At present, Uygur character recognition technology mainly includes print recognition and online handwriting recognition. In the field of printed character recognition, great progress has been made, but there are few researches on on-line handwritten character recognition and most of the researches are based on letters. Taking letters as the basic recognition unit, it has the advantages of less character models and faster recognition efficiency. But its shortcomings are obvious, the first is the problem of alphabetic segmentation. Because Uyghur belongs to adherent language and characters, most of its letters can be stuck and written, and has the characteristics of natural cursive script, and there are many variations of letters. How to cut out the correct letters has always been a research difficulty. Second, the recognition of basic letters can not meet the practical needs of word input on touch-screen mobile phones. In order to solve this problem, this paper mainly focuses on handwritten Uighur character recognition based on Android mobile terminal. Firstly, the method of sampling Uygur words on Android mobile phone is studied. Secondly, based on the sample information collected, the preprocessing and feature extraction methods of Uygur words are studied. At the same time, in order to meet the need to recognize Uygur integer words, according to the characteristics of Uygur words are composed of multiple conjoined segments, we select the conjoined segment as the basic unit for feature extraction. Then the feature vectors of a word are constructed by splicing the joint segment features. Finally, the extracted word feature vector is transformed into a sequence of observation values that can be read by discrete hidden Markov model through the proposed feature compression method in this paper. Then, the hidden Markov model is used to complete the word modeling and recognition test. This paper mainly studies the recognition method of handwritten Uygur words on Android mobile devices. After completing the Uygur word sampling on the mobile phone, the previous research work is mainly completed on the computer, and the recognition program is transplanted to the mobile phone device at the later stage, and the recognition of the handwritten Uygur words is preliminarily completed on the Android mobile terminal.
【学位授予单位】:新疆大学
【学位级别】:硕士
【学位授予年份】:2017
【分类号】:TP391.43;TP316

【参考文献】

相关期刊论文 前10条

1 热依曼·吐尔逊;吾守尔·斯拉木;;一种维吾尔语联机手写识别系统[J];中文信息学报;2014年03期

2 姜文;卢朝阳;李静;;基于方向线素特征的手写体维文字符识别[J];微电子学与计算机;2013年10期

3 祖丽菲亚·卡哈尔;玛依热·依布拉音;艾斯卡尔·艾木都拉;地里木拉提·吐尔逊;;组合特征的联机手写维吾尔字母识别[J];通信技术;2013年05期

4 玛依热·依布拉音;张恒;刘成林;艾斯卡尔·艾木都拉;;联机手写维吾尔文字母识别方法[J];模式识别与人工智能;2012年06期

5 许亚美;卢朝阳;李静;;部件字典结合时分方向特征的手写维吾尔字符识别[J];吉林大学学报(工学版);2013年03期

6 柳玲玲;赵晖;;联机手写维吾尔文单词识别中两种语言模型的比较研究[J];计算机应用与软件;2012年09期

7 邹霞;哈力木拉提·买买提;艾尔肯·赛甫丁;;维吾尔新文字印刷体识别系统的研究与开发[J];新疆大学学报(自然科学版);2012年02期

8 陈卿;袁保社;李晓;任宏宇;张建华;;基于模板匹配的印刷维吾尔文字符识别研究[J];计算机技术与发展;2012年04期

9 艾力·居麦;哈力旦·A;黄浩;;视频图像中维吾尔文字的识别研究[J];计算机工程与应用;2011年36期

10 木塔力甫·沙塔尔;李春庚;艾斯卡尔·艾木都拉;安居白;;基于可训练机制的联机维吾尔手写字母识别技术研究[J];计算机应用与软件;2011年09期

相关硕士学位论文 前6条

1 陆钢锋;印刷体维吾尔文识别系统识别技术相关研究[D];新疆大学;2013年

2 皮桂林;基于HMM模型的联机手写维文单词识别方法研究[D];新疆大学;2012年

3 姜文;维吾尔文单字符Gabor特征提取与识别[D];西安电子科技大学;2012年

4 贾建忠;脱机印刷体维吾尔文字识别特征选择和分类器设计方法的研究[D];苏州大学;2008年

5 万芳;联机手写维吾尔文字识别技术的研究与实现[D];新疆大学;2007年

6 梁涌;印刷体汉字识别系统的研究与实现[D];西北工业大学;2006年



本文编号:2140883

资料下载
论文发表

本文链接:https://www.wllwen.com/shoufeilunwen/xixikjs/2140883.html


Copyright(c)文论论文网All Rights Reserved | 网站地图 |

版权申明:资料由用户cf697***提供,本站仅收录摘要或目录,作者需要删除请E-mail邮箱bigeng88@qq.com