基于Android平台文字识别应用的设计与实现
发布时间:2018-08-16 20:00
【摘要】:随着OCR技术的日渐成熟,移动互联网的蓬勃发展,拥有OCR技术的移动终端产品逐渐深入到了人们的日常生活。移动终端的便携性使其适合作为大规模移动OCR商用平台。本文对Android框架和JNI技术进行综述,将Android平台与OCR技术结合,实现了基于Android平台文字识别系统应用的设计与开发。论文主要工作内容有:(1)针对文字识别系统相应的技术和算法进行了概括。研究对自然场景下所拍摄图像进行文本标记的算法,为后续OCR部分中的文字分割提供相关参数。同时研究图像预处理相关算法,在极值中值滤波基础上,提出了一种适用于Android平台的图像预处理方案。(2)分析了 Tesseract工作流程与相关算法,利用JNI进行Android应用程序和Tesseract库的交互,并将其移植到Android平台,作为移动终端文字识别的引擎。设计了系统的框架和实现方式。最终实现了对英文字符和中文字符的识别。(3)测试和分析了系统性能。根据分析结果验证了 OCR系统性能良好,并针对不足之处给出了有效的改进方案,为进一步深入研究提供了可靠的数据支持。本文实现了 Android终端上中英文字符识别应用的设计、开发,对OCR理论研究和商业应用都十分有意义。
[Abstract]:With the maturity of OCR technology and the vigorous development of mobile Internet, mobile terminal products with OCR technology have gradually penetrated into people's daily life. The portability of mobile terminal makes it suitable for large scale mobile OCR commercial platform. In this paper, Android framework and JNI technology are summarized, and the design and development of character recognition system based on Android platform are realized by combining Android platform with OCR technology. The main contents of this paper are as follows: (1) the corresponding techniques and algorithms of character recognition system are summarized. This paper studies the text marking algorithm of the images taken in natural scene, and provides the relevant parameters for the text segmentation in the following OCR part. At the same time, the correlation algorithm of image preprocessing is studied. On the basis of extremum median filter, an image preprocessing scheme suitable for Android platform is proposed. (2) the Tesseract workflow and related algorithms are analyzed, and the interaction between Android application program and Tesseract library is carried out by using JNI. And transplant it to Android platform, as mobile terminal text recognition engine. The framework and implementation of the system are designed. Finally, the recognition of English characters and Chinese characters is realized. (3) the system performance is tested and analyzed. According to the analysis results, the performance of the OCR system is proved to be good, and an effective improvement scheme is given in view of the shortcomings, which provides reliable data support for further research. This paper implements the design and development of Chinese and English character recognition application on Android terminal, which is of great significance to the research of OCR theory and commercial application.
【学位授予单位】:北京邮电大学
【学位级别】:硕士
【学位授予年份】:2016
【分类号】:TP391.41;TP316
[Abstract]:With the maturity of OCR technology and the vigorous development of mobile Internet, mobile terminal products with OCR technology have gradually penetrated into people's daily life. The portability of mobile terminal makes it suitable for large scale mobile OCR commercial platform. In this paper, Android framework and JNI technology are summarized, and the design and development of character recognition system based on Android platform are realized by combining Android platform with OCR technology. The main contents of this paper are as follows: (1) the corresponding techniques and algorithms of character recognition system are summarized. This paper studies the text marking algorithm of the images taken in natural scene, and provides the relevant parameters for the text segmentation in the following OCR part. At the same time, the correlation algorithm of image preprocessing is studied. On the basis of extremum median filter, an image preprocessing scheme suitable for Android platform is proposed. (2) the Tesseract workflow and related algorithms are analyzed, and the interaction between Android application program and Tesseract library is carried out by using JNI. And transplant it to Android platform, as mobile terminal text recognition engine. The framework and implementation of the system are designed. Finally, the recognition of English characters and Chinese characters is realized. (3) the system performance is tested and analyzed. According to the analysis results, the performance of the OCR system is proved to be good, and an effective improvement scheme is given in view of the shortcomings, which provides reliable data support for further research. This paper implements the design and development of Chinese and English character recognition application on Android terminal, which is of great significance to the research of OCR theory and commercial application.
【学位授予单位】:北京邮电大学
【学位级别】:硕士
【学位授予年份】:2016
【分类号】:TP391.41;TP316
【参考文献】
相关期刊论文 前10条
1 刘淼;杨镇豪;谢韵玲;谢冬青;唐春明;;Android图文同步识别系统的设计和实现[J];计算机工程与设计;2014年06期
2 童立靖;张艳;舒巍;占国亮;钱W,
本文编号:2187031
本文链接:https://www.wllwen.com/kejilunwen/ruanjiangongchenglunwen/2187031.html