自然场景下字符区域的定位与识别
[Abstract]:With the development of various smart mobile devices such as mobile phones, people pay more and more attention to the acquisition and utilization of text information in natural scenes such as life and industrial environment. Text information in natural scenes is different from printed text, it has a variety of languages, different fonts and sizes, complex background interference, damage and the impact of defacement, All these bring some difficulties and challenges to the acquisition and utilization of text information in natural scenes. This thesis is devoted to the research of the exact location and recognition of the following regions of the natural scene, focusing on the localization and recognition of the English character and digital regions; On this basis, a special scene of text location and recognition of natural scene, regional location and recognition of railway tanker number in complex industrial environment is further studied. As a part of the following area of natural scene, the railway tanker car number area is characterized by the rupture of its characters. In this paper, the location and recognition of the railway tanker number in complex industrial environment is studied as a special scene of the location and recognition of the local area in the following natural scene. In order to locate the area of railway tanker number accurately from many disturbances in complex industrial environment, and to separate and recognize the broken characters. On the basis of comparing and summarizing all kinds of text region localization methods, this paper presents a new location method which is suitable for English alphabet and digital region, which is widely used in natural scene. It is also applicable to the regional location of railway tanker number in complex industrial environment. This method has better localization effect for text regions with different sizes and tilting, which are affected by illumination changes. Firstly, the maximum stable extremum region (Maximally Stable Extremal Regions,MSER) is used to detect and obtain the extremum region, and the obtained extreme value region is screened. Then the triplet region is connected to the candidate text region by the effective region pair, and then the candidate text region is selected by using support vector machine (Support Vector Machine,SVM). In order to verify the generality of this method, this paper studies the acquisition of railway tanker number information in complex industrial environment, which is a special application scene in natural scene. In this paper, according to the character of railway tanker car number character break, a separation method suitable for breaking character is given. The recognition of characters takes into account the variety of characters and the variety of fonts in natural scenes, while the type of characters of railway tanker car number is fixed and the font changes are relatively little, so different methods are adopted to recognize the characters in these two major application scenarios. Tesseract-OCR is used to train and recognize the common English letters and numbers in natural scenes, and SVM is used to classify and recognize the vehicle number characters. A large number of experiments have proved that the method adopted in this paper has a good effect on the location and recognition of text information in the two research scenarios.
【学位授予单位】:哈尔滨工程大学
【学位级别】:硕士
【学位授予年份】:2016
【分类号】:TP391.41
【参考文献】
相关期刊论文 前10条
1 肖意;姜军;;基于最大稳定极值区域的车牌定位与字符分割[J];计算机与数字工程;2015年12期
2 王斌;;基于MSER和PHOG特征的交通标志检测方法[J];智慧工厂;2015年10期
3 刘新瀚;钱侃;王宇飞;朱向霄;孙知信;;自然场景下基于连通域检测的文字识别算法研究[J];计算机技术与发展;2015年05期
4 王雁;穆春阳;马行;;基于颜色标准化模型和HOG特征的交通标志检测[J];软件导刊;2015年03期
5 张伟伟;汤光明;孙怡峰;苏伟;;一种针对汉字特点的场景图像中文文本定位算法[J];信息工程大学学报;2014年06期
6 余旺盛;田孝华;侯志强;;基于区域边缘统计的图像特征描述新方法[J];计算机学报;2014年06期
7 郭兰图;余芳;陈金凤;;一种局部与全局特征结合的图像检索算法[J];微型机与应用;2013年18期
8 刘晓佩;卢朝阳;李静;;基于蚁群算法和LBP-HF的场景文本定位[J];光电工程;2012年03期
9 刘静;;几种车牌字符识别算法的比较[J];电脑与电信;2008年08期
10 杨凌霄;武建平;;机器学习方法在人脸检测中的应用[J];计算机与数字工程;2008年03期
相关博士学位论文 前3条
1 吴锐;自然场景中文本识别技术研究及实现[D];哈尔滨工业大学;2010年
2 李学勇;金属标牌压印凹凸字符的特征提取和识别方法研究[D];山东大学;2008年
3 黄剑华;自然场景中文本信息提取方法[D];哈尔滨工业大学;2007年
相关硕士学位论文 前10条
1 刘亚亚;图像中文本区域定位的研究[D];江南大学;2015年
2 袁俊淼;基于几何约束的笔划宽度变换(SWT)算法及其字幕文本定位应用[D];电子科技大学;2015年
3 万松;基于Tesseract-OCR的名片识别系统的研究与实现[D];华南理工大学;2014年
4 胡倩;自然场景下的文本定位[D];淮北师范大学;2014年
5 张宏丹;汽车牌照自动识别系统的设计与实现[D];电子科技大学;2014年
6 孙海侠;超商货架商品分割与识别方法研究[D];南京理工大学;2014年
7 刘萌萌;基于神经网络的压印字符识别系统研究[D];天津大学;2013年
8 张龙;蚁群算法在铁谱图像处理中的应用研究[D];南京航空航天大学;2013年
9 刘大方;基于CPN的视频人工文本提取方法研究[D];哈尔滨工程大学;2013年
10 李佳新;复杂背景下的视频文本定位与分割[D];西安科技大学;2012年
,本文编号:2245488
本文链接:https://www.wllwen.com/kejilunwen/ruanjiangongchenglunwen/2245488.html