Using the pytesser module to recognize text in images


Dream_weaver928

Category: python  Tags: python, image text recognition

Article link: https://blog.csdn.net/Dream_weaver928/article/details/44629525


Pytesser: OCR in Python using the Tesseract engine from Google

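pytesser is a Python module that wraps Tesseract, the open-source OCR engine backed by Google. Importing it in Python is enough to convert the text in an image into a string.

Link: https://code.google.com/p/pytesser/

pytesser delegates the actual work to tesseract: your Python code calls the pytesser module, and pytesser in turn runs tesseract to recognize the text in the image.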
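
To make that call chain concrete, here is a minimal sketch of what such a wrapper might do, assuming a `tesseract` executable is reachable on the PATH. The helper name `image_file_to_text` and its injectable `run` parameter are hypothetical and exist only for illustration; this is not pytesser's actual source.

```python
import io
import os
import shutil
import subprocess
import tempfile


def image_file_to_text(image_path, tesseract_cmd="tesseract",
                       run=subprocess.check_call):
    """Run the tesseract CLI on image_path and return the recognized text.

    Hypothetical helper, not pytesser's real implementation. `run` is
    injectable so the flow can be exercised without tesseract installed.
    """
    workdir = tempfile.mkdtemp()
    outbase = os.path.join(workdir, "ocr_result")
    try:
        # tesseract writes its result to <outbase>.txt
        run([tesseract_cmd, image_path, outbase])
        with io.open(outbase + ".txt", encoding="utf-8") as result_file:
            return result_file.read()
    finally:
        # Remove the temporary output whether or not the call succeeded
        shutil.rmtree(workdir, ignore_errors=True)
```

With tesseract installed, `image_file_to_text('fnord.tif')` would return whatever text the engine recognizes in that image.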

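The steps below walk through the whole process:

1. First, download pytesser from code.google.com: https://code.google.com/p/pytesser/downloads/detail?name=pytesser_v0.0.1.zip

No installer is needed: you can place it under \Lib\site-packages\ in the Python installation folder and use it directly.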
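
If the import fails after copying the files, it helps to confirm that Python can actually see the module. The sketch below scans a list of directories (by default `sys.path`) for `pytesser.py` or a `pytesser` package. `find_module_location` is a hypothetical helper name, and the check is a deliberate simplification of Python's real import machinery.

```python
import os
import sys


def find_module_location(name, search_paths=None):
    """Return the first file from which `name` could be imported, else None.

    Simplified: only looks for <dir>/<name>.py or <dir>/<name>/__init__.py.
    """
    if search_paths is None:
        search_paths = sys.path
    for directory in search_paths:
        module_file = os.path.join(directory, name + ".py")
        package_init = os.path.join(directory, name, "__init__.py")
        if os.path.isfile(module_file):
            return module_file
        if os.path.isfile(package_init):
            return package_init
    return None
```

Calling `find_module_location("pytesser")` returns `None` when neither layout is found on the search path, which usually means the folder was unzipped one level too deep or under a different name.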

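The pytesser archive bundles tesseract.exe, the English data pack (so by default only English is recognized), and a few sample images, so it works as soon as it is unzipped.

You can test it with the following code: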

    >>> from pytesser import *
    >>> image = Image.open('fnord.tif')  # Open image object using PIL
    >>> print image_to_string(image)     # Run tesseract.exe on image
    fnord
    >>> print image_file_to_string('fnord.tif')
    fnord

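The same test as a standalone script, with the other bundled sample images left commented out:

    from pytesser import *

    # Sample images shipped with pytesser; uncomment one to try it
    # im = Image.open('fnord.tif')
    # im = Image.open('phototest.tif')
    # im = Image.open('eurotext.tif')
    im = Image.open('fonts_test.png')
    text = image_to_string(im)  # run OCR on the opened image
    print text  # Python 2 print statement, matching pytesser v0.0.1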

Note: this module depends on the PIL library.
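
A quick way to check that dependency before running any OCR is to try the import. The sketch below is only a convenience check; `pil_available` is a hypothetical helper name, and it tries both the `PIL` package layout and the old top-level `Image` module.

```python
def pil_available():
    """Return True if a PIL-style Image module can be imported."""
    try:
        from PIL import Image  # package layout used by most installs
        return True
    except ImportError:
        try:
            import Image  # old standalone PIL layout
            return True
        except ImportError:
            return False
```

If this returns `False`, the `Image.open` calls in the test code above cannot work until PIL is installed.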

2. Improving a low recognition rate

Boosting the image's contrast, or converting it to black and white, can raise the recognition rate considerably:

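    from PIL import Image, ImageEnhance
    image1 = Image.open('fonts_test.png')  # any image to be recognized
    enhancer = ImageEnhance.Contrast(image1)
    image2 = enhancer.enhance(4)  # factors above 1.0 increase contrast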

You can then call image_to_string on image2 to run recognition again.
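
For the black-and-white conversion mentioned above, one common approach is to convert the image to grayscale and then apply a fixed threshold. The sketch below assumes `image` is a PIL image object. The helper names and the default threshold of 128 are illustrative choices, not values taken from pytesser, and the best threshold depends on the picture.

```python
def make_threshold_table(threshold=128):
    """Build a 256-entry lookup table: below threshold -> 0, otherwise 255."""
    return [0 if value < threshold else 255 for value in range(256)]


def to_black_and_white(image, threshold=128):
    """Convert a PIL image to grayscale ('L') and binarize it.

    Illustrative helper: every pixel becomes pure black or pure white.
    """
    gray = image.convert("L")
    # Image.point applies the lookup table to every pixel value
    return gray.point(make_threshold_table(threshold))
```

The result can be passed to image_to_string in the same way as image2; trying a few thresholds on a sample image is usually the quickest way to find a workable value.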

3. Recognizing other languages

tesseract is a command-line program that takes the following arguments:

tesseract imagename outbase [-l lang] [-psm N] [configfile...]

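imagename is the name of the input image.

outbase is the base name of the output text file; the result is written to outbase.txt.

-l lang sets the language to recognize; the default is English.

For details, see http://tesseract-ocr.googlecode.com/svn-history/r725/trunk/doc/tesseract.1.html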
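
The usage line above maps naturally onto an argument list for a subprocess call. The sketch below only builds that list and does not run anything; `build_tesseract_args` is a hypothetical helper, and the language code in the usage note is just an example.

```python
def build_tesseract_args(imagename, outbase, lang=None, psm=None,
                         configfiles=()):
    """Build: tesseract imagename outbase [-l lang] [-psm N] [configfile...]"""
    args = ["tesseract", imagename, outbase]
    if lang is not None:
        args += ["-l", lang]
    if psm is not None:
        # "-psm" follows the usage line quoted in this article;
        # Tesseract 4 and later spell this option "--psm"
        args += ["-psm", str(psm)]
    args += list(configfiles)
    return args
```

For example, `build_tesseract_args("eurotext.tif", "result", lang="deu")` returns `['tesseract', 'eurotext.tif', 'result', '-l', 'deu']`, which could then be handed to `subprocess.check_call`.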

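Other languages can be recognized with the following steps:

(1) Download the data pack for the language you need:

https://code.google.com/p/tesseract-ocr/downloads/list

Put the language pack into pytesser's tessdata folder.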
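
After copying the files, you may want to confirm which languages the tessdata folder now contains. The sketch below assumes the data files are named `<lang>.<something>` (for example `eng.traineddata`) and simply collects the prefixes; `available_languages` is a hypothetical helper, and other files in the folder that contain a dot would show up in the list as well.

```python
import os


def available_languages(tessdata_dir):
    """Return the sorted language codes found in tessdata_dir.

    Assumes data files are named <lang>.<something>; subdirectories
    and files without a dot are ignored.
    """
    languages = set()
    for filename in os.listdir(tessdata_dir):
        full_path = os.path.join(tessdata_dir, filename)
        if os.path.isfile(full_path) and "." in filename:
            prefix = filename.split(".", 1)[0]
            if prefix:  # skip hidden files such as ".DS_Store"
                languages.add(prefix)
    return sorted(languages)
```

A code from this list is what would be passed to tesseract's -l option.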

Next, modify the parameters in pytesser.py. Here is an example:

    """OCR in Python using the Tesseract engine from Google
    http://code.google.com/p/pytesser/
    by Michael J.T. O'Kelly
 
