PyTesser
PyTesser is an Optical Character Recognition module for Python. It takes as input an image or image file and outputs a string.
PyTesser uses the Tesseract OCR engine, converting images to an accepted format and calling the Tesseract executable as an external script. A Windows executable is provided along with the Python scripts. The scripts should work in other operating systems as well.
这是官网的介绍,用法很简单,下载,解压,比如E:\QQDownload\pytesser_v0.0.1
打开命令行,cd到当前目录,运行python,
>>> from pytesser import *
>>> image = Image.open('fnord.tif') # Open image object using PIL
>>> print image_to_string(image) # Run tesseract.exe on image
fnord
>>> print image_file_to_string('fnord.tif')
fnord
先试了下自带的png图片,确实识别出来了,然后又去12306上弄下来验证码图片,直接哑火了,哎,用起来确实很简单,可是这渣一样的识别率。。。。。