[OCR CAPTCHA Recognition] Based on Tesseract

Fx_x

Last modified: 2022-11-03 17:20:53

Category: Python. Tags: python, image processing

First published: 2022-10-30 18:25:57

Article link: https://blog.csdn.net/Fx_2003/article/details/127602505

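1. Environment setup (Windows)

Open cmd (the Command Prompt) and run: pip install pytesseract
Install Tesseract-OCR: it can be downloaded from https://sourceforge.net/projects/tesseract-ocr-alt/files/, where an exe installer is available.

The download starts automatically after a few seconds (if the page does not load, retry a few times).

Run the installer. Clicking Next through every step is enough, though I recommend installing to a drive other than C:. The installation takes a while.

Verify: in cmd, run tesseract -v

If the command prints the Tesseract version information, the installation succeeded.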
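The same check can also be scripted, which is handy before calling pytesseract from Python. The following is a minimal sketch that uses only the standard library; the helper name `tesseract_version` is my own and is not part of Tesseract or pytesseract.

```python
import shutil
import subprocess


def tesseract_version(executable="tesseract"):
    """Return the first line printed by `<executable> -v`, or None if it is not on PATH."""
    path = shutil.which(executable)
    if path is None:
        return None
    result = subprocess.run([path, "-v"], capture_output=True, text=True)
    # Some Tesseract builds print the version to stderr instead of stdout.
    output = (result.stdout or result.stderr).strip()
    return output.splitlines()[0] if output else None


if __name__ == "__main__":
    print(tesseract_version() or "tesseract was not found on PATH")
```

If this returns None even though Tesseract is installed, the install directory is probably not on PATH. In that case, either add it to PATH or point pytesseract at the executable by setting `pytesseract.pytesseract.tesseract_cmd` to its full path (for example `r"D:\Tesseract-OCR\tesseract.exe"`, a hypothetical location).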

2. Goal

Recognize the English letters in an image.

3. Implementation

# -*- coding: utf-8 -*-
"""
@File  : OCR.py
@author: FxDr
@Time  : 2022/10/30 18:12
"""
from PIL import Image
import pytesseract

# Load the image and print the text that Tesseract recognizes in it
th = Image.open("img_out5.png")
print(pytesseract.image_to_string(th))

The script prints the text that Tesseract recognized in the image.
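Since the goal is letters only, it may help to tell Tesseract so. The sketch below passes a page segmentation mode and a character whitelist through the `config` argument of `pytesseract.image_to_string`. The helper names `build_config` and `read_letters` are illustrative, `--psm 7` (treat the image as a single text line) assumes a one-line CAPTCHA, and whether the whitelist is honored depends on the installed Tesseract version and engine.

```python
import string


def build_config(psm=7, whitelist=string.ascii_letters):
    """Build a Tesseract config string: page segmentation mode plus allowed characters."""
    return f"--psm {psm} -c tessedit_char_whitelist={whitelist}"


def read_letters(image_path):
    """Run OCR on one image, restricted to English letters (assumption: single text line)."""
    # Imported here so that build_config can be used without the OCR packages installed.
    from PIL import Image
    import pytesseract

    with Image.open(image_path) as img:
        text = pytesseract.image_to_string(img, lang="eng", config=build_config())
    return text.strip()
```

Calling `read_letters("img_out5.png")` would then replace the two OCR lines of the script above. Treat the settings as a starting point to experiment with, not as values known to be best for this image.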

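4. Binarization to remove noise

A CAPTCHA image usually contains noise besides the text, and the text is noticeably darker than the rest of the picture. Binarization exploits this: every pixel whose gray value is at or above a chosen threshold is set to the maximum gray value (white), and every pixel below the threshold is set to the minimum gray value (black). The result is an image with only two levels, in which the dark text survives and the lighter noise is wiped out.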
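To make the rule concrete, here is a small sketch that applies it to raw gray values with no image library. The function name `binarize` and the sample row are made up for illustration; 150 is the threshold used in this post, and a value equal to the threshold is treated as bright.

```python
def binarize(gray_rows, threshold=150, low=0, high=255):
    """Map each gray value to `low` if it is below `threshold`, otherwise to `high`."""
    return [[low if p < threshold else high for p in row] for row in gray_rows]


sample = [[12, 149, 150, 230]]
print(binarize(sample))  # [[0, 0, 255, 255]]
```

Raising the threshold turns more pixels black, so faint strokes are kept but more noise survives; lowering it does the opposite.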

The code:

# -*- coding: utf-8 -*-
"""
@File  : OCR处理验证码.py
@author: FxDr
@Time  : 2022/10/30 17:17
"""

from PIL import Image
import pytesseract

# Convert the image to grayscale
im = Image.open("img.png")
g = im.convert('L')
# g.show()

# Binarize: gray values below the threshold map to 0 (black), the rest to 1 (white)
threshold = 150
table = [0 if i < threshold else 1 for i in range(256)]
out = g.point(table, '1')
out.show()
out.save("img_out.png")

# Run OCR on the binarized image
th = Image.open("img_out.png")
print(pytesseract.image_to_string(th))

Running the script opens the binarized image, img_out.png, and prints the recognized text.