Python 图片文字识别和 tesseract 问题解决

有请小发菜

已于 2023-11-05 19:28:53 修改

阅读量678

点赞数 3

分类专栏： Python 文章标签： python 开发语言

于 2023-11-05 19:20:51 首次发布

本文链接：https://blog.csdn.net/weixin_50357986/article/details/134233359

版权

Python 专栏收录该内容

32 篇文章 4 订阅

订阅专栏

1.图片文字识别测试代码

安装需要的工具包

pip install Pillow
pip install pytesseract

测试代码

import pytesseract
from PIL import Image
# 定义图片地址变量
image_path = '/Users/guanfawang/Downloads/Untitled-31.png'
# 打开图片文件
image_open = Image.open(image_path)
# 使用 PyTesseract 进行 OCR 文字识别
image_text = pytesseract.image_to_string(Image.open(image_path), lang='chi_sim')
# 打印结果
print(image_text)

2.存在问题和解决

执行步骤1代码发现报以下错误：

pytesseract.pytesseract.TesseractNotFoundError: tesseract is not installed or it’s not in your PATH. See README file for more information.

原因是tesseract未安装，使用pip install tesseract 会存在问题，可以使用以下命令安装和查看：

# 安装 tesseract
brew install tesseract
# 查看是否安装
tesseract --version

如果没有brew说明没有安装Homebrew，可以查看 Homebrew 安装卸载和配置使用文章。

找到并复制tesseract的路径位置，将pytesseract.py文件的tesseract_cmd变量更改为对应路径；

在这里插入图片描述

也可以不调整pytesseract.py文件内容，在步骤1的代码增加一个全局变量；

pytesseract.pytesseract.tesseract_cmd = r'/opt/homebrew/Cellar/tesseract/5.3.3/bin/tesseract'

再次执行步骤1发现报以下错误：

pytesseract.pytesseract.TesseractError: (1, ‘Error opening data file /opt/homebrew/share/tessdata/chi_sim.traineddata Please make sure the TESSDATA_PREFIX environment variable is set to your “tessdata” directory. Failed loading language ‘chi_sim’ Tesseract couldn’t load any languages! Could not initialize tesseract.’)

该错误提示表明在运行 pytesseract 时遇到了问题，具体是由于无法找到中文语言包 chi_sim.traineddata 或无法初始化 tesseract 导致的。

Tesseract OCR 官方的语言包下载页面：https://github.com/tesseract-ocr/tessdata

搜索并下载chi_sim.traineddata，需要将 chi_sim.traineddata 文件移动到 Tesseract 的 “tessdata” 目录中，重新执行就可以了。

常见的语言参数选择有 chi_sim（简体中文）、eng（英文）、osd、snum。

print(pytesseract.get_languages(config=''))  # 输出 ['chi_sim', 'eng', 'osd', 'snum']

3.完整代码

import pytesseract
from PIL import Image
# 调整tesseract路径
pytesseract.pytesseract.tesseract_cmd = r'/opt/homebrew/Cellar/tesseract/5.3.3/bin/tesseract'
# 定义图片地址变量
image_path = '/Users/guanfawang/Downloads/Untitled-48.png'
# 打开图片文件
image_open = Image.open(image_path)
# 使用 PyTesseract 进行 OCR 文字识别
image_text = pytesseract.image_to_string(Image.open(image_path), lang='chi_sim')
# 打印结果
print(image_text)
# print(text.replace(' ',''))