Image Text Recognition in Python with the tesserocr Library

王球球啊

Last modified 2024-03-11 14:28:40

First published 2024-03-11 14:04:17

Article link: https://blog.csdn.net/qq_36737982/article/details/136616646

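Contents

I. Introduction to tesserocr
II. Download and Installation
III. Basic Usage
IV. Troubleshooting Notes
- 1. Installing language packs in an intranet environment
- 2. Recognizing text from a file fails with RuntimeError: Failed to read picture
V. References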

I. Introduction to tesserocr

tesserocr is a simple, Pillow-friendly Python library for optical character recognition (OCR) that wraps the tesseract-ocr API. tesseract must be installed before tesserocr can be used.

II. Download and Installation

Note: The following uses installation on Windows as an example. For other environments, see [Reference 1].

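tesserocr requires tesseract to be installed first. Go to the download page and choose the installer that fits your needs.
Run the .exe installer. During setup you can tick the Additional language data (download) option, or individual entries under it, to install additional OCR language packs (this requires Internet access; for an intranet environment, see section IV, Troubleshooting Notes).
Configure the environment variables:
1> Add the tesseract installation directory (the directory containing tesseract.exe) to the system Path variable

2> Create a new system variable TESSDATA_PREFIX and set it to the tessdata directory under the tesseract installation directory

3> Run tesseract --version in a Command Prompt window to confirm that tesseract is installed and the environment variables are configured correctly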
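
The environment check above can also be scripted. The following is a minimal sketch that uses only the Python standard library; `check_tesseract_setup` is a hypothetical helper name for illustration, not part of tesseract or tesserocr. It reports whether a `tesseract` executable is found on the search path and whether `TESSDATA_PREFIX` points to an existing directory.

```python
import os
import shutil


def check_tesseract_setup(env=None):
    """Return a dict describing the tesseract-related environment.

    `env` defaults to os.environ; passing a plain dict makes it easy
    to try the function without touching the real environment.
    """
    env = os.environ if env is None else env
    tessdata = env.get("TESSDATA_PREFIX")
    return {
        # Full path of the tesseract executable, or None if it is not found
        "tesseract_exe": shutil.which("tesseract", path=env.get("PATH")),
        # Raw value of TESSDATA_PREFIX, or None if the variable is not set
        "tessdata_prefix": tessdata,
        # True only if TESSDATA_PREFIX is set and names an existing directory
        "tessdata_exists": bool(tessdata) and os.path.isdir(tessdata),
    }


if __name__ == "__main__":
    for key, value in check_tesseract_setup().items():
        print(f"{key}: {value}")
```

If `tesseract_exe` is reported as `None` or `tessdata_exists` as `False`, the corresponding environment variable is the first thing to recheck. Environment variable changes on Windows only take effect in newly opened terminals.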

III. Basic Usage

Note: The following uses Python 3.8 on Windows as an example.

Install the tesserocr and pillow libraries:
```
pip install tesserocr pillow
```
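Running pip install tesserocr directly may fail. In that case, download the matching .whl file and install from it (choose the file that matches your Python version and operating system, and replace the .whl path below with your actual path):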
```
pip install E:\pypi\tesserocr\tesserocr-2.6.0-cp38-cp38-win_amd64.whl
```
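
To judge whether a wheel matches your interpreter, it helps to compare the tags in its file name with the running Python. In the file name above, `cp38` stands for CPython 3.8 and `win_amd64` for 64-bit Windows. The sketch below is only an illustration: `current_python_tag` and `parse_wheel_tags` are hypothetical helper names, and the tag comparison assumes a CPython interpreter.

```python
import sys


def current_python_tag():
    """Return the CPython tag of the running interpreter, e.g. 'cp38'."""
    return f"cp{sys.version_info.major}{sys.version_info.minor}"


def parse_wheel_tags(filename):
    """Split a wheel file name into (python_tag, abi_tag, platform_tag).

    Wheel names end in '-{python}-{abi}-{platform}.whl', so the last
    three dash-separated fields are the tags.
    """
    if not filename.endswith(".whl"):
        raise ValueError(f"not a wheel file name: {filename!r}")
    parts = filename[: -len(".whl")].split("-")
    if len(parts) < 5:
        raise ValueError(f"unexpected wheel file name: {filename!r}")
    python_tag, abi_tag, platform_tag = parts[-3:]
    return python_tag, abi_tag, platform_tag


if __name__ == "__main__":
    wheel = "tesserocr-2.6.0-cp38-cp38-win_amd64.whl"
    python_tag, _, platform_tag = parse_wheel_tags(wheel)
    print("interpreter tag:", current_python_tag())
    print("wheel python tag:", python_tag)
    print("wheel platform tag:", platform_tag)
```

If the interpreter tag and the wheel's Python tag differ, pip will refuse the wheel as unsupported on the current platform.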

Basic usage

```
# encoding=utf-8
"""
OCR utility class
"""
import tesserocr

from PIL import Image


class TesserOCRUtil:

    @staticmethod
    def get_languages():
        """
        Get the languages supported in the current environment.
        :return: a 2-tuple: the path of the parent directory of tessdata, and the languages supported in the current environment
        """
        return tesserocr.get_languages()

    @staticmethod
    def image_to_text(img: Image.Image, lang: str = 'chi_sim'):
        """
        Recognize text from an image.
        :param img: the image (a Pillow Image object)
        :param lang: the language of the text in the image
        :return: the recognized text
        """
        return tesserocr.image_to_text(img, lang=lang)

    @staticmethod
    def file_to_text(img_path: str, lang: str = 'chi_sim'):
        """
        Recognize text from an image file.
        :param img_path: the file path (a path containing Chinese characters raises an error)
        :param lang: the language of the text in the file
        :return: the recognized text
        """
        return tesserocr.file_to_text(img_path, lang=lang)


if __name__ == '__main__':
    # print(TesserOCRUtil.get_languages())

    # image = Image.open('2024-03-08_111808.png')
    # image = Image.open('chinese.png')
    # print(TesserOCRUtil.image_to_text(image, 'chi_sim'))

    print(TesserOCRUtil.file_to_text('chinese.png', 'chi_sim'))
```
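
Because recognition depends on the requested language data being installed, it can be useful to check the result of `tesserocr.get_languages()` before calling the recognizer. The following is a minimal sketch under that assumption; `pick_language` and `file_to_text_checked` are hypothetical helpers, not part of tesserocr, and falling back to `eng` is an arbitrary choice for illustration.

```python
def pick_language(preferred, available, fallback="eng"):
    """Return `preferred` if every language it names is installed.

    `preferred` may combine languages with '+', e.g. 'chi_sim+eng',
    which is how tesseract accepts multiple languages.
    """
    wanted = preferred.split("+")
    if all(lang in available for lang in wanted):
        return preferred
    if fallback in available:
        return fallback
    raise ValueError(f"none of {wanted + [fallback]} found in {sorted(available)}")


def file_to_text_checked(img_path, preferred="chi_sim"):
    """OCR a file after checking that the language data is installed."""
    # Imported lazily so pick_language stays usable without tesserocr.
    import tesserocr

    _, languages = tesserocr.get_languages()
    return tesserocr.file_to_text(img_path, lang=pick_language(preferred, languages))
```

With this in place, `file_to_text_checked('chinese.png')` uses `chi_sim` when it is installed and otherwise falls back to `eng`, raising a `ValueError` if neither is available.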