Image Recognition

科大小笨

Recognizing Chinese characters, letters, and digits in images with Python and tesseract-ocr

March 9, 2018, Python, LEO

Environment: Ubuntu + Python 2.7

Code:

#!/usr/bin/env python
# -*- coding: UTF-8 -*-

from PIL import Image
import pytesseract

# Open the image and run OCR with the Simplified Chinese language data
text = pytesseract.image_to_string(Image.open('/root/Desktop/444.jpg'), lang='chi_sim')
print(text)
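
The call above loads only chi_sim. For images that mix Chinese characters with Latin letters and digits, Tesseract accepts several language codes joined with "+". The following is a minimal sketch, not the author's code: it assumes Python 3 and that the eng language data is installed, and the helper names build_lang and ocr_image are made up for illustration.

```python
def build_lang(*codes):
    """Join Tesseract language codes with '+', e.g. 'chi_sim' and 'eng' -> 'chi_sim+eng'."""
    return "+".join(codes)


def ocr_image(path, codes=("chi_sim", "eng")):
    """Run OCR on one image file using the given language codes (hypothetical helper)."""
    # Imported here so build_lang can be used even without the OCR stack installed.
    from PIL import Image
    import pytesseract

    with Image.open(path) as img:
        return pytesseract.image_to_string(img, lang=build_lang(*codes))
```

Calling ocr_image('/root/Desktop/444.jpg') would then run the same image through both language models. Whether this improves or hurts accuracy depends on the image, so it is worth comparing against the chi_sim-only result.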

Result:

Steps:

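1. We need two libraries here: pytesseract and PIL.

2. We also need to install the recognition engine, tesseract-ocr.

3. Download the Simplified Chinese language data, chi_sim.traineddata.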

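Install pytesseract and PIL

PIL is distributed on PyPI as Pillow, so install that package (it still provides the PIL import name):

pip install Pillow

pip install pytesseract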

Install the recognition engine tesseract-ocr

Install Tesseract:

sudo apt-get install tesseract-ocr

Install the Simplified Chinese language pack:

sudo apt-get install tesseract-ocr-chi-sim
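
After installing, it can help to confirm that the engine is on the PATH and that chi_sim is actually available. The sketch below is one possible check, assuming Python 3.7+ and the tesseract command-line option --list-langs; the function names parse_langs and installed_langs are illustrative only.

```python
import shutil
import subprocess


def parse_langs(output):
    """Extract language codes from the text printed by `tesseract --list-langs`."""
    langs = []
    for line in output.splitlines():
        line = line.strip()
        # Skip blank lines and the "List of available languages ..." header.
        if not line or line.lower().startswith("list of available languages"):
            continue
        langs.append(line)
    return langs


def installed_langs():
    """Return installed language codes, or [] if tesseract is missing or fails."""
    exe = shutil.which("tesseract")
    if exe is None:
        return []
    result = subprocess.run([exe, "--list-langs"], capture_output=True, text=True)
    if result.returncode != 0:
        return []
    # Some Tesseract versions print the list to stderr, so read both streams.
    return parse_langs(result.stdout + "\n" + result.stderr)
```

If "chi_sim" is missing from the list returned by installed_langs(), the language pack was probably not installed or sits in a directory Tesseract does not search.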

Download the Simplified Chinese language data

URL: https://download.csdn.net/download/leoeitail/10275552

Storage path: /usr/local/share/tessdata/
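
The tessdata location can differ between installations, so if Tesseract does not find chi_sim.traineddata at the storage path given above, the directory can be passed explicitly. This is a hedged sketch in Python 3: it relies on Tesseract's --tessdata-dir option passed through pytesseract's config argument, and the helper names has_language_data and tessdata_config are hypothetical.

```python
import os


def has_language_data(tessdata_dir, lang="chi_sim"):
    """Check that <tessdata_dir>/<lang>.traineddata exists."""
    return os.path.isfile(os.path.join(tessdata_dir, lang + ".traineddata"))


def tessdata_config(tessdata_dir):
    """Build the config string that points Tesseract at a tessdata directory."""
    return '--tessdata-dir "{}"'.format(tessdata_dir)
```

A call could then look like pytesseract.image_to_string(img, lang='chi_sim', config=tessdata_config('/usr/local/share/tessdata')), ideally after has_language_data confirms the file is really there.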
