Python boundary detection: getting the bounding boxes of recognized words with python-tesseract

Latest recommended article published on 2022-10-01 18:29:26

weixin_39897449


Article tags: Python boundary detection

I am using python-tesseract, a Python wrapper for the Tesseract OCR engine, to extract words from an image.

I am using the following code to get the words:

import tesseract

# Set up the Tesseract engine for English.
api = tesseract.TessBaseAPI()
api.Init(".", "eng", tesseract.OEM_DEFAULT)
# Restrict recognition to digits and lowercase letters.
api.SetVariable("tessedit_char_whitelist", "0123456789abcdefghijklmnopqrstuvwxyz")
api.SetPageSegMode(tesseract.PSM_AUTO)

# Read the image as raw bytes and run OCR on the buffer.
mImgFile = "test.jpg"
with open(mImgFile, "rb") as f:
    mBuffer = f.read()
result = tesseract.ProcessPagesBuffer(mBuffer, len(mBuffer), api)
print("result(ProcessPagesBuffer)= %s" % result)

This returns only the words, not their location, size, or orientation in the image (in other words, a bounding box containing each word). Is there any way to get that as well?

Solution

The tesseract.GetBoxText() method returns the exact position of each recognized character.
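As a rough sketch of how that output might be consumed, the Python 3 helper below assumes the value returned by GetBoxText() is text in Tesseract's box-file layout: one line per character in the form `symbol left bottom right top page`, with y measured from the bottom of the image. That layout is an assumption about this wrapper, not something confirmed above. `CharBox` and `parse_box_text` are hypothetical names introduced only for illustration.

```python
from typing import List, NamedTuple


class CharBox(NamedTuple):
    """One recognized character with a top-left-origin bounding box."""
    char: str
    left: int
    top: int
    right: int
    bottom: int


def parse_box_text(box_text: str, image_height: int) -> List[CharBox]:
    """Parse box-file style text into CharBox records.

    Assumes each line looks like 'symbol left bottom right top page'
    and that y coordinates are measured from the bottom of the image.
    """
    boxes = []
    for line in box_text.splitlines():
        parts = line.rsplit(None, 5)  # symbol + 5 integer fields
        if len(parts) != 6:
            continue  # skip blank or malformed lines
        symbol = parts[0]
        left, bottom, right, top = (int(v) for v in parts[1:5])
        # Flip the y axis so (0, 0) is the top-left corner of the image.
        boxes.append(CharBox(symbol, left, image_height - top,
                             right, image_height - bottom))
    return boxes
```

For example, `parse_box_text("T 36 92 96 116 0", 200)` yields one box with left 36, top 84, right 96, and bottom 108. Word-level boxes could then be built by merging the boxes of adjacent characters.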

In addition, the command line tesseract test.jpg result hocr generates a result.html file that contains the coordinates of each recognized word. However, I'm not sure whether it can be called from a Python script.
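One way this could work from Python is to launch the same command with the standard subprocess module and then read the word boxes out of the generated file. The sketch below is written for Python 3 and assumes the tesseract executable is on the PATH. It also assumes each word is an element with class `ocrx_word` whose `title` attribute contains `bbox x0 y0 x1 y1`, as in the hOCR format. `HocrWordParser`, `parse_hocr_words`, and `run_tesseract_hocr` are hypothetical names, and because newer Tesseract releases write `result.hocr` instead of `result.html`, both extensions are tried.

```python
import re
import subprocess
from html.parser import HTMLParser
from typing import List, Tuple

WordBox = Tuple[str, int, int, int, int]  # (text, left, top, right, bottom)


class HocrWordParser(HTMLParser):
    """Collect the text and bbox of every element with class 'ocrx_word'."""

    def __init__(self):
        super().__init__()
        self.words: List[WordBox] = []
        self._bbox = None  # bbox of the word currently being read, if any
        self._depth = 0    # tag nesting depth inside the current word
        self._text = []

    def handle_starttag(self, tag, attrs):
        if self._bbox is not None:
            self._depth += 1  # nested markup such as <strong> inside a word
            return
        attr = dict(attrs)
        if "ocrx_word" in (attr.get("class") or "").split():
            match = re.search(r"bbox (\d+) (\d+) (\d+) (\d+)",
                              attr.get("title") or "")
            if match:
                self._bbox = tuple(int(v) for v in match.groups())
                self._depth = 1
                self._text = []

    def handle_endtag(self, tag):
        if self._bbox is None:
            return
        self._depth -= 1
        if self._depth == 0:  # the word element itself just closed
            self.words.append(("".join(self._text).strip(),) + self._bbox)
            self._bbox = None

    def handle_data(self, data):
        if self._bbox is not None:
            self._text.append(data)


def parse_hocr_words(hocr: str) -> List[WordBox]:
    """Return (text, left, top, right, bottom) for each word in hOCR markup."""
    parser = HocrWordParser()
    parser.feed(hocr)
    parser.close()
    return parser.words


def run_tesseract_hocr(image_path: str, output_base: str) -> List[WordBox]:
    """Run 'tesseract <image> <base> hocr' and parse the file it writes."""
    subprocess.run(["tesseract", image_path, output_base, "hocr"], check=True)
    for ext in (".hocr", ".html"):  # the extension depends on the version
        try:
            with open(output_base + ext, encoding="utf-8") as fh:
                return parse_hocr_words(fh.read())
        except FileNotFoundError:
            continue
    raise FileNotFoundError("no hOCR output found for " + output_base)
```

A call such as `run_tesseract_hocr("test.jpg", "result")` would then return one tuple per recognized word. hOCR coordinates are in pixels with the origin at the top-left corner of the image, so no axis flip is needed.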
