python图片识别表格_OCR Table - 从包含表格的扫描图片中识别表格和文字

该项目包含一个DLL和一个EXE,用于识别包含表格的扫描文档,保留表格结构并以Microsoft Word文档形式保存结果。使用Visual C++开发的DLL实现核心的表格结构识别和文本识别功能,C#开发的EXE提供用户界面。支持英文和简体中文字符识别,依赖于Tesseract开源项目。提供开发环境和修订历史信息。
摘要由CSDN通过智能技术生成

OCR Table

Introduction

For scanning copies containing tables or forms, many OCR softwares recognize text in entire page as whole by discarding all tables. Sometimes it is inconvenient for users. This project retains table structures as well and save the recognizing result as a Microsoft Word document.

This project consists of a DLL and an EXE, both of which are 64-bit. The subdirectory corresponding to DLL is tableocr, developed by Visual C++. It implements core functions, including table structure recognition and text recognition. the subdirectory corresponding to EXE is ocrtable, developed by C#, which provides user interface. Below the pictures directory are sample scanning copies.

Suggestions are welcome. In addition to submitting an issue, you can email me as well. My email address is 31416@msn.cn.

Recognition Performance

English Character Recognition Example

Please do not

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值