OCR项目汇总

最新推荐文章于 2025-04-14 11:54:40 发布

oneTaken

最新推荐文章于 2025-04-14 11:54:40 发布

阅读量4.1k

点赞数 1

版权声明：本文为博主原创文章，遵循 CC 4.0 BY-SA 版权协议，转载请附上原文出处链接和本声明。

本文链接：https://blog.csdn.net/u011394059/article/details/77076190

版权

基本介绍

1 OCR文字识别用的是什么算法？|知乎
2 深度学习文字识别论文综述|CSDN，综述中涉及到的论文都很旧，
3 文字检测与识别资源|CSDN,涉及的论文都很新，五颗星
4 Awesome Scene Text Recognition,awesome,五颗星
5 OCR, 这个博主的质量都很高，五颗星
6 YunOS场景文字识别|阿里云

paper

reading text in the wild, VGG 组
1 Reading Text in the Wild with Convolutional Neural Networks, VGG组，, IJCV2016
阅读笔记|CSDN
2 Synthetic Data for Text Localisation in Natural Images， VGG组， CVPR2016，
阅读笔记|CSDN，code
3 Deep Features for Text Spotting
, VGG组， ECCV2014
4 Detecting Text in Natural Image with
Connectionist Text Proposal Network,
code, ECCV2016

CVPR2017相关paper

Awesome Typography: Statistics-Based Text Effects Transfer,文字生成，效果很酷炫
EAST: An Efficient and Accurate Scene Text Detector, 快&准的场景文字检测
Detecting Oriented Text in Natural Images by Linking Segments
Deep Matching Prior Network: Toward Tighter Multi-oriented Text Detection
Unambiguous Text Localization and Retrieval for Cluttered Scenes
, 文本定位和检索

数据集

gtihub code

1 tesseract, stars 12k, C/C++接口
2 tesseract.js, stars 12k, pure js,支持62种语言的OCR
3 paperless, stars 3.6k, 主打document OCR
4 pyocr, starts 606, A Python wrapper for Tesseract and Cuneiform
5 doc2text, stars 1k, 依赖opencv与tesseract
6 pdftabextract, stars 668,pdf中的表格提取转换到excel中
7 tesserocr,tesseract-ocr API
的python 接口
8 SSD_scene_text_detection, 将SSD用于场景文本检测中

复现点：
1 paper: reading text in the wild with deep convolutional neural network
论文阅读笔记：论文阅读：Reading Text in the Wild with Convolutional Neural Networks,
部分代码为code|matlab

文章的主要思想为先利用region proposal产生出足够多的候选区域，再resize这些候选框到固定大小，用一个CNN来对这些候选框进行单词的分类，超过90k个单词。使用生成的带文本的图片的方法，能够保证文本单词的样本量。
思路很清晰，限制条件也很明显，不能出现样本外的单词，诸如一些合成词；此外，候选框也需要完整地包含单词。

2 paper : EAST: An Efficient and Accurate Scene Text Detector
旷视的最新成果。

博客等级

码龄12年

480
原创

196
点赞

322
收藏

62
粉丝

关注

私信

热门文章

分类专栏

展开全部收起

上一篇：: 188

下一篇：: 190

最新评论

pycharm 专业版激活
排球游戏开发程序猿: 发片网站啊？？？
ffmpeg 提取关键帧
ilunye: 这个是关键帧吗？好像就只是每隔帧率个帧截取一次....
pytorch runtime error: CUDNN_STATUS_MAPPING_ERROR
LH Y: Traceback (most recent call last): File "E:/code_elder_femal/others_code/re and de/Github_code/MDA_GAN-main/TRAIN_CODE/train.py", line 153, in <module> main() File "E:/code_elder_femal/others_code/re and de/Github_code/MDA_GAN-main/TRAIN_CODE/train.py", line 100, in main scaler.scale(g_loss).backward() File "E:\Studing_Enviroment\anaconda3\envs\pytorch\lib\site-packages\torch\_tensor.py", line 307, in backward torch.autograd.backward(self, gradient, retain_graph, create_graph, inputs=inputs) File "E:\Studing_Enviroment\anaconda3\envs\pytorch\lib\site-packages\torch\autograd\__init__.py", line 156, in backward allow_unreachable=True, accumulate_grad=True) # allow_unreachable flag RuntimeError: cuDNN error: CUDNN_STATUS_MAPPING_ERROR Process finished with exit code 1
pydev debugger: warning: trying to add breakpoint to file that does not exist
Serendipity_CQ: 有用，之前的断点没删
pydev debugger: warning: trying to add breakpoint to file that does not exist
weimengchuan: 有可能是因为没有设置路径映射。在 Run -> Edit configurations -> Path mapping 设置

大家在看

最新文章

目录

展开全部

收起

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。