Nougat: OCR for Scientific Documents, Usage Notes

TYUT_xiaoming

Last modified: 2024-01-04 10:38:21


Tags: ocr nougat

First published: 2024-01-04 10:36:09

Article link: https://blog.csdn.net/TYUT_xiaoming/article/details/135380159


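Summary (auto-generated by CSDN): This post describes how to install and configure the Nougat OCR model in a Python environment, with particular attention to what is needed to run on a GPU (for example NVIDIA CUDA 11.8), the performance when processing a 16-page PDF on a 3090 GPU, and the observation that recognition accuracy depends on how legible the text is.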

https://github.com/facebookresearch/nougat

The Python environment needs to be version 3.8 or later.

Install: pip install nougat-ocr
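A quick way to confirm both prerequisites is a small check script. This is only a sketch: `check_environment` and `MIN_PYTHON` are names made up for illustration, and the 3.8 minimum is the one stated in this post, not something read from the package itself.

```python
import sys
from importlib import metadata

MIN_PYTHON = (3, 8)  # minimum version stated in this post (assumption, not read from the package)


def check_environment(package="nougat-ocr"):
    """Return (python_ok, installed_version); installed_version is None if the package is missing."""
    python_ok = sys.version_info >= MIN_PYTHON
    try:
        installed_version = metadata.version(package)
    except metadata.PackageNotFoundError:
        installed_version = None
    return python_ok, installed_version


if __name__ == "__main__":
    python_ok, installed_version = check_environment()
    print("Python version OK:", python_ok)
    print("nougat-ocr version:", installed_version or "not installed")
```

If the second line prints "not installed", the pip command above was probably run in a different environment from the one running this script.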

Default model download location: /home/****/.cache/torch/hub/nougat-0.1.0-small
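To see whether the weights have already been downloaded, the directory can be checked directly. The sketch below assumes the masked `/home/****` part is the current user's home directory and that no environment variable (such as `TORCH_HOME`) has moved the cache; `default_model_dir` and `model_is_cached` are hypothetical helper names.

```python
from pathlib import Path


def default_model_dir(name="nougat-0.1.0-small"):
    """Build the cache path from this post, assuming the masked part is the user's home."""
    return Path.home() / ".cache" / "torch" / "hub" / name


def model_is_cached(name="nougat-0.1.0-small"):
    """True if the model directory exists and contains at least one entry."""
    model_dir = default_model_dir(name)
    return model_dir.is_dir() and any(model_dir.iterdir())


if __name__ == "__main__":
    print(default_model_dir())
    print("already downloaded:", model_is_cached())
```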

Once the environment is installed, it runs on the CPU by default:

UserWarning: CUDA initialization: The NVIDIA driver on your system is too old (found version 11080). Please update your GPU driver by downloading and installing a new version from the URL: http://www.nvidia.com/Download/index.aspx Alternatively, go to: https://pytorch.org to install a PyTorch version that has been compiled with your version of the CUDA driver. (Triggered internally at ../c10/cuda/CUDAFunctions.cpp:108.)
return torch._C._cuda_getDeviceCount() > 0
WARNING:root:No GPU found. Conversion on CPU is very slow.
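The number in the warning (`11080`) looks opaque but can be decoded. Assuming it uses the CUDA convention of `1000 * major + 10 * minor`, it reads as CUDA 11.8, meaning the driver supports up to CUDA 11.8 while the installed PyTorch build expects something newer. `decode_cuda_version` is an illustrative helper, not part of any library.

```python
def decode_cuda_version(encoded):
    """Split a CUDA version integer (1000 * major + 10 * minor) into (major, minor)."""
    major = encoded // 1000
    minor = (encoded % 1000) // 10
    return major, minor


if __name__ == "__main__":
    major, minor = decode_cuda_version(11080)  # value taken from the warning above
    print(f"Driver supports up to CUDA {major}.{minor}")
```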

To use the GPU, torch and its related packages have to be reinstalled in builds that match your own CUDA version; in my case that is CUDA 11.8.
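After reinstalling, it is worth confirming that PyTorch actually sees the GPU before running Nougat again. The following is a minimal sketch; `gpu_report` is a hypothetical helper, and it returns early if torch is not importable.

```python
def gpu_report():
    """Collect the facts that decide whether conversion will run on the GPU."""
    try:
        import torch
    except ImportError:
        return {"torch_installed": False}
    return {
        "torch_installed": True,
        "torch_version": torch.__version__,
        "built_for_cuda": torch.version.cuda,  # None on CPU-only builds
        "cuda_available": torch.cuda.is_available(),
    }


if __name__ == "__main__":
    for key, value in gpu_report().items():
        print(f"{key}: {value}")
```

For CUDA 11.8, `built_for_cuda` should read `11.8` and `cuda_available` should be `True`. The PyTorch site publishes CUDA 11.8 wheels under the index `https://download.pytorch.org/whl/cu118`, so a command along the lines of `pip install torch torchvision --index-url https://download.pytorch.org/whl/cu118` is the usual route, though the exact package list and versions depend on your setup.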