基于Pytorch的身份证及其他证件检测矫正模型应用

最新推荐文章于 2025-03-06 10:18:57 发布

番茄小能手

最新推荐文章于 2025-03-06 10:18:57 发布

阅读量2.4k

点赞数 31

分类专栏： Pytorch 文章标签： pytorch 人工智能 python

本文链接：https://blog.csdn.net/YY007H/article/details/135614249

版权

Pytorch 专栏收录该内容

2 篇文章

订阅专栏

本文介绍了一种利用Python和PyTorch库进行身份证等证件图片自动摆正的算法，CardDetectionCorrection，以提高OCR文字识别的准确性。该方法适用于多证混贴和任意角度检测，准确率高达99%。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

前言

在做身份证和其他证件识别的时候，图片基本都不是摆正的状态，此时在进行OCR文字识别的提取文字信息的时候会出现很多误差，如何将证件摆正，再进行OCR文字识别就可以大大提高准确率。

准备工作

1、Python环境，在Python官网下载安装

2、项目代码，下载地址在文章最后

开始

以上准备工作完成后，就可以开始使用

1、下载依赖包

pip install pyaml
pip install torch
pip install opencv-python

2、编写预测代码，cpu中运行

import cv2

from core.infer import CardDetectionCorrection

card_detection_correction = CardDetectionCorrection(
    model_path="./models/card_correction/model.pt",
    config_path="./models/card_correction/config.json",
    device="cpu"
)
img = cv2.imread("images/image3.jpg")
results = card_detection_correction(img)


for i, result in enumerate(results):
    output_img = result["output_img"]
    cv2.imwrite('output/image_' + str(i) + '.jpg', output_img)

3、gpu中运行

默认使用cpu运行，如果需要在gpu中运行，首先要配置GPU环境，可通过这篇文章进行配置【Ubuntu系统配置深度学习环境之nvidia显卡驱动和cuda安装】。

安装完成后，初始化方法改成：

card_detection_correction = CardDetectionCorrection(
    model_path="./models/card_correction/model.pt",
    config_path="./models/card_correction/config.json",
    device="gpu"
)