OCR训练时，将txt文件和图片数据转为lmdb格式

最新推荐文章于 2024-02-04 10:06:21 发布

小龙呮

最新推荐文章于 2024-02-04 10:06:21 发布

阅读量485

点赞数

文章标签： python

本文链接：https://blog.csdn.net/weixin_44472033/article/details/120586065

版权

本文介绍如何使用Python脚本create_lmdb_dataset.py将OCR项目的txt文件和图像数据转换为lmdb数据库格式，以便于后续的训练过程。

摘要由CSDN通过智能技术生成

create_lmdb_dataset.py

""" a modified version of CRNN torch repository https://github.com/bgshih/crnn/blob/master/tool/create_dataset.py """

import fire
import os
import lmdb
import cv2

import numpy as np


def checkImageIsValid(imageBin):
    if imageBin is None:
        return False
    imageBuf = np.frombuffer(imageBin, dtype=np.uint8)
    img = cv2.imdecode(imageBuf, cv2.IMREAD_GRAYSCALE)
    imgH, imgW = img.shape[0], img.shape[1]
    if imgH * imgW == 0:
        return False
    return True


def writeCache(env, cache):
    with env.

最低0.47元/天解锁文章

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

小龙呮

关注关注

0
点赞
踩
1

收藏

觉得还不错? 一键收藏
0
评论
OCR训练时，将txt文件和图片数据转为lmdb格式

create_lmdb_dataset.py""" a modified version of CRNN torch repository https://github.com/bgshih/crnn/blob/master/tool/create_dataset.py """import fireimport osimport lmdbimport cv2import numpy as npdef checkImageIsValid(imageBin): if imageBi
复制链接

扫一扫