【文本】ICdar2015 文本数据集

hjxu2016

已于 2022-09-23 11:39:25 修改

阅读量2.2k

点赞数 1

分类专栏：计算机视觉|文本检测识别文章标签： python opencv 开发语言

于 2022-09-21 19:16:30 首次发布

本文链接：https://blog.csdn.net/hjxu2016/article/details/126973984

版权

计算机视觉|文本检测识别专栏收录该内容

1 篇文章

订阅专栏

文章目录

一、官网介绍
二、任务介绍
- - 2.1 文本定位
  - 2.2 单词识别

一、官网介绍

https://rrc.cvc.uab.es/?ch=4&com=tasks
官网有四个任务
下载地址链接：https://pan.baidu.com/s/160FzXzWdo8IPyp88XIIsIg 提取码：qdr2 --来自百度网盘超级会员V2的分享

来自 https://blog.csdn.net/weixin_45779880/article/details/105642393 的分享。

二、任务介绍

2.1 文本定位

每一个图像对应一个 text文件，并且每行按照 x1, y1, x2, y2, x3, y3, x4, y4, transcription 标注
且每个点是按照顺时针进行标注。“###” 指无需关注文本内容信息

训练集 1000张
测试集 500张

提供一个python显示的脚本

#!/usr/bin/env python3
# -*- coding: utf-8 -*-
"""                  
*  * *** *  * *  *      
*  *  *   **  *  *             
****  *   **  *  *                 
*  *  *   **  *  *         
*  * **  *  * ****                

@File    : read_idcar.py
@Date    : 2022/9/21/021
@Require :
@Author  : hjxu
@Funtion : 读取IDCAR2015  文本信息

"""

import cv2
import numpy as np

if __name__ == "__main__":
    img_path = "./train_images/img_2.jpg"
    txt_file_path = "./train_gts/img_2.jpg.txt"

    img = cv2.imread(img_path)

    txt_file = open(txt_file_path, "r", encoding='utf_8_sig')
    for line in txt_file.readlines():
        x1, y1, x2, y2, x3, y3, x4, y4, text = line.strip().split(",")
        # print(int(x1), int(y1), int(x2), int(y2), int(x3), int(y3), int(x4), int(y4), text)
        contours = [np.array([
            [int(x1), int(y1)], [int(x2), int(y2)], [int(x3), int(y3)],[int(x4), int(y4)]])]
        cv2.drawContours(img, contours, -1, (0, 0, 255), 1)
        cv2.putText(img, text,  [int(x1), int(y1)], 1, 1,  (0, 0, 255))
     # #
    cv2.imshow("name", img)
    cv2.waitKey(0)