http://www.ee.surrey.ac.uk/CVSSP/demos/chars74k/
Chars74K dataset
This dataset contains symbols used in both English and Kannada.
English uses Latin script (excluding accents) and Hindu-Arabic numerals; for simplicity we call this the "English" character set. The dataset consists of:
- 62 classes (0-9, A-Z, a-z)
- 7705 characters obtained from natural images
- 3410 hand drawn characters using a tablet PC
- 62992 synthesised characters from computer fonts
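As a quick illustration of this 62-class label space, here is a minimal sketch that builds an index-to-character mapping (digits first, then upper-case, then lower-case letters). The ordering follows the commonly used Sample001..Sample062 folder convention of the distribution; treat that ordering as an assumption, not something stated on the dataset page.

```python
import string

# Hypothetical mapping for the 62 "English" classes: 0-9, A-Z, a-z.
# Digits, then upper case, then lower case (assumed to match the
# Sample001..Sample062 folder layout of the distribution).
CHARS74K_CLASSES = string.digits + string.ascii_uppercase + string.ascii_lowercase

def class_to_char(class_index: int) -> str:
    """Map a 1-based Chars74K class index to its character."""
    return CHARS74K_CLASSES[class_index - 1]

print(len(CHARS74K_CLASSES))   # 62
print(class_to_char(1))        # '0'
print(class_to_char(11))       # 'A'
print(class_to_char(37))       # 'a'
```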
http://openresearch.baidu.com/activitybulletin/618.jhtml
A text recognition code sample.
This page introduces Multi-Orientation Scene Text Detection and the USTB-SV1K dataset, and provides a multi-orientation, multi-view natural image text database.
USTB-SV1K
Text detection in natural scene images is an important prerequisite for many content-based image analysis tasks, while most current research efforts only focus on horizontal or near-horizontal scene text. In our paper, we first present a unified distance metric learning framework for adaptive hierarchical clustering, which can simultaneously learn similarity weights (to adaptively combine different feature similarities) and the clustering threshold (to automatically determine the number of clusters). Then, we propose an effective multi-orientation scene text detection system, which constructs text candidates by grouping characters based on this adaptive clustering. Our text candidate construction method consists of several sequential coarse-to-fine grouping steps: morphology-based grouping via single-link clustering, orientation-based grouping via divisive hierarchical clustering, and projection-based grouping also via divisive clustering. The effectiveness of our proposed system is evaluated on several public scene text databases, e.g., the ICDAR Robust Reading Competition datasets (2011 and 2013) and MSRA-TD500. Specifically, on the multi-orientation text dataset MSRA-TD500, the f-measure of our system is 70%, much better than the 60% of a recent state-of-the-art method.
We also construct and release a practical challenging multi-orientation scene text dataset (USTB-SV1K), which is available at http://prir.ustb.edu.cn/TexStar/MOMV-text-detection/.
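To make the coarse-to-fine grouping idea concrete, below is a minimal sketch of one such step: single-link hierarchical clustering of character candidates under a weighted combination of feature distances, cut at a threshold to form text candidates. The feature values, weights `w`, and threshold `t` are placeholders; in the paper both the weights and the threshold are learned by the distance metric learning framework, not fixed by hand.

```python
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster
from scipy.spatial.distance import pdist

# Hypothetical character candidates: each row holds simple per-character
# features (x-centre, y-centre, height, mean grey level).
chars = np.array([
    [10.0, 20.0, 18.0, 0.40],
    [30.0, 21.0, 17.0, 0.42],
    [52.0, 19.0, 19.0, 0.39],
    [200.0, 180.0, 40.0, 0.90],
    [240.0, 182.0, 41.0, 0.88],
])

# Assumed similarity weights and clustering threshold; in the paper these
# are learned, here they are fixed placeholders for illustration.
w = np.array([1.0, 1.0, 0.5, 2.0])
t = 60.0

# Weighted Euclidean distance as a stand-in for the learned combined metric.
dists = pdist(chars * w)

# Morphology-based grouping via single-link agglomerative clustering,
# cut at threshold t to form text-candidate groups.
Z = linkage(dists, method="single")
labels = fcluster(Z, t=t, criterion="distance")
print(labels)  # e.g. [1 1 1 2 2]: two text candidates
```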
Dataset description
Each image is annotated with a list of words labeled with bounding boxes, each specified by the coordinates of its top-left point, width, height, and inclination angle, together with the ground-truth word, similar to MSRA-TD500. We collect 1000 street view (patch) images (500 for training and 500 for testing) from 6 USA cities, i.e., New York, Boston, Los Angeles, Washington DC, San Francisco, and Seattle. The set from each city includes about 160 to 180 images, about half of which are for training and the rest for testing. There are three main challenges for detection and recognition on this dataset (se
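To illustrate the annotation scheme described above, the hypothetical parser below assumes one word per line with comma-separated fields in the order top-left x, top-left y, width, height, inclination angle, word. The actual ground-truth file layout of USTB-SV1K is not specified here, so both the field order and the separator are assumptions.

```python
from dataclasses import dataclass

@dataclass
class WordBox:
    x: float       # x of the top-left point
    y: float       # y of the top-left point
    width: float
    height: float
    angle: float   # inclination angle of the box
    word: str      # ground-truth transcription

def parse_annotation_line(line: str) -> WordBox:
    """Parse one annotated word, assuming comma-separated fields
    'x, y, width, height, angle, word' (hypothetical format)."""
    x, y, w, h, angle, word = line.strip().split(",", 5)
    return WordBox(float(x), float(y), float(w), float(h), float(angle), word.strip())

# Example with made-up values:
print(parse_annotation_line("120, 45, 200, 60, -0.15, STARBUCKS"))
```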