A 1D-CNN Based Fully Convolutional Model for Handwriting Recognition

Handwriting Recognition, also termed HTR (Handwritten Text Recognition), is a machine learning task that aims to give machines the ability to read human handwriting from real-world documents (images).

Traditional Optical Character Recognition (OCR) systems are trained to handle the variations and font styles found in machine-printed text (from documents/images), and they work very well in practice (for example, Tesseract). Handwriting recognition, on the other hand, is a more challenging task due to the large variation among people's handwriting.

Recent progress in deep learning has led to the development of efficient OCR/HTR solutions. Although these models perform remarkably well in practice, they aren't easy to train, understand, and deploy due to the following limitations:

  1. They require a huge amount of labeled training data.

  2. Due to their large number of parameters, they are hard to train and slow at inference.

  3. Because they are slow, they require significant deployment cost (hardware) to be useful in real-time applications.

  4. The models are complex in nature and difficult to scale (stacked LSTMs, complex attention layers).

In this article, we will talk about a novel deep learning architecture (EASTER) that solves the above-listed challenges to some extent. This architecture is fast, scalable, simple, and more efficient than many complex alternatives for the tasks of OCR and HTR.

The EASTER model uses only one-dimensional convolutional layers for the tasks of HTR and OCR.

Here are the topics this article covers regarding the EASTER model:

  1. EASTER Overview

  2. 1D-CNN on images? Really? how?

  3. EASTER Model Architecture

  4. OCR/HTR Capability with zero Training Data

  5. Results

  6. Summary

EASTER Overview

EASTER (Efficient and Scalable Text Recognizer) is a fully convolutional architecture that uses only 1-D convolutional layers in the encoder and adds a CTC (Connectionist Temporal Classification) decoder at the end.

EASTER presents a new way of framing and efficiently solving OCR/HTR tasks with only 1-D convolutional layers.

Here are a few important points about the EASTER architecture:

  1. A fully convolutional architecture that can be trained in parallel on GPUs.

  2. Only 1-D convolutional layers, making it faster with fewer parameters.

  3. Works well even when training data is limited.

  4. No complex layers (easy to understand).

  5. Works well for line-level OCR/HTR tasks.

In addition to the EASTER architecture, this paper also presents a synthetic data generation pipeline with an augmentation setup. That means you can train your own OCR/HTR system with zero training data requirements.

Now the question arises: how do you apply one-dimensional convolutions to a two-dimensional image? This is a fair question, and the next section explains it.

1D-CNN on Images? Really? How?

Consider an input image of size 600 × 50 (W × H), as shown in the figure below.

Here, if you draw any vertical line in this image, it will cut through at most a single character (unless it passes through white space), while a horizontal line will probably cut through all the characters.

In other words, along the height of the image you only find the properties of a single character, while along the width you encounter all the different characters as you move from left to right.

[Image by Author]

So the width can be treated as a time dimension: as you move along it, you encounter the subsequent characters, while the height represents the properties of the character at a given time-step.

A one-dimensional filter of kernel size 3 is a filter of size 3 along the time dimension (the width, 3 pixels at a time) that covers the full height of 50 pixels (H). In other words, a filter of kernel size 3 has dimensions 3×50 (or 3×H), just as 1-D CNNs work on NLP word embeddings.
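To make this concrete, here is a minimal sketch, assuming a TensorFlow/Keras setup (the filter count and variable names are illustrative and not taken from the paper), of feeding a 600 × 50 line image to a 1-D convolution by treating the width as the time axis and each pixel column as a feature vector:

```python
# A minimal sketch (not the paper's code): treat a 600 x 50 line image as a
# sequence where the width is the time axis and each 50-pixel column is the
# feature vector for that time step.
import numpy as np
import tensorflow as tf

W, H = 600, 50
image = np.random.rand(1, W, H).astype("float32")   # dummy batch of one image

# kernel_size=3 spans 3 pixels along the width and (implicitly) the full
# 50-pixel height -- i.e. a 3 x H receptive field, as described above.
conv1d = tf.keras.layers.Conv1D(filters=64, kernel_size=3, padding="same")
features = conv1d(image)
print(features.shape)   # (1, 600, 64): one 64-dim feature vector per time step
```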

As shown in the figure above, the red rectangular box is a 1-D convolutional filter that scans the full height of the image as it moves along the time dimension (the width) from left to right. Each scan captures information about the observed character (or part of a character).

This information is finally passed to a softmax layer, which gives a probability distribution over all possible characters for each time-step along the width. This probability distribution is then passed to the CTC decoding layer to generate the final output sequence.

EASTER Model Architecture

The EASTER model architecture is quite simple: it uses only 1-D convolutional layers for the tasks of OCR and HTR.

The EASTER encoder consists of multiple stacked 1-D convolutional layers, where the kernel size increases with the depth of the model. The effectiveness of stacked 1-D convolutional networks for sequence-to-sequence tasks has already been demonstrated in the field of ASR (Automatic Speech Recognition).

Easter Block

The basic structure of an EASTER block is shown in the figure below. Each block has multiple repeating sub-blocks, and each sub-block is made up of 4 ordered components (a rough sketch in code follows the figure):

  1. 1-D Convolutional layer
  2. Batch-Normalization layer
  3. Activation layer (ReLU)
  4. A Dropout layer
[Image Source]
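As a rough illustration, one such sub-block could be written in Keras as follows; the filter count, kernel size, and dropout rate here are placeholder values, not the ones from the paper's configuration table:

```python
# A rough sketch of one EASTER sub-block (Conv1D -> BatchNorm -> ReLU -> Dropout);
# hyperparameter values are illustrative, not taken from the paper.
import tensorflow as tf

def easter_sub_block(x, filters, kernel_size, dropout=0.2):
    x = tf.keras.layers.Conv1D(filters, kernel_size, padding="same")(x)
    x = tf.keras.layers.BatchNormalization()(x)
    x = tf.keras.layers.Activation("relu")(x)
    return tf.keras.layers.Dropout(dropout)(x)

# An EASTER block repeats this sub-block several times; kernel sizes grow
# with the depth of the encoder.
inputs = tf.keras.Input(shape=(600, 50))   # (width, height) as (time, features)
x = easter_sub_block(inputs, filters=128, kernel_size=5)
x = easter_sub_block(x, filters=128, kernel_size=5)
block = tf.keras.Model(inputs, x)
```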

Final Encoder

The overall encoder is a stack of multiple repeating EASTER blocks (discussed above). In addition to the repeating blocks, there are four extra 1-D convolutional blocks in the overall architecture, as shown in the figure below.

Preprocessing Block (Downsampling Block)

This is the first block of the model and contains two 1-D convolutional layers with a stride of 2. It downsamples the original width of the image to width/4. Apart from the stride, all other components of these sub-blocks are the same as those discussed above.
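A minimal sketch of this downsampling block, again assuming Keras with illustrative hyperparameters, could look like this:

```python
# A sketch of the preprocessing (downsampling) block: two stride-2 1-D
# convolutional sub-blocks, each halving the width, so 600 -> 150 (width/4).
import tensorflow as tf

def downsampling_block(x, filters=64, kernel_size=3, dropout=0.2):
    for _ in range(2):
        x = tf.keras.layers.Conv1D(filters, kernel_size, strides=2, padding="same")(x)
        x = tf.keras.layers.BatchNormalization()(x)
        x = tf.keras.layers.Activation("relu")(x)
        x = tf.keras.layers.Dropout(dropout)(x)
    return x

inputs = tf.keras.Input(shape=(600, 50))
x = downsampling_block(inputs)
print(x.shape)   # (None, 150, 64): the width has been reduced to 600 / 4
```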

Post-Processing Blocks

There are three post-processing blocks at the end of the encoder. The first is a dilated 1-D convolutional block with a dilation of 2, the second is a normal 1-D convolutional block, and the third is a 1-D convolutional block whose number of filters equals the number of possible output characters (the model's vocabulary length), followed by a softmax activation. The output of this layer is passed to the CTC decoder.

[Image Source]
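Here is a hedged sketch of the three post-processing blocks in the same illustrative Keras style; the filter counts, kernel sizes, and vocabulary size are assumptions, not the paper's exact values:

```python
# A sketch of the three post-processing blocks: a dilated Conv1D (dilation 2),
# a regular Conv1D, and a final Conv1D with vocabulary-size filters + softmax.
import tensorflow as tf

VOCAB_SIZE = 80   # assumed vocabulary length (all possible characters + CTC blank)

def post_processing(x):
    x = tf.keras.layers.Conv1D(512, 5, padding="same", dilation_rate=2)(x)
    x = tf.keras.layers.BatchNormalization()(x)
    x = tf.keras.layers.Activation("relu")(x)

    x = tf.keras.layers.Conv1D(512, 1, padding="same")(x)
    x = tf.keras.layers.BatchNormalization()(x)
    x = tf.keras.layers.Activation("relu")(x)

    # one probability distribution over the vocabulary per time step,
    # which is what the CTC decoder consumes
    return tf.keras.layers.Conv1D(VOCAB_SIZE, 1, padding="same",
                                  activation="softmax")(x)
```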

CTC Decoder

The EASTER encoder passes the output probability distribution of the encoded sequence to a CTC decoder for decoding.

To map the predicted per-time-step characters to the final output sequence, the EASTER model uses a weighted CTC decoder. This weighted CTC decoder leads to faster convergence and gives better results than vanilla CTC when training data is limited.

The configuration of this weighted CTC decoder is described in detail in the original paper.
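The weighted CTC formulation itself is specific to the paper, so it is not reproduced here. For orientation, the sketch below shows what a plain (unweighted) CTC loss and greedy CTC decoding look like in Keras, operating on the encoder's (batch, time-steps, vocabulary) softmax output; the shapes and label values are dummies:

```python
# Plain (unweighted) CTC for reference -- the paper's weighted variant differs.
# Shapes and label values below are dummies.
import tensorflow as tf

batch, time_steps, vocab = 2, 150, 80
y_pred = tf.nn.softmax(tf.random.uniform((batch, time_steps, vocab)), axis=-1)

labels = tf.constant([[5, 12, 9, 0], [7, 3, 0, 0]])    # zero-padded label sequences
label_len = tf.constant([[3], [2]])                    # true lengths of the labels
input_len = tf.constant([[time_steps], [time_steps]])  # encoder output lengths

# CTC loss over the padded batch
loss = tf.keras.backend.ctc_batch_cost(labels, y_pred, input_len, label_len)

# greedy CTC decoding collapses repeated characters and removes blanks
decoded, _ = tf.keras.backend.ctc_decode(
    y_pred, input_length=tf.squeeze(input_len, axis=-1), greedy=True)
print(loss.shape, decoded[0].shape)
```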

3x3 Architecture Variant

EASTER 3x3: a 14-layer variant can be constructed using the table shown below. It is a very shallow and simple architecture with just 1M parameters, yet it is very effective for OCR/HTR tasks.

[Image Source]

This model can easily be scaled up to increase capacity and performance. In the experiments reported in the paper, a 5x3 variant achieves state-of-the-art performance on HTR and OCR tasks.

OCR/HTR Capability with Zero Training Data

In addition to the novel architecture, the EASTER paper also describes how to synthetically generate training data for both machine-printed text and handwriting recognition tasks.

Using these methods (well described in the paper), you can train your own optical character recognition (OCR) or handwriting recognition (HTR) system without any labeled data, since the configurable data generator presented in the paper prepares a synthetic labeled training dataset for you.
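As a toy illustration of the underlying idea (and not the paper's actual generator, which also covers handwriting-style fonts, backgrounds, and augmentations), rendering a known text string with a font immediately yields a labeled (image, text) pair; the font path below is an assumption:

```python
# A toy synthetic-data generator (not the paper's pipeline): render known text
# with a font so that every image comes with its label for free.
from PIL import Image, ImageDraw, ImageFont

def render_line(text, width=600, height=50,
                font_path="DejaVuSans.ttf"):   # font path is an assumption
    img = Image.new("L", (width, height), color=255)    # white background
    draw = ImageDraw.Draw(img)
    font = ImageFont.truetype(font_path, size=32)
    draw.text((10, 8), text, font=font, fill=0)         # black text
    return img, text                                     # (image, label) pair

sample_img, sample_label = render_line("hello world")
sample_img.save("synthetic_sample.png")
```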

The following figure shows some synthetically generated samples from the paper; they look very realistic.

[Image Source]

Results

The paper reports impressive results on the IAM-offline line recognition task. The handwriting recognition experiments show that the EASTER model works well even when training data is limited.

The handwriting recognition results of EASTER are compared against Google's paper 'A Scalable Handwritten Text Recognition System' (aka GRCL), in which the authors show good handwritten-line recognition results with a limited training dataset. The EASTER model outperforms GRCL even with fewer training samples, as shown in the table below.

[Image Source]

EASTER further shows state-of-the-art results on scene text recognition (machine-printed) tasks, without any augmentation and with a greedy-search decoding mechanism (no language-model decoding).

Here is a screenshot of the model's results on handwritten as well as machine-printed tasks, taken from the paper itself:

[Image Source]

Summary

In this article, we discussed a novel fully convolutional (using only 1-D convolutions), end-to-end OCR/HTR pipeline that is simple, fast, efficient, and scalable.

In addition to the architecture, we learned how a one-dimensional convolutional filter operates on an image to be recognized.

Finally, we discussed the synthetic data generation pipeline along with the recognition results reported in the original paper.

For more details, you can read the original paper, which gives a detailed explanation of all the aspects touched on in this article.

Thanks for reading! I hope this article was helpful. Please share your feedback in the comments. See you in the next article.

This article was originally published here.

Translated from: https://towardsdatascience.com/1d-cnn-based-fully-convolutional-model-for-handwriting-recognition-7853976f5784
