递归神经网络对中文字符的读写——读后感

最新推荐文章于 2022-08-11 13:06:29 发布

yeler082

最新推荐文章于 2022-08-11 13:06:29 发布

阅读量999

点赞数

分类专栏：论文阅读

论文阅读专栏收录该内容

16 篇文章 1 订阅

订阅专栏

论文链接：Drawing and Recognizing Chinese Characters with Recurrent Neural Network

一、文章标题

从标题我们可以看出本文的研究内容是采用递归神经网络实现的中文字符读写的基本操作，我们可以联想到对中文字符的读取是不是识别，那么写又是什么呢？什么又是递归神经网络？

二、看摘要

Previous research has mainly focused onrecognizing handwritten Chinese characters. However, recognition is only oneaspect for understanding a language, another challenging and interesting taskis to teach a machine to automatically write (pictographic) Chinese characters.

这里回答了以上的第两个问题：首先是对文字的识别，其次是教会计算机自动的去写中文字符。

In this paper, we propose a framework by using the recurrent neural network(RNN) as both a discriminative model for recognizing Chinese characters and agenerative model for drawing (generating) Chinese characters.

这个网络既可以用于识别也可以用于训练计算机去写文字，第一感觉这个递归神经网络好牛逼，那他究竟是什么鬼呢？

To recognizeChinese characters, previous methods usually adopt the convolutional neuralnetwork (CNN) models which require transforming the online handwriting trajectory into image-like representations. Instead, our RNN based approach is an end-to-end system which directly deals with the sequential structure and does not require any domain-specific knowledge.

这里通过CNN和RNN的对比说明了，CNN在做文字识别的时候是将手写的字迹转化为图像进一步处理；但是，基于RNN的方法就不需要什么预处理和特定领域的知识了。

三、介绍

For the task of automatic recognition of handwritten Chinese characters, there are two main categories of approaches:online and offline methods. With the success of deep learn-ing [5], [6], the convolutional neural network (CNN) [7] has been widely applied for handwriting recognition. The strong priori knowledge of convolution makes the CNN a powerful tool for image classification. Since the offline characters are naturally represented as scanned images, it is natural and works well to apply CNNs to the task of offline recognition [8],[9], [10], [11]. However, in order to apply CNNs to online characters, the online handwriting trajectory should firstly be transformed to some image-like representations, such as the AMAP [12], the path signature maps [13] or the directional feature maps [14].

对于文字的识别有两种类型：离线的和在线的。对于离线的文字（本身就已经是图像了）识别基本上还是采用了CNN，因为它具有很强大的先验知识和图像分类功能，那么将CNN应用于在线的识别，就需要第一步将其转化为图像了。

we propose to use recurrent neural networks (RNN) combined with bidirectional long short term memory (LSTM) [15], [16] and gated recurrent unit (GRU) [17] for online handwritten Chinese character recognition.

作者提出采用递归神经网络（RNN）、长期短记忆网络（LSTM）、封闭的复发性单元（GRU）进行文字的识别

四、手写中文字符的呈现

实际上，人们在书写文字的同时可以用一个有序列的数据集去记录笔尖所在的坐标位置和笔头当前所移动的方向。

可以这样表示：