A Brief Summary of Yann's "Gradient-Based Learning Applied to Document Recognition"

Paper Info:Gradient-Based Learning Applied to Document Recognition 

YANN LECUN, MEMBER, IEEE, L´EONBOTTOU, YOSHUA BENGIO, AND PATRICK HAFFNER


I.   Introduction


II. CNN for isolatedcharacter recognition

Features of Tradition Pattern Recognition:

1.     hand-designedfeature extractor

2.     trainable classifier

Problem: Images too large;topology of input (space or temporal correlations) ignored

Solution:

Using Convolutional Networks 

Features: 1)local receptive fields 2)shared weight 3)spatial or temporalsubsampling(Once a feature has been detected, location less important)->LeNet-5


III. Results andcomparison with other methods


IV. Multimodule systems and graph transformer networks(GTN)



V. Multiple object recognition: HOS (The first method for character string recognition)

Isolated characters TO strings of characters


optimizing a global criterion

A now classical method for segmentation andrecognition—HOS


Good candidate locations for cuts can be found by locating minima in the vertical projection profile, or minima of the distance between the upper and lower contours of the word.

Structure of the Process




Question: What's the meaning of  Interpretation graph?

Definitions in the paper: 

The goal of the recognitiontransformer is to generate a graph, called the interpretation graph orrecognition graph that contains all the possible interpretations for all thepossible segmentations of the input.

The interpretation graph hasalmost the same structure as the segmentation graph, except that each arc isreplaced by a set of arcs from and to the same node.

 

VI. Global training for graph transformer networks

?global training? The whole process?

1.Viterbi training 2.discriminative Viterbitraining 3.Forward training 4.discriminative forward training 5.remarks


VII. Multiple object recognition: Space displacement neural network


No segmentation needed

Problems: Expensive; neighbors; notsize-normalized

Solution:Convolutional Networks- A replicatedconvolutional

network, also called an SDNN

A.      Interpreting theOutput of an SDNN with a GTN

B.      Experiments withSDNN

C.     Global Training ofSDNN

D.     Object Detection and Spotting with SDNN


VIII. Graph transformernetworks and transducers


    

IX. & X. Applications

(Online Handwriting recognition system and check reading system)


  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值