利用MultiDigitNumberRecognition.java来验证新版本

5 篇文章 0 订阅
1 篇文章 0 订阅

20191129

beta3运行自带的dl4j-example/org.deeplearning4j.example/catcharecognition/MultiDigitNumberRecognition.java是正常的,但是beta5运行同样的文件就有问题。

 

20191122

上述文件在beta6被放到了dl4j-example/org.deeplearning4j.examples/convolution/catcharecognition/中。

 

20200210

beta6运行上述文件,没有配置虚拟机参数,4个epoch计算完以后,除了报以下错误以外,其余运行正常。程序停止前的混淆矩阵分布不记得了,没空重新测试。

Exception in thread "main" org.nd4j.linalg.exception.ND4JIllegalStateException: Axis array [1] contains values above array rank (rank=1)    at org.nd4j.linalg.api.shape.Shape.normalizeAxis(Shape.java:3394)    at org.nd4j.linalg.api.ops.BaseOp.defineDimensions(BaseOp.java:344)    at org.nd4j.linalg.api.ops.BaseIndexAccumulation.<init>(BaseIndexAccumulation.java:95)    at org.nd4j.linalg.api.ops.impl.indexaccum.IMax.<init>(IMax.java:47)    at org.nd4j.linalg.factory.Nd4j.argMax(Nd4j.java:578)    at org.deeplearning4j.examples.convolution.captcharecognition.MultiDigitNumberRecognition.modelPredict(MultiDigitNumberRecognition.java:177)    at org.deeplearning4j.examples.convolution.captcharecognition.MultiDigitNumberRecognition.main(MultiDigitNumberRecognition.java:103)

 

20200527

beta7运行MultiDigitNumberRecognition.java后和20200210beta6的情况一样。

程序停止前的混淆矩阵似乎有问题。

  0  1  2  3  4  5  6  7  8  9
-------------------------------
  0  0  0  0  0  0  0  0  0 22 | 0 = 0
  0  0  0  0  0  0  0  0  0  0 | 1 = 1
  0  0  0  0  0  0  0  0  0 13 | 2 = 2
  0  0  0  0  0  0  0  0  0  7 | 3 = 3
  0  0  0  0  0  0  0  0  0 16 | 4 = 4
  0  0  0  0  0  0  0  0  0  6 | 5 = 5
  0  0  0  0  0  0  0  0  0 14 | 6 = 6
  0  0  0  0  0  0  0  0  0  0 | 7 = 7
  0  0  0  0  0  0  0  0  0 15 | 8 = 8
  0  0  0  0  0  0  0  0  0 15 | 9 = 9

暂时就讨论到这里。

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 打赏
    打赏
  • 0
    评论
Recognizing arbitrary multi-character text in unconstrained natural photographs is a hard problem. In this paper, we address an equally hard sub-problem in this domain viz. recognizing arbitrary multi-digit numbers from Street View imagery. Traditional approaches to solve this problem typically separate out the localization, segmentation, and recognition steps. In this paper we propose a unified approach that integrates these three steps via the use of a deep convolutional neural network that operates directly on the image pixels. We employ the DistBelief (Dean et al., 2012) implementation of deep neural networks in order to train large, distributed neural networks on high quality images. We find that the performance of this approach increases with the depth of the convolutional network, with the best performance occurring in the deepest architecture we trained, with eleven hidden layers. We evaluate this approach on the publicly available SVHN dataset and achieve over 96% accuracy in recognizing complete street numbers. We show that on a per-digit recognition task, we improve upon the state-of-theart, achieving 97.84% accuracy. We also evaluate this approach on an even more challenging dataset generated from Street View imagery containing several tens of millions of street number annotations and achieve over 90% accuracy. To further explore the applicability of the proposed system to broader text recognition tasks, we apply it to transcribing synthetic distorted text from a popular CAPTCHA service, reCAPTCHA. reCAPTCHA is one of the most secure reverse turing tests that uses distorted text as one of the cues to distinguish humans from bots. With the proposed approach we report a 99.8% accuracy on transcribing the hardest category of reCAPTCHA puzzles. Our evaluations on both tasks, the street number recognition as well as reCAPTCHA puzzle transcription, indicate that at specific operating thresholds, the performance of the proposed system is comparable to, and in some cases exceeds, that of human operators.
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包

打赏作者

AlphaFinance

您的鼓励将是我创作的最大动力

¥1 ¥2 ¥4 ¥6 ¥10 ¥20
扫码支付:¥1
获取中
扫码支付

您的余额不足,请更换扫码支付或充值

打赏作者

实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值