mnist运行遇到的几个问题

最新推荐文章于 2023-03-04 11:06:39 发布

DJames23

最新推荐文章于 2023-03-04 11:06:39 发布

阅读量656

点赞数

分类专栏：深度学习文章标签： tensorflow 神经网络深度学习

本文链接：https://blog.csdn.net/DJames23/article/details/103654221

版权

深度学习专栏收录该内容

10 篇文章 1 订阅

订阅专栏

参考博客：
https://blog.csdn.net/qq_38269418/article/details/78991649

这篇博客写的很详细，包括模型训练、测试与ps制作手写字体。这里主要写几个在运行mnist时遇到的问题点。

1.优化器（Optimizer）的选择

可以参考https://blog.csdn.net/weixin_41417982/article/details/81561210
在mnist_deep.py中可以使用三种optimizer:

train_step = tf.train.GradientDescentOptimizer(1e-3).minimize(cross_entropy)
#train_step = tf.train.AdagradOptimizer(1e-4).minimize(cross_entropy)
#train_step = tf.train.AdamOptimizer(1e-4).minimize(cross_entropy)

若训练时使用GradientDescent，那么在test.py（测试自己制作的手写数字）中也应使用GradientDescent或者直接注掉

#train_step = tf.train.GradientDescentOptimizer(1e-3).minimize(cross_entropy)

若测试时使用了AdamOptimizer将会报以下错误:

NotFoundError (see above for traceback): Key Variable/Adam not found in checkpoint

若训练使用Adam测试时使用GradientDescent或者注掉都不影响。
使用GradientDescent方法生成的模型较小为12M左右，Adam模型大小为39M左右。

2.cpu 与 gpu切换：

就是sess的写法区别

cpu版本写法为;

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    for i in range(20000):
        batch = mnist.train.next_batch(50)
        if i % 100 == 0:
            train_accuracy = accuracy.eval(feed_dict={
                x: batch[0], y_: batch[1], keep_prob: 1.0})
            print('step %d, training accuracy %g' % (i, train_accuracy))
        train_step.run(feed_dict={x: batch[0], y_: batch[1], keep_prob: 0.5})
    saver.save(sess, '/home/dengjie/dengjie/project/Mnist/mnist/deepmodel/model.ckpt') #模型储存位置

gpu版本为：

sess = tf.InteractiveSession()
sess.run(tf.global_variables_initializer())
for i in range(20000):
    batch = mnist.train.next_batch(50)
    if i % 100 == 0:
        train_accuracy = accuracy.eval(feed_dict={
            x: batch[0], y_: batch[1], keep_prob: 1.0})
        print('step %d, training accuracy %g' % (i, train_accuracy))
    train_step.run(feed_dict={x: batch[0], y_: batch[1], keep_prob: 0.5})
saver.save(sess, '/home/dengjie/dengjie/project/Mnist/mnist/deepmodel/model.ckpt') #模型储存位置

gpu版本tensorflow使用cpu代码时会报以下错误;

could not create cudnn handle: CUDNN_STATUS_INTERNAL_ERROR

3.打印手写体矩阵形式

tva = [(255-x)*1.0/255.0 for x in tv] 
print(tva)

在这里插入图片描述
识别结果：

4.报错（自己记录）

File "Test.py", line 103, in <module>
    saver.restore(sess, "/home/dengjie/dengjie/project/Mnist/mnist/model/model.ckpt")
...
InvalidArgumentError (see above for traceback): Assign requires shapes of both tensors to match. lhs shape= [5,5,32,64] rhs shape= [5,5,1,32]

需要注掉mnist_deep.py中的以下代码

#W = tf.Variable(tf.zeros([784,10]))
#b = tf.Variable(tf.zeros([10]))

#y = tf.nn.softmax(tf.matmul(x,W) + b)

其他参考：
三维可视化理解卷积：
https://cs.ryerson.ca/~aharley/vis/conv/
项目地址及源码：
https://cs.ryerson.ca/~aharley/vis/
使用opencv进行手写数字图片预处理参考博客：
https://blog.csdn.net/sparta_117/article/details/66965760
tensorflow中文社区：
http://www.tensorfly.cn/tfdoc/tutorials/mnist_pros.html
tensorflow函数：
https://www.w3cschool.cn/tensorflow_python/tensorflow_python-hdfb2xtv.html

DJames23

关注

0
点赞
踩
1

收藏

觉得还不错? 一键收藏
0
评论
复制链接

分享到 QQ

分享到新浪微博

扫一扫

专栏目录