MNIST(二）：基于CNN的mnist识别

最新推荐文章于 2023-11-01 15:48:41 发布

Itsukaa

最新推荐文章于 2023-11-01 15:48:41 发布

阅读量459

点赞数

本文链接：https://blog.csdn.net/qq_34391370/article/details/78609128

版权

在自学tensorflow的时候，经常会遇到这样的问题：
1.为啥这里用这个函数，还有这个函数在哪里定义的，我怎么查询api文档
2.为什么我看了同样是写mnist的代码，为什么实现的方法会有很大的区别

因为tensorflow现在的更新比较频繁，版本更替很快，所以很正常会看到实现的方法不同，还有一点就是各人的写代码风格不同，但是如果弄清楚实现的那几个步骤，其实也能很好理解。

至于如何查api文档，可以访问http://devdocs.io/ 页面向下拖就可以找到tensorflow,并且api文档是可以下载的，保存在浏览器的缓存中，没有网的时候也可以访问。

下面正题，利用CNN实现mnist识别：

tensorflow里面内置了处理mnist的各种函数，方便我们操作。所以不必我们进行数据的处理

如果我们想显示mnist数据集里的一个图片，怎么操作呢？

print(mnist.train.images.shape)     # (55000, 28 * 28)
print(mnist.train.labels.shape)   # (55000, 10)
plt.imshow(mnist.train.images[0].reshape((28, 28)), cmap='gray')
plt.title('%i' % np.argmax(mnist.train.labels[0])); plt.show()

这里写图片描述

2 . 接下来就是实现的过程：（这里借鉴了莫烦大佬的代码）
这里用了两层的conv加上后面的全连接

import tensorflow as tf
from tensorflow.examples.tutorials.mnist import input_data
import numpy as np
import matplotlib.pyplot as plt

tf.set_random_seed(1)
np.random.seed(1)

BATCH_SIZE = 50
LR = 0.001              # learning rate

mnist = input_data.read_data_sets('./mnist', one_hot=True)  # they has been normalized to range (0,1)
test_x = mnist.test.images[:2000]
test_y = mnist.test.labels[:2000]

# plot one example
print(mnist.train.images.shape)     # (55000, 28 * 28)
print(mnist.train.labels.shape)   # (55000, 10)
plt.imshow(mnist.train.images[0].reshape((28, 28)), cmap='gray')
plt.title('%i' % np.argmax(mnist.train.labels[0])); plt.show()

tf_x = tf.placeholder(tf.float32, [None, 28*28]) / 255.
image = tf.reshape(tf_x, [-1, 28, 28, 1])              # (batch, height, width, channel)
tf_y = tf.placeholder(tf.int32, [None, 10])            # input y

# CNN
conv1 = tf.layers.conv2d(   # shape (28, 28, 1)
    inputs=image,
    filters=16,
    kernel_size=5,
    strides=1,
    padding='same',
    activation=tf.nn.relu
)           # -> (28, 28, 16)
pool1 = tf.layers.max_pooling2d(
    conv1,
    pool_size=2,
    strides=2,
)           # -> (14, 14, 16)
conv2 = tf.layers.conv2d(pool1, 32, 5, 1, 'same', activation=tf.nn.relu)    # -> (14, 14, 32)
pool2 = tf.layers.max_pooling2d(conv2, 2, 2)    # -> (7, 7, 32)
flat = tf.reshape(pool2, [-1, 7*7*32])          # -> (7*7*32, )
output = tf.layers.dense(flat, 10)              # output layer

loss = tf.losses.softmax_cross_entropy(onehot_labels=tf_y, logits=output)           # compute cost
train_op = tf.train.AdamOptimizer(LR).minimize(loss)

accuracy = tf.metrics.accuracy(          # return (acc, update_op), and create 2 local variables
    labels=tf.argmax(tf_y, axis=1), predictions=tf.argmax(output, axis=1),)[1]

sess = tf.Session()
init_op = tf.group(tf.global_variables_initializer(), tf.local_variables_initializer()) # the local var is for accuracy_op
sess.run(init_op)     # initialize var in graph

for step in range(600):
    b_x, b_y = mnist.train.next_batch(BATCH_SIZE)
    _, loss_ = sess.run([train_op, loss], {tf_x: b_x, tf_y: b_y})
    if step % 50 == 0:
        accuracy_, flat_representation = sess.run([accuracy, flat], {tf_x: test_x, tf_y: test_y})
        print('Step:', step, '| train loss: %.4f' % loss_, '| test accuracy: %.2f' % accuracy_)

# print 10 predictions from test data
test_output = sess.run(output, {tf_x: test_x[:10]})
pred_y = np.argmax(test_output, 1)
print(pred_y, 'prediction number')
print(np.argmax(test_y[:10], 1), 'real number')

Itsukaa

关注

0
点赞
踩
3

收藏

觉得还不错? 一键收藏
0
评论
MNIST(二）：基于CNN的mnist识别

在自学tensorflow的时候，经常会遇到这样的问题： 1.为啥这里用这个函数，还有这个函数在哪里定义的，我怎么查询api文档 2.为什么我看了同样是写mnist的代码，为什么实现的方法会有很大的区别因为tensorflow现在的更新比较频繁，版本更替很快，所以很正常会看到实现的方法不同，还有一点就是各人的写代码风格不同，但是如果弄清楚实现的那几个步骤，其实也能很好理解。至于如何查api文档，
复制链接

扫一扫