使用TensorFlow训练神经网络------手写体数字识别

最新推荐文章于 2024-06-04 01:34:03 发布

fly_Xiaoma

最新推荐文章于 2024-06-04 01:34:03 发布

阅读量899

点赞数

分类专栏： tensorflow

本文链接：https://blog.csdn.net/weixin_38664232/article/details/87356679

版权

tensorflow 专栏收录该内容

21 篇文章 1 订阅

订阅专栏

目标：使用TensorFlow实现一个简单的手写数字识别网络，并用这个网络来做个简单的识别示例

设计知识：dropout、learningrate decay、初始化等。将网络最终在validation数据上的得分尽可能的提高。

1.导入工具库

import numpy as np
import tensorflow as tf
from tensorflow.examples.tutorials.mnist import input_data
from matplotlib import pyplot as plt

2.数据情况总览

#读入数据包
mnist=input_data.read_data_sets('./')

print('train_images_shape',mnist.train.images.shape)
print('train_labels_shape',mnist.train.labels.shape)

print('validation_images_shape',mnist.validation.images.shape)
print('validation_labels_shape',mnist.validation.labels.shapes)

print('test_iamges_shape',mnist.test.images.shape)
print('test_labels_shape',mnist.test.labels.shape)

3. 数据展示

plt.figure(figsize=(8,8))

for idx in range(16):
    plt.subplot(4,4,idx+1)
    plt.axis('off')#不显示坐标轴
    plt.title('[{}]'.format(mnist.train.labels[idx]))
    plt.imshow(mnist.train.images[idx].reshape((28,28))

可以看到images里面有数量不等的图片，每张图片是28x28长度的一个一维向量，所以用的时候需要先给它还原成28x28的二维图片。labels中则是图片对应的数值的值。

4.定义用于训练的网络

首先定义网络的输入

这里直接使用上面的数据作为输入，所以定义两个placeholder分别用于图像和label数据，另外，定义一个float类型的变量用于设置学习率。

为了让网络更高效的运行，多个数据会被组织成一个batch送入网络，两个placeholder的第一个维度就是batchsize，因为我们这里还没有确定batchsize,所以第一个维度留空。

x=tf.placeholder('float',[None,784])
y=tf.placeholder('int64',[None])
learning_rate=tf.placeholder('float')

def initialize(shape,stddev=0.1):
    return tf.truncated_normal(shape,stddev=0.1)

#1. 隐层中的神经元个数
L1_units_count=100
W_1=tf.Variable(initialize([784,L1_units_count]))
b_1=tf.Variable(initialize([L1_units_count]))
logits_1=tf.matmul(x,W_1)+b_1
#将乘积数据激活函数，激活函数为ReLU
output_1=tf.nn.relu(logits_1)

#2. 神经网络输出节点 ，共10个输出点
L2_units_count=10
W_2=tf.Variable(initialize([L1_units_count,L2_units_count]))
b_2=tf.Variable(initialize([L2_units_count]))
logits_2=tf.matmul(output_1,W_2)

logits=logits_2

#定义loss和用于优化网络的优化器 loss计算使用了#sparse_softmax_cross_entropy_with_logits,这样做的好处
#是labels可以不用手动#做one_hot省了一些麻烦。这里使用sgd优化器，学习率可以根
#据需要设定

#拓展--可以尝试增大学习率，换个优化器再进行训练
cross_entropy_loss=tf.reduce_mean(
    tf.nn.sparse_softmax_cross_entropy_with_logits(logits=logits
                                                   ,labels=y)
)
optimizer=tf.train.GradientDescentOptimizer(
    learning_rate=learning_rate
).minimize(cross_entropy_loss)

#softmax概率分类
pred=tf.nn.softmax(logits)
correct_pred=tf.equal(tf.argmax(pred,1),y)
accuracy=tf.reduce_mean(tf.cast(correct_pred,tf.float32))

#saver用于保存或恢复训练的模型
batch_size=32
training_step=1000

saver=tf.train.Saver()

#创建Session，将数据填入网络
with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())

    #定义验证集合测试集

    validate_data={
        x:mnist.validation.images,
        y:mnist.validation.labels
    }
    test_data={x:mnist.test.images,y:mnist.test.labels}

    for i in range(training_step):
        xs,ys=mnist.train.next_batch(batch_size)
        _,loss=sess.run(
            [optimizer,cross_entropy_loss],
            feed_dict={
                x:xs,
                y:ys,
                learning_rate:0.3
            }
        )

        #每100次训练打印一次损失值与验证准确率
        if i>0 and i%100==0:
            validate_accuracy=sess.run(accuracy,feed_dict=
                                       validate_data)
            print(
                "after %d training steps,the loss is %g,the validation accuracy is %g"
                %(i,loss,validate_accuracy)
            )
            saver.save(sess,'./model.ckpt',global_step=i)
    print('the training is finish!')
    #最终的测试准确率
    acc=sess.run(accuracy,feed_dict=test_data)
    print('the test accuracy is: ',acc)

输出结果：

5.使用训练的模型做一个测试

with tf.Session() as sess:
    ckpt=tf.train.get_checkpoint_state('./')
    if ckpt and ckpt.model_checkpoint_path:
        saver.restore(sess,ckpt.model_checkpoint_path)
        final_pred,acc=sess.run(
            [pred,accuracy],
            feed_dict={
                x:mnist.test.images[:16],
                y:mnist.test.labels[:16]

            }
        )
        orders=np.argsort(final_pred)
        plt.figure(figsize=(8,8))
        print('acc=',acc)
        for idx in range(16):
            order=orders[idx,:][-1]
            prob=final_pred[idx,:][order]
            plt.subplot(4,4,idx+1)
            plt.axis('off')
            plt.title('{}:[{}]-[{:.1f}%]'.format(mnist.test.labels[idx],
                                                 order,prob*100))
            plt.imshow(mnist.test.images[idx].reshape((28,28)))

        plt.show()
    else:
        pass

输出：

观察一下每一个数字的准确率还是挺高的。

fly_Xiaoma

关注

0
点赞
踩
1

收藏

觉得还不错? 一键收藏
0
评论
使用TensorFlow训练神经网络------手写体数字识别

目录1.导入工具库2.数据情况总览3. 数据展示4.定义用于训练的网络5.使用训练的模型做一个测试目标：使用TensorFlow实现一个简单的手写数字识别网络，并用这个网络来做个简单的识别示例设计知识：dropout、learningrate decay、初始化等。将网络最终在validation数据上的得分尽可能的提高。1.导入工具库import numpy...
复制链接

扫一扫

专栏目录