Beginner's Notes: Hand-Coding MNIST Handwritten Digit Recognition with TensorFlow

Latest recommended article published on 2024-10-21 08:46:21

Betty_Long

Tags: tensorflow, neural networks, deep learning

Article link: https://blog.csdn.net/weixin_45666159/article/details/120577299

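This post walks through MNIST handwritten digit recognition with TensorFlow: defining the network, setting up the loss function, choosing a gradient descent optimizer, training, and evaluating accuracy on the test set. The code implements softmax regression and prints the model's prediction accuracy at the end.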

Summary generated automatically by CSDN.

# -*- coding: utf-8 -*-
"""
Created on Fri Oct  1 11:37:38 2021

@author: 19146

"""
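"""
Steps:
1. Define the model formula, i.e. the computation in the network's forward pass
2. Define the loss, choose an optimizer, and have the optimizer minimize the loss
3. Train iteratively on the data
4. Evaluate accuracy on the test or validation set
"""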

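# This script targets TensorFlow 1.x; tensorflow.examples.tutorials was removed in TensorFlow 2.x
from tensorflow.examples.tutorials.mnist import input_data
mnist = input_data.read_data_sets("MNIST_data/",one_hot=True)    # labels are one-hot encoded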

print(mnist.train.images.shape,mnist.train.labels.shape)
print(mnist.test.images.shape,mnist.test.labels.shape)
print(mnist.validation.images.shape,mnist.validation.labels.shape)

import tensorflow as tf
# Create a new InteractiveSession, which registers itself as the default session
sess = tf.InteractiveSession()

# A placeholder is where input data is fed in; [None,784] means any number of rows, each a 784-dimensional vector
x = tf.placeholder(tf.float32,[None,784])
# Initialize the weights W and biases b to zeros
W = tf.Variable(tf.zeros([784,10]))
b = tf.Variable(tf.zeros([10]))

# Implement the softmax regression model
y = tf.nn.softmax(tf.matmul(x,W)+b)
# y_ is the placeholder for the true labels
y_ = tf.placeholder(tf.float32,[None,10])
# Define the cross-entropy loss function
# y is the predicted probability distribution, y_ is the true one-hot label
cross_entropy = tf.reduce_mean(-tf.reduce_sum(y_*tf.log(y),reduction_indices=[1]))
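# Use gradient descent as the optimization algorithm
# TensorFlow automatically differentiates the whole computation graph we defined and trains it via backpropagation
# Set the learning rate to 0.5 and the objective to cross_entropy; this yields the training op train_step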
train_step = tf.train.GradientDescentOptimizer(0.5).minimize(cross_entropy)

# Initialize all global variables
tf.global_variables_initializer().run()

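# Run the training op iteratively: each step draws a random mini-batch of 100 samples from the training set, feeds it to the placeholders, and runs train_step
# Training on a small batch of samples per step is called stochastic gradient descent; in most cases it converges much faster than using the full training set for every step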
for i in range(1000):
    batch_xs, batch_ys = mnist.train.next_batch(100)
    train_step.run({x:batch_xs, y_:batch_ys})

# tf.argmax(y,1) picks the digit with the highest predicted probability; tf.argmax(y_,1) finds the true class of each sample
# tf.equal checks whether the predicted class matches the true class
correct_prediction = tf.equal(tf.argmax(y,1),tf.argmax(y_,1))
# Use tf.cast to convert the booleans in correct_prediction to float32, then take the mean
accuracy = tf.reduce_mean(tf.cast(correct_prediction,tf.float32))
print("Model prediction accuracy:")
# Evaluate on the test set
print(accuracy.eval({x:mnist.test.images, y_:mnist.test.labels}))
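
The softmax, cross-entropy, and accuracy computations in the script above can be checked by hand on a tiny example. The following is a minimal pure-Python sketch, not the TensorFlow implementation: the helper names `softmax`, `cross_entropy`, and `accuracy`, the `eps` guard against `log(0)`, and the toy logits and labels are all assumptions made for illustration.

```python
import math


def softmax(logits):
    """Return softmax probabilities for one row of logits."""
    m = max(logits)  # subtract the row max for numerical stability
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(y_true, y_pred, eps=1e-12):
    """Mean over samples of -sum(y_true * log(y_pred)).

    eps is an illustrative guard against log(0); the script above has no such guard.
    """
    losses = []
    for t_row, p_row in zip(y_true, y_pred):
        losses.append(-sum(t * math.log(p + eps) for t, p in zip(t_row, p_row)))
    return sum(losses) / len(losses)


def accuracy(y_true, y_pred):
    """Fraction of rows where the argmax of the prediction equals the argmax of the label."""
    hits = 0
    for t_row, p_row in zip(y_true, y_pred):
        if p_row.index(max(p_row)) == t_row.index(max(t_row)):
            hits += 1
    return hits / len(y_true)


# Hypothetical toy data: 3 samples, 3 classes, one-hot labels
logits = [[2.0, 1.0, 0.1], [0.5, 2.5, 0.3], [1.2, 0.2, 0.4]]
labels = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]

probs = [softmax(row) for row in logits]
loss = cross_entropy(labels, probs)
acc = accuracy(labels, probs)
print("loss:", loss, "accuracy:", acc)

# Zero logits mirror the zero-initialized W and b in the script:
# every one of the 10 classes gets probability 0.1
initial_loss = cross_entropy([[1] + [0] * 9], [softmax([0.0] * 10)])
print("initial loss:", initial_loss)
```

Because `W` and `b` start at zero in the script, every logit is 0 before training, so softmax assigns 0.1 to each of the 10 classes and the loss before the first step is ln 10 ≈ 2.303. The last two lines of the sketch reproduce that value.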