TensorFlow (5): Classifying the MNIST Dataset with a CNN

Bazingaea

Category: tensorflow. Tags: CNN, tensorflow, tensorboard, MNIST

Article link: https://blog.csdn.net/Bazingaea/article/details/84137092

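Summary (auto-generated by CSDN): This article uses TensorFlow to build a CNN with two convolutional layers and one fully connected layer to classify the MNIST dataset. The training process is observed with TensorBoard, overfitting is discussed, and dropout is applied to mitigate it. The experiments indicate that dropout on the fully connected layer reduces overfitting but may lower training accuracy. Applying dropout to the convolutional layers has a large effect on model performance.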

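In TensorFlow (2), MNIST was classified with a single-layer neural network: gradient descent with a learning rate of 0.2, run for 100 iterations, reached 92% accuracy. That network is simple enough that a fairly large learning rate does not cause exploding or vanishing gradients. In a more complex network, such as the three-layer CNN used here, a learning rate of 0.2 is too large.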
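As a rough illustration of why the same learning rate can suit one model and be too large for another, here is a toy sketch in plain Python. It is not the article's network: the quadratic loss `f(w) = 0.5 * curvature * w**2`, the function name `gradient_descent`, and the two curvature values are all assumptions chosen for illustration.

```python
def gradient_descent(curvature, learning_rate, steps=20, w0=1.0):
    """Minimise the toy loss f(w) = 0.5 * curvature * w**2 by plain gradient descent."""
    w = w0
    for _ in range(steps):
        grad = curvature * w          # derivative of the toy loss at w
        w = w - learning_rate * grad  # standard gradient descent update
    return w


# Gentle loss surface: each step scales w by (1 - 0.2 * 1) = 0.8, so w shrinks toward 0.
w_gentle = gradient_descent(curvature=1.0, learning_rate=0.2)

# Steep loss surface: each step scales w by (1 - 0.2 * 20) = -3, so |w| grows every step.
w_steep = gradient_descent(curvature=20.0, learning_rate=0.2)

print("gentle surface, final |w|:", abs(w_gentle))
print("steep surface,  final |w|:", abs(w_steep))
```

With a learning rate of 0.2, the first run converges and the second diverges. A deeper network can behave like the second case in some directions of its loss surface, which is one plausible reading of why a smaller learning rate is needed for the CNN.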

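This article combines the convolutional neural network model from TensorFlow (3) with the TensorBoard workflow from TensorFlow (4). The network has three layers: two convolutional layers and one fully connected layer. A convolution over a feature map is usually followed by a pooling operation, so the pooling layer is counted as part of the convolutional layer. The two are implemented separately in the code; they are only grouped together, and jointly called one convolutional layer, when counting the network's layers.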
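To make the layer structure concrete, the sketch below traces how the spatial size of a 28x28 MNIST image could change through two convolution-plus-pooling blocks. The kernel size (5x5), pooling window (2x2), strides, and `SAME` padding are assumptions for illustration only; the values actually used are whatever the article's own code specifies. The helper names `output_size` and `conv_pool_block` are hypothetical.

```python
import math


def output_size(n, kernel, stride, padding):
    """Spatial output size of a conv or pooling op under TensorFlow's padding rules."""
    if padding == "SAME":
        return math.ceil(n / stride)
    if padding == "VALID":
        return math.ceil((n - kernel + 1) / stride)
    raise ValueError("padding must be 'SAME' or 'VALID'")


def conv_pool_block(n, conv_kernel=5, pool=2):
    """One 'convolutional layer' in the article's sense: convolution followed by pooling."""
    n = output_size(n, conv_kernel, stride=1, padding="SAME")  # assumed 5x5 conv, stride 1
    n = output_size(n, pool, stride=pool, padding="SAME")      # assumed 2x2 pool, stride 2
    return n


size_after_block1 = conv_pool_block(28)
size_after_block2 = conv_pool_block(size_after_block1)
print(size_after_block1, size_after_block2)
```

Under these assumptions, the fully connected layer would receive a feature map of side `size_after_block2`, flattened together with however many channels the second convolution produces.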

The details are covered in the previous two posts, so the code is given directly here, with comments where explanation is needed:

# Imports
import numpy as np
import h5py
import tensorflow as tf

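# MNIST data
# Note: the data layout differs from the single-layer network. A CNN does not need
# the data arranged as an (m*n) matrix, i.e. the feature values of each image do not
# have to be merged into a single vector.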
from tensorflow.examples.tutorials.mnist import input_data
mnist = input_data.read_data_sets('MNIST_data',one_hot = True)
train_x = mnist.train.images
train_y = mnist.train.labels  # (55000, 10)

test_x = mnist.test.images
test_y = mnist.test.labels  # (10000, 10)

train_x = train_x.reshape([-1,28,28,1]) #(55000, 28, 28, 1)
test_x = test_x.reshape([-1,28,28,1]) # (10000, 28, 28, 1)


# Helper that attaches a set of summaries (mean, stddev, max, min, histogram) to a variable
def variable_summaries(var):
    with tf.name_scope('summaries'):
        mean = tf.reduce_mean(var)
        tf.summary.scalar('mean', mean)
        with tf.name_scope('stddev'):
            stddev = tf.sqrt(tf.reduce_mean(tf.square(var - mean)))
        tf.summary.scalar('stddev', stddev)
        tf.summary.scalar('max', tf.reduce_max(var))
        tf.summary.scalar('min', tf.reduce_min(var))
        tf.summary.histogram('histogram', var)


# Create placeholders
def create_placeholders(n_H0,n_W0,n_C0,n_y):
    with tf.name_scope('input'):
        X = tf.placeholder(shape=[None,n_H0,n_W0,n_C0],dtype = tf.float32,name='x_input')
        Y = tf.placeholder(shape=[None,n_y],dtype = tf.float32,name='y_input')
    return X,Y


# Forward propagation
def forward_propagation(X):
    tf.set_random_seed(1)
    
    # First convolutional layer: conv -> ReLU -> pooling
    with tf.name_scope(