基于Tensorflow的双层循环神经网络RNN(LSTM)实现(MNIST数据集)

最新推荐文章于 2021-05-05 12:20:41 发布

kawhi849

最新推荐文章于 2021-05-05 12:20:41 发布

阅读量3.2k

点赞数 2

分类专栏：机器学习

本文链接：https://blog.csdn.net/qq_35014850/article/details/82556506

版权

本文通过Tensorflow构建了双层循环神经网络（LSTM），并应用于MNIST手写数字数据集的分类任务。训练结果显示了模型的性能。

摘要由CSDN通过智能技术生成

本文使用双层LSTM网络，实现对MNIST数据集的分类。

理解LSTM网络的传送门：https://blog.csdn.net/jerr__y/article/details/58598296

参考：https://blog.csdn.net/jerr__y/article/details/61195257

# -*- coding:utf-8 -*-
import tensorflow as tf
from tensorflow.examples.tutorials.mnist import input_data

# 设置 GPU 按需增长
config = tf.ConfigProto()
config.gpu_options.allow_growth = True
sess = tf.Session(config=config)

# 首先导入数据，看一下数据的形式
mnist = input_data.read_data_sets('MNIST_data', one_hot=True)
print (mnist.train.images.shape)

##########################################################################设置模型超参数

lr = 1e-3
# 在训练和测试的时候，我们想用不同的 batch_size.所以采用占位符的方式
keep_prob = tf.placeholder(tf.float32, [])
batch_size = tf.placeholder(tf.int32, [])

# 每个时刻的输入特征是28维的，就是每个时刻输入一行，一行有 28 个像素
input_size = 28
# 时序持续长度为28，即每做一次预测，需要先输入28行
timestep_size = 28
# 每个隐含层的节点数
hidden_size = 256
# LSTM layer 的层数
layer_num = 2
# 最后输出分类类别数量，如果是回归预测的话应该是 1
class_num = 10

_X = tf.placeholder(tf.float32, [None, 784])
y = tf.placeholder(tf.float32, [None, class_num])

#############