LSTM in Practice: Recognizing Handwritten Digits

Dance_Jacky

Category: Deep Learning. Tags: LSTM, handwriting recognition, deep learning

Article link: https://blog.csdn.net/Dance_Jacky/article/details/85019909

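Environment: Python 3, PyCharm. The code imports tensorflow.contrib and tensorflow.examples.tutorials, so it needs TensorFlow 1.x.

Code: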

# -*- coding: utf-8 -*-
import tensorflow as tf
from tensorflow.contrib import rnn
import numpy as np
from tensorflow.examples.tutorials.mnist import input_data
import os
os.environ['TF_CPP_MIN_LOG_LEVEL'] = '2'

input_vec_size = lstm_size = 28  # dimension of each input vector (one image row)
time_step_size = 28  # number of time steps (rows per image)

batch_size = 128
test_size = 256

def init_weights(shape):
    return tf.Variable(tf.random_normal(shape,stddev = 0.01))

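def model(X, W, B, lstm_size):
    # X: (batch, time_step_size, input_vec_size) -> (time_step_size, batch, input_vec_size)
    XT = tf.transpose(X, [1, 0, 2])
    # Flatten to (time_step_size * batch, input_vec_size)
    XR = tf.reshape(XT, [-1, lstm_size])
    # Split into a list of time_step_size tensors, each (batch, input_vec_size)
    X_split = tf.split(XR, time_step_size, 0)
    lstm = rnn.BasicLSTMCell(lstm_size, forget_bias=1.0, state_is_tuple=True)
    outputs, _states = rnn.static_rnn(lstm, X_split, dtype=tf.float32)
    # Classify from the output at the last time step
    return tf.matmul(outputs[-1], W) + B, lstm.state_size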
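# Load MNIST with one-hot labels
mnist = input_data.read_data_sets("MNIST_data/", one_hot=True)
trX, trY = mnist.train.images, mnist.train.labels
teX, teY = mnist.test.images, mnist.test.labels

# Reshape flat 784-pixel vectors into 28x28 images (28 time steps of 28 pixels)
trX = trX.reshape(-1, 28, 28)
teX = teX.reshape(-1, 28, 28)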

X = tf.placeholder("float",[None,28,28])
Y = tf.placeholder("float",[None,10])
W = init_weights([lstm_size,10])
B = init_weights([10])

py_x,state_size = model(X,W,B,lstm_size)
cost = tf.reduce_mean(tf.nn.softmax_cross_entropy_with_logits(logits=py_x,labels=Y))
train_op = tf.train.RMSPropOptimizer(0.001,0.9).minimize(cost)
predict_op = tf.argmax(py_x,1)

session_conf = tf.ConfigProto()
session_conf.gpu_options.allow_growth = True

with tf.Session(config = session_conf) as sess:
    tf.global_variables_initializer().run()
    for i in range(100):
        # One pass over the training set in full batches; a trailing partial batch is skipped
        for start, end in zip(range(0, len(trX), batch_size),
                              range(batch_size, len(trX) + 1, batch_size)):
            sess.run(train_op, feed_dict={X: trX[start:end], Y: trY[start:end]})

        # Evaluate on a random sample of test_size test images
        test_indices = np.arange(len(teX))
        np.random.shuffle(test_indices)
        test_indices = test_indices[0:test_size]

        predictions = sess.run(predict_op, feed_dict={X: teX[test_indices]})
        print(i, np.mean(np.argmax(teY[test_indices], axis=1) == predictions))
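
The transpose, reshape, and split steps at the start of model() can be hard to follow from the TensorFlow code alone. The sketch below mirrors those three steps in plain NumPy on a small random batch, to show which shapes come out. It is an illustration only: the function name split_into_time_steps, the constants, and the 4-image batch are hypothetical and not part of the original code.

```python
import numpy as np

TIME_STEPS = 28   # rows per image, like time_step_size in the post
INPUT_SIZE = 28   # pixels per row, like input_vec_size in the post

def split_into_time_steps(batch):
    """Mirror the transpose/reshape/split steps of model() with NumPy.

    batch: array of shape (batch_size, TIME_STEPS, INPUT_SIZE).
    Returns a list of TIME_STEPS arrays, each (batch_size, INPUT_SIZE).
    """
    xt = np.transpose(batch, (1, 0, 2))   # (TIME_STEPS, batch_size, INPUT_SIZE)
    xr = xt.reshape(-1, INPUT_SIZE)       # (TIME_STEPS * batch_size, INPUT_SIZE)
    return np.split(xr, TIME_STEPS, axis=0)

# Hypothetical batch of 4 random "images", used only to inspect shapes
batch = np.random.rand(4, TIME_STEPS, INPUT_SIZE)
steps = split_into_time_steps(batch)

print(len(steps), steps[0].shape)
# Entry t of the list holds row t of every image in the batch
print(np.array_equal(steps[5], batch[:, 5, :]))
```

Each list entry is what the LSTM cell receives at one time step: row t of every image in the batch. This is why the model reads an image top to bottom, one 28-pixel row at a time.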

Experimental results:

Source code:
https://github.com/geroge-gao/deeplearning/tree/master/LSTM
