单向LSTM与双向LSTM对比

最新推荐文章于 2024-10-22 14:21:23 发布

kudou1994

最新推荐文章于 2024-10-22 14:21:23 发布

阅读量2.5w

点赞数 4

分类专栏： # 机器翻译学习神经机器翻译

本文链接：https://blog.csdn.net/kudou1994/article/details/80851227

版权

本文通过对比实验展示了单向和双向LSTM在手写数字识别任务中的性能。使用TensorFlow数据集，单向LSTM在100次训练后准确率趋于稳定，而双向LSTM虽然需要更多训练次数，但100次后准确率显著提高，稳定性更强。结论是双向LSTM在图像识别中表现更优。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

一个简单的DEMO：实现手写数字图片的识别

单向LSTM

利用的数据集是tensorflow提供的一个手写数字数据集。该数据集是一个包含55000张28*28的数据集。
训练100次
识别准确率还不是很稳定，但是从第17次开始就趋于相对稳定的状态了。

# -*- coding: utf-8 -*-
import tensorflow as tf
from tensorflow.contrib import rnn

import numpy as np
#import input_data
from tensorflow.examples.tutorials.mnist import input_data #####
mnist = input_data.read_data_sets('MNIST_data', one_hot=True) #####

# configuration
#                        O * W + b -> 10 labels for each image, O[? 28], W[28 10], B[10]
#                       ^ (O: output 28 vec from 28 vec input)
#                       |
#      +-+  +-+       +--+
#      |1|->|2|-> ... |28| time_step_size = 28
#      +-+  +-+       +--+
#       ^    ^    ...  ^
#       |    |         |
# img1:[28] [28]  ... [28]
# img2:[28] [28]  ... [28]
# img3:[28] [28]  ... [28]
# ...
# img128 or img256 (batch_size or test_size 256)
#      each input size = input_vec_size=lstm_size=28

# configuration variables
input_vec_size = lstm_size = 28 # 输入向量的维度
time_step_size = 28 # 循环层长度

batch_size = 128
test_size = 256

def init_weights(shape):
    return tf.Variable(tf.random_normal(shape, stddev=0.01))


def model(X, W, B, lstm_size):
    # X, input shape: (batch_size, time_step

最低0.47元/天解锁文章