初始化隐藏状态【h0】和细胞状态【c0】到RNN,LSTM,GRU --OCR(光学字符识别)

最新推荐文章于 2024-09-12 14:45:47 发布

Ai-编码

最新推荐文章于 2024-09-12 14:45:47 发布

阅读量586

点赞数 5

文章标签： rnn lstm gru ocr 循环神经网络门控循环单元

本文链接：https://blog.csdn.net/qq_33700934/article/details/139785905

版权

在PyTorch中，我们可以通过直接创建具有适当形状和类型的张量来自定义初始化LSTM的隐藏状态和细胞状态。这些张量的形状应该是(num_layers * num_directions, batch_size, hidden_size)，其中num_layers是LSTM层数，num_directions是LSTM的方向数（对于双向LSTM为2，对于单向LSTM为1），batch_size是批量中的样本数，hidden_size是隐藏层的大小。
以下展示了一张图片
如何自定义初始化隐藏状态和细胞状态到RNN,LSTM,GRU 网络：
…导入必备库…《
import torch.nn as nn
import torch
from PIL import Image
from torchvision import transforms
…RNN网络…
img = Image.open(‘00000.jpg’).convert(‘L’) #本地图片一张
tt = transforms.ToTensor()
pic = tt(img)
print(pic.size())
rnn = nn.RNN(input_size=100, hidden_size=256, num_layers=1,batch_first=True)
h0 = torch.randn(1, 1, 256) #[num_layers * num_directions, batch_size, hidden_size]。
out, hn= rnn(pic,h0)
print(out)
print(out.shape)
print(hn)
print(hn.shape)
…LSTM网络…

img = Image.open(‘00000.jpg’).convert(‘L’)
tt = transforms.ToTensor()
pic = tt(img)
print(pic.size())

ls = nn.LSTM(input_size=100, hidden_size=256, num_layers=2)

h0 = torch.zeros(2, pic.size(1), 256)
c0 = torch.zeros(2, pic.size(1), 256)
output, (hn, cn) = ls(pic, (h0, c0))
print(f’output:‘, output)
print(f’outputshape:’, output.shape)
print(f’hn:‘, hn)
print(f’hnshape:’, hn.shape)
print(f’cn:‘, cn)
print(f’cnshape:’, cn.shape)
…GRU网络…
img = Image.open(‘00000.jpg’).convert(‘L’) #本地图片一张
tt = transforms.ToTensor()
pic = tt(img)
print(pic.size())
rnn = nn.GRU(input_size=100, hidden_size=256, num_layers=1,batch_first=True)
h0 = torch.randn(1, 1, 256) #[num_layers * num_directions, batch_size, hidden_size]。
out, hn= rnn(pic,h0)
print(out)
print(out.shape)
print(hn)
print(hn.shape)
…
h0和c0被初始化，可以根据需要使用其他值或策略来初始化它们。重要的是要确保这些张量的形状与RNN,LSTM,GRU层期望的形状相匹配。

Ai-编码

关注

5
点赞
踩
9

收藏

觉得还不错? 一键收藏
0
评论
复制链接

分享到 QQ

分享到新浪微博

扫一扫