pytorch 中的torch.nn.LSTM函数

最新推荐文章于 2024-07-13 13:25:22 发布

learn-to-live

最新推荐文章于 2024-07-13 13:25:22 发布

阅读量589

点赞数 3

分类专栏：算法

本文链接：https://blog.csdn.net/ineedstudytosurvive/article/details/115080322

版权

算法专栏收录该内容

6 篇文章 0 订阅

订阅专栏

LSTM是RNN的一种变体
主要包括以下几个参数：
input_size:输入的input中的参数维度，即文本中的embedding_dim
hidden_size:隐藏层的维度
num_layers:LSTM的层数，一般为2-3层，默认为1
bias:是否使用偏置向，默认为True
batch_first:是否输入的input第一个为batch_size,pytorch默认False,即输入的input的三维张量是seq_len放在第一个
dropout:是否丢弃部分神经元,默认为0
bidirectional:是否使用双向LSTM ，默认False

输入：inputs,(h0,c0)
其中inputst是一个三维张量
主要包括[batch_size,seq_len,input_size]
h0是0时刻的隐层，默认为全0
c0是0时刻的cell状态，默认为全0
h0,c0的维度都为：[batch_size,num_layers*num_directions,hidden_size]

输出：outputs,(hn,cn)
output的维度[batch_size,seq_len,num_directions*hidden_size]
hn和cn是第n时刻的隐层和cell状态，维度和h0,c0相同。

下面是代码示例：

Talk is cheap.Show me the code.

input:
假设输入是[64,512,100]
LSTM = nn.LSTM(100,128,batch_first=True)
x1 = torch.randn([64,512,100)
output,(hn,cn) = LSTM(x1)

output.shape的shape[batch,seq_len,num_directions*hidden_size])
[64, 512, 128]
hn,cn的维度均为[num_layers * num_directions,batch,hidden_size]
[1,64,128]

如果是LSTM = nn.LSTM(100,128,batch_first=True,directional=True)
LSTM = nn.LSTM(100,128,batch_first=True)
x1 = torch.randn([64,512,100)
output,(hn,cn) = LSTM(x1)

那么output的维度将变成[64,512,256]
hn,cn的维度会变成[2,64,128]

learn-to-live

关注

3
点赞
踩
4

收藏

觉得还不错? 一键收藏
0
评论
pytorch 中的torch.nn.LSTM函数

LSTM是RNN的一种变体主要包括以下几个参数：input_size:输入的input中的参数维度，即文本中的embedding_dimhidden_size:隐藏层的维度num_layers:LSTM的层数，一般为2-3层，默认为1bias:是否使用偏置向，默认为Truebatch_first:是否输入的input第一个为batch_size,pytorch默认False,即输入的input的三维张量是seq_len放在第一个dropout:是否丢弃部分神经元,默认为0bidirectio
复制链接

扫一扫

专栏目录