The code snippet below comes from https://github.com/linjieli222/VQA_ReGAT/blob/master/model/language_model.py. It wraps the LSTM and GRU implementations provided by PyTorch. In the bidirectional case, forward concatenates two hidden states: the forward direction's output at the last timestep and the backward direction's output at the first timestep. forward_all returns the outputs of all timesteps.
Official documentation: https://pytorch.org/docs/stable/generated/torch.nn.LSTM.html?highlight=lstm#torch.nn.LSTM
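As a quick sanity check of that output layout (a minimal sketch, not from the repo; the input size, hidden size, and sequence length are made-up illustration values):

import torch
import torch.nn as nn

torch.manual_seed(0)
hid = 4
rnn = nn.LSTM(input_size=3, hidden_size=hid, num_layers=1,
              bidirectional=True, batch_first=True)
x = torch.randn(2, 5, 3)             # [batch, sequence, in_dim]
output, (h_n, c_n) = rnn(x)          # output: [2, 5, 2 * hid]

# For a bidirectional RNN, the first half of the feature dimension is the
# forward direction and the second half is the backward direction. Each
# direction's final state sits at the timestep where it finished reading:
assert torch.allclose(output[:, -1, :hid], h_n[0])  # forward, last step
assert torch.allclose(output[:, 0, hid:], h_n[1])   # backward, first step

With that layout confirmed, here is the module itself: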
import torch
import torch.nn as nn


class QuestionEmbedding(nn.Module):
    def __init__(self, in_dim, num_hid, nlayers, bidirect, dropout,
                 rnn_type='GRU'):
        """Module for question embedding"""
        super(QuestionEmbedding, self).__init__()
        assert rnn_type == 'LSTM' or rnn_type == 'GRU'
        rnn_cls = nn.LSTM if rnn_type == 'LSTM' else nn.GRU
        self.rnn = rnn_cls(
            in_dim, num_hid, nlayers,
            bidirectional=bidirect,
            dropout=dropout,
            batch_first=True)
        self.in_dim = in_dim
        self.num_hid = num_hid
        self.nlayers = nlayers
        self.rnn_type = rnn_type
        # 1 for a unidirectional RNN, 2 for a bidirectional one
        self.ndirections = 1 + int(bidirect)

    def init_hidden(self, batch):
        # Borrow a parameter tensor just to match its dtype/device, then
        # build zero-initialized hidden states of the right shape:
        # [nlayers * ndirections, batch, num_hid]
        weight = next(self.parameters()).data
        hid_shape = (self.nlayers * self.ndirections, batch, self.num_hid)
        if self.rnn_type == 'LSTM':
            # An LSTM needs both a hidden state and a cell state
            return (weight.new(*hid_shape).zero_(),
                    weight.new(*hid_shape).zero_())
        else:
            return weight.new(*hid_shape).zero_()

    def forward(self, x):
        # x: [batch, sequence, in_dim]
        batch = x.size(0)
        hidden = self.init_hidden(batch)
        self.rnn.flatten_parameters()
        output, hidden = self.rnn(x, hidden)
        if self.ndirections == 1:
            # Unidirectional: the last timestep summarizes the sequence
            return output[:, -1]
        # Bidirectional: concatenate the forward direction's last timestep
        # with the backward direction's first timestep -> [batch, 2 * num_hid]
        forward_ = output[:, -1, :self.num_hid]
        backward = output[:, 0, self.num_hid:]
        return torch.cat((forward_, backward), dim=1)

    def forward_all(self, x):
        # x: [batch, sequence, in_dim]
        batch = x.size(0)
        hidden = self.init_hidden(batch)
        self.rnn.flatten_parameters()
        output, hidden = self.rnn(x, hidden)
        # Return the hidden states of every timestep:
        # [batch, sequence, num_hid * ndirections]
        return output
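A minimal usage sketch, assuming made-up dimensions (300-d word embeddings, 512 hidden units, 14-token questions, batch of 32); these numbers are illustrative, not taken from the repo:

# Hypothetical sizes chosen only for illustration
q_emb = QuestionEmbedding(in_dim=300, num_hid=512, nlayers=1,
                          bidirect=False, dropout=0.0, rnn_type='GRU')
q = torch.randn(32, 14, 300)   # [batch, sequence, in_dim]
last = q_emb(q)                # [32, 512]: last timestep only
seq = q_emb.forward_all(q)     # [32, 14, 512]: every timestep

With bidirect=True, forward would instead return a [32, 1024] tensor, since the final states of the two directions are concatenated.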