The attention module has to be implemented as a custom Keras layer.
Briefly, the attention module combines the LSTM outputs from n time steps into a single vector, which is then fed into the next RNN.
I drew a diagram of this while working through Andrew Ng's course.
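Concretely, the layer learns one scoring vector: each time step's output is scored through tanh, the scores are softmax-normalized over time, and the outputs are summed with those weights. Here is a minimal NumPy sketch of that same computation; the batch size, sequence length, and hidden size below are made-up illustration values:

import numpy as np

# Dummy LSTM output: batch of 2 sequences, 5 time steps, 8 hidden units
h = np.random.randn(2, 5, 8)
W = np.random.randn(8, 1)                  # learned scoring vector

e = np.tanh(h @ W)[:, :, 0]                # one score per time step, shape (2, 5)
a = np.exp(e) / np.exp(e).sum(axis=1, keepdims=True)   # softmax over time
v = (h * a[:, :, None]).sum(axis=1)        # weighted sum, shape (2, 8)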
from keras import backend as K
from keras import initializers
from keras.engine.topology import Layer

class AttentionLayer(Layer):
    def __init__(self, **kwargs):
        self.init = initializers.get('normal')
        super(AttentionLayer, self).__init__(**kwargs)

    def build(self, input_shape):
        # One scoring weight per feature of the LSTM output
        self.W = self.add_weight(name='att_weight',
                                 shape=(input_shape[-1], 1),
                                 initializer=self.init,
                                 trainable=True)
        super(AttentionLayer, self).build(input_shape)

    def call(self, x, mask=None):
        # x: (batch, timesteps, features)
        e = K.tanh(K.squeeze(K.dot(x, self.W), axis=-1))  # one score per time step
        ai = K.exp(e)
        weights = ai / K.expand_dims(K.sum(ai, axis=1))   # softmax over time
        weighted_input = x * K.expand_dims(weights)       # reweight each time step
        return K.sum(weighted_input, axis=1)              # (batch, features)

    def compute_output_shape(self, input_shape):
        return (input_shape[0], input_shape[-1])
# Used in exactly the same way as any other layer
l_lstm = Bidirectional(LSTM(100, return_sequences=True))(embedded_seq)
attention = AttentionLayer()(l_lstm)
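For context, here is a minimal sketch of the layer inside a complete classification model; the vocabulary size, sequence length, embedding dimension, and class count are placeholder values chosen only for illustration:

from keras.models import Model
from keras.layers import Input, Embedding, Dense, Bidirectional, LSTM

MAX_LEN, VOCAB, EMB_DIM, N_CLASSES = 100, 20000, 128, 5   # placeholder hyperparameters

seq_input = Input(shape=(MAX_LEN,), dtype='int32')
embedded_seq = Embedding(VOCAB, EMB_DIM)(seq_input)
l_lstm = Bidirectional(LSTM(100, return_sequences=True))(embedded_seq)
attention = AttentionLayer()(l_lstm)       # (batch, 200) after attention pooling
preds = Dense(N_CLASSES, activation='softmax')(attention)

model = Model(seq_input, preds)
model.compile(loss='categorical_crossentropy', optimizer='adam', metrics=['acc'])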