深度学习作业L5W3（1）：Neural Machine Translation

最新推荐文章于 2022-01-05 16:29:38 发布

awake020

最新推荐文章于 2022-01-05 16:29:38 发布

阅读量190

点赞数

分类专栏：深度学习笔记文章标签： python 深度学习机器学习 tensorflow 算法

本文链接：https://blog.csdn.net/weixin_44334615/article/details/106346005

版权

利用attention模型构造一个日期翻译模型（将各种日期描述翻译成YYYY-MM-DD）

基本单元是字母，所以不需要embedding

在这里插入图片描述

attention计算

首先利用RepeatVector复制状态s（输出层LSTM状态值），利用Concatenate将s和a（处理层LSTM输出值）组合，在利用两层densor和一个softmax求出atteition矩阵，利用Dot层进行矩阵乘法求出输出层LSTM的输入值

全局变量

# Defined shared layers as global variables
repeator = RepeatVector(Tx)
concatenator = Concatenate(axis=-1)
densor1 = Dense(10, activation = "tanh")
densor2 = Dense(1, activation = "relu")
activator = Activation(softmax, name='attention_weights') # We are using a custom softmax(axis = 1) loaded in this notebook
dotor = Dot(axes = 1)

求输入值context

# GRADED FUNCTION: one_step_attention

def one_step_attention(a, s_prev):
    """
    Performs one step of attention: Outputs a context vector computed as a dot product of the attention weights
    "alphas" and the hidden states "a" of the Bi-LSTM.
    
    Arguments:
    a -- hidden state output of the Bi-LSTM, numpy-array of shape (m, Tx, 2*n_a)
    s_prev -- previous hidden state of the (post-attention) LSTM, numpy-array of shape (m, n_s)
    
    Returns:
    context -- context vector, input of the next (post-attetion) LSTM cell
    """
    
    ### START CODE HERE ###
    # Use repeator to repeat s_prev to be of shape (m, Tx, n_s) so that you can concatenate it with all hidden states "a" (≈ 1 line)
    s_prev = repeator(s_prev)
    # Use concatenator to concatenate a and s_prev on the last axis (≈ 1 line)
    concat = concatenator([a, s_prev]

最低0.47元/天解锁文章

awake020

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
深度学习作业L5W3（1）：Neural Machine Translation

利用attention模型构造一个日期翻译模型（将各种日期描述翻译成YYYY-MM-DD）基本单元是字母，所以不需要embeddingattention计算首先利用RepeatVector复制状态s（输出层LSTM状态值），利用Concatenate将s和a（处理层LSTM输出值）组合，在利用两层densor和一个softmax求出atteition矩阵，利用Dot层进行矩阵乘法求出输出层LSTM的输入值全局变量# Defined shared layers as global variable
复制链接

扫一扫