一组向量和一组向量的attention计算，pytorch实现

最新推荐文章于 2024-04-15 12:18:20 发布

FocusOneThread

最新推荐文章于 2024-04-15 12:18:20 发布

阅读量1k

点赞数

分类专栏：深度学习 PyTorch

本文链接：https://blog.csdn.net/guotong1988/article/details/86503378

版权

深度学习同时被 2 个专栏收录

212 篇文章 5 订阅 ¥19.90 ¥99.00

订阅专栏

超级会员免费看

PyTorch

83 篇文章 2 订阅 ¥19.90 ¥99.00

订阅专栏

超级会员免费看

本文介绍了一种使用PyTorch实现的方法，用于计算一组向量与另一组向量之间的注意力机制。详细阐述了计算过程，并提供了相关的代码示例。

摘要由CSDN通过智能技术生成

class GetAttentionHiddens(nn.Module):
    def __init__(self, input_size, attention_hidden_size, similarity_attention = False):
        super(GetAttentionHiddens, self).__init__()
        self.scoring = AttentionScore(input_size, attention_hidden_size, similarity_score=similarity_attention)

    def forward(self, x1, x2, x2_mask, x3=None, scores=None, return_scores=False, drop_diagonal=False):
        """
        Using x1, x2 to calculate attention score, but x1 will take back info from x3.
        If x3 is not specified, x1 will attend on x2.

        x1: [batch, len1, x1_input_size]
        x2: [batch, len2, x2_input_size]
        x2_mask: [batch, len2]

        x3: [batch, len2, x3_input_size]
        """
        if x3 is Non

了解本专栏