Global Attention Mechanism: Retain Information to Enhance Channel-Spatial Interactions(GAM)

最新推荐文章于 2024-03-09 22:43:38 发布

查无此人☞

最新推荐文章于 2024-03-09 22:43:38 发布

阅读量1.2k

点赞数 1

分类专栏：注意力总结文章标签：深度学习 pytorch 神经网络

本文链接：https://blog.csdn.net/hb_learing/article/details/121975033

版权

注意力总结专栏收录该内容

1 篇文章 0 订阅

订阅专栏

本文介绍了一个名为GAM_Attention的PyTorch模块，它结合了通道注意力和空间注意力，用于图像处理任务。通过线性MLP和卷积操作实现特征融合，适用于ImageNet-1k实验。核心部分展示了如何构造和使用这个模块，以及其在输入数据上的操作过程。

摘要由CSDN通过智能技术生成

Codes of pytorch:

import torch.nn as nn  
import torch  


class GAM_Attention(nn.Module):  
    def __init__(self, in_channels, out_channels, rate=4):  
        super(GAM_Attention, self).__init__()  

        self.channel_attention = nn.Sequential(  
            nn.Linear(in_channels, int(in_channels / rate)),  
            nn.ReLU(inplace=True),  
            nn.Linear(int(in_channels / rate), in_channels)  ###通道注意力  MLP来实现
        )  
      
        self.spatial_attention = nn.Sequential(  
            nn.Conv2d(in_channels, int(in_channels / rate), kernel_size=7, padding=3),  
            nn.BatchNorm2d(int(in_channels / rate)),  
            nn.ReLU(inplace=True),  
            nn.Conv2d(int(in_channels / rate), out_channels, kernel_size=7, padding=3),  #空间注意力  卷积实现
            nn.BatchNorm2d(out_channels)  
        )  

    def forward(self, x):  
        b, c, h, w = x.shape  
        print("Input size:",x.shape)
        x_permute = x.permute(0, 2, 3, 1).view(b, -1, c)  #（b,c,h*w）
        print("维度转换:x_permute = x.permute(0, 2, 3, 1).view(b, -1, c):",x_permute.shape)
        x_att_permute = self.channel_attention(x_permute).view(b, h, w, c)    #(b,h,w,c)
        
        x_channel_att = x_att_permute.permute(0, 3, 1, 2)  #(b,c,h,w)
        print("送入通道注意力子模块然后恢复到原始的size以便于x计算通道注意力:x_att_permute = self.channel_attention(x_permute).view(b, h, w, c).permute(0, 3, 1, 2):",x_channel_att.shape)
      
        x = x * x_channel_att  ###计算通道注意力
        print("Get channel attention map:",x.shape)
      
        x_spatial_att = self.spatial_attention(x).sigmoid()  
        print("把通道注意力图送入空间注意力子模块:",x_spatial_att.shape)
        out = x * x_spatial_att  
        print("得到的空间通道注意力图与通道注意力图点乘得到最后的GAM注意力图:",out.shape)
      
        return out  

  

if __name__ == '__main__':  
    x = torch.randn(1, 64, 32, 48)  
    b, c, h, w = x.shape  
    net = GAM_Attention(in_channels=c, out_channels=c)  
    y = net(x)

code results:
在这里插入图片描述

Title and authors:
在这里插入图片描述
paper address:
https://arxiv.org/pdf/2112.05561v1.pdf

Overview of GAM:
在这里插入图片描述
Channel and Spatial attention submodule

Experiment in ImageNet-1k

查无此人☞

关注

1
点赞
踩
2

收藏

觉得还不错? 一键收藏
1
评论
Global Attention Mechanism: Retain Information to Enhance Channel-Spatial Interactions(GAM)

本文提出了一种通过减少信息弥散和放大全局交互表示来提高深度神经网络性能的全局注意力机制。本文引入了3D-permutation 与多层感知器的通道注意力和卷积空间注意力子模块。在CIFAR-100和ImageNet-1K上对所提出的图像分类机制的评估表明，本文的方法稳定地优于最近的几个注意力机制，包括ResNet和轻量级的MobileNet。
复制链接

扫一扫

专栏目录