This decomposition is also called Large Kernel Attention (LKA). As shown in the figure above, a convolution with a very large kernel size is decomposed into a depth-wise convolution, a depth-wise dilated convolution, and a 1×1 convolution. This greatly reduces FLOPs and parameter count, while the enlarged effective receptive field overcomes the locality limitation of small-kernel convolutions.
import torch.nn as nn


class AttentionModule(nn.Module):
    def __init__(self, dim):
        super().__init__()
        # 5x5 depth-wise convolution: local spatial aggregation
        self.conv0 = nn.Conv2d(dim, dim, 5, padding=2, groups=dim)
        # 7x7 depth-wise dilated convolution (dilation 3): long-range spatial aggregation
        self.conv_spatial = nn.Conv2d(dim, dim, 7, stride=1, padding=9, groups=dim, dilation=3)
        # 1x1 convolution: channel mixing
        self.conv1 = nn.Conv2d(dim, dim, 1)

    def forward(self, x):
        u = x.clone()
        # attention map produced by the decomposed large-kernel convolution
        attn = self.conv0(x)
        attn = self.conv_spatial(attn)
        attn = self.conv1(attn)
        # element-wise reweighting of the input by the attention map
        return u * attn
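As a quick sanity check on the FLOPs/parameter claim, the sketch below compares the parameter count of AttentionModule with a single dense large-kernel convolution of the same channel width. The 21×21 baseline kernel size and dim = 64 are illustrative assumptions (21×21 is the decomposition target used in the VAN paper, not something stated in this post).

import torch

# Minimal sketch, not from the original post: decomposed LKA vs. a dense large-kernel conv.
dim = 64
lka = AttentionModule(dim)
big_conv = nn.Conv2d(dim, dim, kernel_size=21, padding=10)  # hypothetical dense baseline

lka_params = sum(p.numel() for p in lka.parameters())
big_params = sum(p.numel() for p in big_conv.parameters())
print(f"LKA params: {lka_params}")         # ~9k for dim = 64
print(f"21x21 conv params: {big_params}")  # ~1.8M for dim = 64

x = torch.randn(1, dim, 32, 32)
print(lka(x).shape)  # torch.Size([1, 64, 32, 32]) -- spatial size is preserved

The gap comes from the grouping: the two depth-wise convolutions grow linearly with dim, and only the cheap 1×1 convolution mixes channels, whereas a dense K×K convolution scales with dim² · K².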