YOLOV8改进:增加注意力模块,以CBAM模块为例

1,conv.py修改

在ultralytics\nn\modules\conv.py该文件中,我们可以写入自己的注意力模块,或者使用V8已经提供的CBAM模块(见代码的CBAM类),其实yolov8中已经加入了CBAM

第一部在__all__中加入CBAM(模块名)

__all__ = (
    "Conv",
    "Conv2",
    "LightConv",
    "DWConv",
    "DWConvTranspose2d",
    "ConvTranspose",
    "Focus",
    "GhostConv",
    "ChannelAttention",
    "SpatialAttention",
    "CBAM", #🚀
    "Concat",
    "RepConv",
    "BiFPN_Concat2",
    "BiFPN_Concat3",
    "RFACA",
)

第二步;在Conv.py中加入CBAM模块代码

#🚀
class CBAM(nn.Module):
    """Convolutional Block Attention Module."""

    def __init__(self, c1, kernel_size=7):
        """Initialize CBAM with given input channel (c1) and kernel size."""
        super().__init__()
        self.channel_attention = ChannelAttention(c1)
        self.spatial_attention = SpatialAttention(kernel_size)

    def forward(self, x):
        """Applies the forward pass through C1 module."""
        return self.spatial_attention(self.channel_attention(x))

2,__init__.py修改

添加完毕后在ultralytics\nn\modules\__init__.py中导入模块,一是在from.conv import中导入。二是在__all__=()中添加

导包

from .conv import (
    CBAM,
    ChannelAttention,
    Concat,
    Conv,
    Conv2,
    ConvTranspose,
    DWConv,
    DWConvTranspose2d,
    Focus,
    GhostConv,
    LightConv,
    RepConv,
    SpatialAttention,
    BiFPN_Concat2,
    BiFPN_Concat3,
    RFACA,
)

加入

__all__ = (
    "Conv",
    "Conv2",
    "LightConv",
    "RepConv",
    "DWConv",
    "DWConvTranspose2d",
    "ConvTranspose",
    "Focus",
    "GhostConv",
    "ChannelAttention",
    "SpatialAttention",
    "CBAM",
    )

3.tasks.py修改

第一在from ultralytics.nn.models import中添加模块

from ultralytics.nn.modules import (
    AIFI,
    C1,
    C2,
    C3,
    C3TR,
    OBB,
    SPP,
    SPPF,
    Bottleneck,
    BottleneckCSP,
    C2f,
    C2fAttn,
    CBAM,
)

第二在def parse_model函数中激活---添加分支elif m is CBAM:,具体代码如下:

        elif m in {CBAM}:
            c1, c2 = ch[f], args[0]
            if c2 != nc:
               c2 = make_divisible(min(c2,max_channels) * width,8)
            args = [c1, *args[1:]]

到此就把cbam模块添加完毕

4配置文件xx.yaml

样例如下,放入位置记得对应修改通道数和层数,另外就是CBAM注意力机制的卷积核大小为3/7,其他报错,示例如下

# YOLOAir 🚀 by 🥭, GPL-3.0 license
 
# Parameters      
nc: 11  # number of classes     V5M     3(masks face license plate)
 
scales:
  #n: [0.33, 0.25, 1024]
  s: [0.33, 0.50, 1024]
  m: [0.67, 0.75, 768]
  l: [1.00, 1.00, 512]
  x: [1.00, 1.25, 512]

backbone:
  # [from, repeats, module, args]
  - [-1, 1, Conv, [64, 3, 2]] # 0-P1/2
  - [-1, 1, Conv, [128, 3, 2]] # 1-P2/4
  - [-1, 3, C2f, [128, True]]
  - [-1, 1, Conv, [256, 3, 2]] # 3-P3/8
  - [-1, 6, C2f, [256, True]]
  - [-1, 1, Conv, [512, 3, 2]] # 5-P4/16
  - [-1, 6, C2f, [512, True]]
  - [-1, 1, Conv, [768, 3, 2]] # 7-P5/32
  - [-1, 3, C2f, [768, True]]
  - [-1, 1, Conv, [1024, 3, 2]] # 9-P6/64
  - [-1, 3, C2f, [1024, True]]

  - [-1,3,CBAM,[1024,7]]  ##11🥭
  - [-1, 1, SPPF, [1024, 5]] # 11
  
# YOLOv8.0x6 head
head:
  - [-1, 1, nn.Upsample, [None, 2, "nearest"]]
  - [[-1, 8], 1, BiFPN_Concat2, [1]] # cat backbone P5
  - [-1, 3, C2f, [768, False]] # 14

  - [-1, 1, nn.Upsample, [None, 2, "nearest"]]
  - [[-1, 6], 1, BiFPN_Concat2, [1]] # cat backbone P4
  - [-1, 3, C2f, [512, False]] # 17

  - [-1, 1, nn.Upsample, [None, 2, "nearest"]]
  - [[-1, 4], 1, BiFPN_Concat2, [1]] # cat backbone P3
  - [-1, 3, C2f, [256, False]] # 20 (P3/8-small)

  - [-1, 1, Conv, [256, 3, 2]]
  - [[-1, 18], 1, BiFPN_Concat2, [1]] # cat head P4    +1
  - [-1, 3, C2f, [512, False]] # 23 (P4/16-medium)

  - [-1, 1, Conv, [512, 3, 2]]
  - [[-1, 15], 1, BiFPN_Concat2, [1]] # cat head P5     +1
  - [-1, 3, C2f, [768, False]] # 26 (P5/32-large)

  - [-1, 1, Conv, [768, 3, 2]]
  - [[-1, 12], 1,BiFPN_Concat2, [1]] # cat head P6        Concat   +1        ,
  - [-1, 3, C2f, [1024, False]] # 29 (P6/64-xlarge)                   c2模块不适合拟合

  - [[21, 24, 27, 30], 1, Detect, [nc]] # Detect(P3, P4, P5, P6)   +1

运行训练打印出完整模型即成功

增加对应的模块后,之后的层数的layer+1,因此需要适当更改,不然会报concat维度不匹配的错误,如下

RuntimeError: Sizes of tensors must match except in dimension 1. Expected size 16 but got size 32 for tensor number 1 in the list.

  • 5
    点赞
  • 11
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值