YOLOv8改进——使用LSKA注意力机制改进SPPF,增强多尺度特征提取能力

YOLOv8改进

第一章 使用LSKA注意力机制改进SPPF,增强多尺度特征提取能力 

一、YOLOv8-SPPF-LSKA模型

1.YOLOv8-SPPF-LSKA模型结构

# Ultralytics YOLO 🚀, AGPL-3.0 license
# YOLOv8 object detection model with P3-P5 outputs. For Usage examples see https://docs.ultralytics.com/tasks/detect

# Parameters
nc: 80  # number of classes
scales: # model compound scaling constants, i.e. 'model=yolov8n.yaml' will call yolov8.yaml with scale 'n'
  # [depth, width, max_channels]
  n: [0.33, 0.25, 1024]  # YOLOv8n summary: 225 layers,  3157200 parameters,  3157184 gradients,   8.9 GFLOPs
  s: [0.33, 0.50, 1024]  # YOLOv8s summary: 225 layers, 11166560 parameters, 11166544 gradients,  28.8 GFLOPs
  m: [0.67, 0.75, 768]   # YOLOv8m summary: 295 layers, 25902640 parameters, 25902624 gradients,  79.3 GFLOPs
  l: [1.00, 1.00, 512]   # YOLOv8l summary: 365 layers, 43691520 parameters, 43691504 gradients, 165.7 GFLOPs
  x: [1.00, 1.25, 512]   # YOLOv8x summary: 365 layers, 68229648 parameters, 68229632 gradients, 258.5 GFLOPs

# YOLOv8.0n backbone
backbone:
  # [from, repeats, module, args]
  - [-1, 1, Conv, [64, 3, 2]]  # 0-P1/2
  - [-1, 1, Conv, [128, 3, 2]]  # 1-P2/4
  - [-1, 3, C2f, [128, True]]
  - [-1, 1, Conv, [256, 3, 2]]  # 3-P3/8
  - [-1, 6, C2f, [256, True]]
  - [-1, 1, Conv, [512, 3, 2]]  # 5-P4/16
  - [-1, 6, C2f, [512, True]]
  - [-1, 1, Conv, [1024, 3, 2]]  # 7-P5/32
  - [-1, 3, C2f, [1024, True]]
  - [-1, 1, SPPF_LSKA, [1024, 5]]  # 9

# YOLOv8.0n head
head:
  - [-1, 1, nn.Upsample, [None, 2, 'nearest']]
  - [[-1, 6], 1, Concat, [1]]  # cat backbone P4
  - [-1, 3, C2f, [512]]  # 12

  - [-1, 1, nn.Upsample, [None, 2, 'nearest']]
  - [[-1, 4], 1, Concat, [1]]  # cat backbone P3
  - [-1, 3, C2f, [256]]  # 15 (P3/8-small)

  - [-1, 1, Conv, [256, 3, 2]]
  - [[-1, 12], 1, Concat, [1]]  # cat head P4
  - [-1, 3, C2f, [512]]  # 18 (P4/16-medium)

  - [-1, 1, Conv, [512, 3, 2]]
  - [[-1, 9], 1, Concat, [1]]  # cat head P5
  - [-1, 3, C2f, [1024]]  # 21 (P5/32-large)

  - [[15, 18, 21], 1, Detect, [nc]]  # Detect(P3, P4, P5)

2.SPPF-LSKA代码

class SPPF_LSKA(nn.Module):
    """Spatial Pyramid Pooling - Fast (SPPF) layer for YOLOv5 by Glenn Jocher."""

    def __init__(self, c1, c2, k=5):  # equivalent to SPP(k=(5, 9, 13))
        super().__init__()
        c_ = c1 // 2  # hidden channels
        self.cv1 = Conv(c1, c_, 1, 1)
        self.cv2 = Conv(c_ * 4, c2, 1, 1)
        self.m = nn.MaxPool2d(kernel_size=k, stride=1, padding=k // 2)
        self.lska = LSKA(c_ * 4, k_size=11)

    def forward(self, x):
        """Forward pass through Ghost Convolution block."""
        x = self.cv1(x)
        y1 = self.m(x)
        y2 = self.m(y1)
        return self.cv2(self.lska(torch.cat((x, y1, y2, self.m(y2)), 1)))

二、添加方式

1.将上文中SPPF-LSKA代码添加到ultralytics/nn/modules/block.py中

2.在block.py的顶部注册模块名.

'SPPF_LSKA'

3.在ultralytics/nn/modules/__init__.py中注册模块名

SPPF_LSKA     'SPPF_LSKA'

4.在ultralytics/nn/tasks.py顶部注册模块名

SPPF_LSKA

 5.在ultralytics/nn/tasks.py中的def parse_model添加

SPPF_LSKA

6.使用上文配置文件开始训练

三、YOLOv8-SPPF-LSKA

  • 25
    点赞
  • 100
    收藏
    觉得还不错? 一键收藏
  • 29
    评论
YOLOv8SPPF是基于YOLOv8SPPF改进版本。SPPF是指空间金字塔池化(Spatial Pyramid Pooling with Filter),它是在YOLOv5中引入的一种特征提取方法。相比于之前的版本,YOLOv8SPPF在网络结构特征提取方面进行了改进。 首先,YOLOv8SPPF采用了PAN-FPN的思想。PAN-FPN是一种特征金字塔网络结构,可以在不同尺度上提取特征,从而提高目标检测的准确性。不过,与YOLOv5相比,YOLOv8SPPF删除了PAN-FPN上采样阶段中的卷积结构,并将C3模块替换为了C2f模块。 此外,YOLOv8SPPF还引入了其他的改进点。具体来说,它使用了ASPP(Atrous Spatial Pyramid Pooling)和RFB(Receptive Field Block)等模块来增强特征提取能力。同时,它采用了SPPCSPC 1.3 PAN-FPN等相关的改进方法,以进一步提高检测性能。 总结而言,YOLOv8SPPF是基于YOLOv8SPPF改进版本,通过引入PAN-FPN思想和其他的改进方法,提高了目标检测的准确性和特征提取能力。<span class="em">1</span><span class="em">2</span><span class="em">3</span> #### 引用[.reference_title] - *1* *2* [【目标检测】YOLOv8学习记录](https://blog.csdn.net/qq_43471945/article/details/129009099)[target="_blank" data-report-click={"spm":"1018.2226.3001.9630","extra":{"utm_source":"vip_chatgpt_common_search_pc_result","utm_medium":"distribute.pc_search_result.none-task-cask-2~all~insert_cask~default-1-null.142^v93^chatsearchT3_1"}}] [.reference_item style="max-width: 50%"] - *3* [YOLOv8 从环境搭建到推理训练](https://blog.csdn.net/weixin_61988885/article/details/129421538)[target="_blank" data-report-click={"spm":"1018.2226.3001.9630","extra":{"utm_source":"vip_chatgpt_common_search_pc_result","utm_medium":"distribute.pc_search_result.none-task-cask-2~all~insert_cask~default-1-null.142^v93^chatsearchT3_1"}}] [.reference_item style="max-width: 50%"] [ .reference_list ]
评论 29
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值