YOLO_v7 Explained

江南綿雨

Last modified: 2022-09-27 11:51:17

First published: 2022-09-26 18:31:08

Article link: https://blog.csdn.net/weixin_43702653/article/details/127056539

Contents

1: Overview of Key Features
2: The ELAN Module
3: MPConv Hybrid Convolution
4: SPPCSPC
5: ImpConv Implicit Knowledge Learning
6: Fine-to-Coarse Auxiliary Loss

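1: Overview of Key Features

Like v6, YOLO_v7 makes extensive changes to the network structure, and it borrows heavily from earlier work. For example, it reuses the RepConv re-parameterized convolution seen in YOLO_v6, and its loss function is the same as YOLOv5's: both add targets incrementally from the cells above, below, left, and right of the cell that contains the target center. There are still many details that differ, including the following: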

The ELAN module
MPConv hybrid convolution
SPPCSPC
ImpConv implicit knowledge learning
Fine-to-Coarse Auxiliary Loss
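
The neighboring-cell target selection mentioned in the overview can be sketched in plain Python. This is a simplified, per-target illustration of the YOLOv5-style rule, not the vectorized code from either repository. The function name `candidate_cells` is hypothetical, and the boundary handling is an assumption made to keep the example short.

```python
def candidate_cells(gx, gy, grid_w, grid_h, g=0.5):
    """Return the grid cells that receive a target centred at (gx, gy).

    Coordinates are in grid units. The cell containing the centre is always
    used; up to two neighbours are added on the sides the centre is closest to.
    This is an illustrative simplification, not the repository code.
    """
    cx, cy = int(gx), int(gy)
    fx, fy = gx - cx, gy - cy  # position of the centre inside its own cell
    cells = [(cx, cy)]
    if fx < g and cx > 0:
        cells.append((cx - 1, cy))  # centre in the left half: add the left cell
    elif fx > g and cx < grid_w - 1:
        cells.append((cx + 1, cy))  # centre in the right half: add the right cell
    if fy < g and cy > 0:
        cells.append((cx, cy - 1))  # centre in the upper half: add the cell above
    elif fy > g and cy < grid_h - 1:
        cells.append((cx, cy + 1))  # centre in the lower half: add the cell below
    return cells


print(candidate_cells(3.2, 5.7, grid_w=20, grid_h=20))
```

Each target therefore contributes up to three positive cells instead of one, which is the "incremental" selection referred to above.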

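The sections below walk through these details one by one, and together they cover the whole network. First, here is an architecture diagram drawn by an expert and taken from that author's blog post, with thanks:
[Figure: YOLO_v7 network architecture diagram]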

2: The ELAN Module

[Figure: ELAN module structure]

The corresponding configuration in yolov7.yaml:

   [-1, 1, Conv, [64, 1, 1]],
   [-2, 1, Conv, [64, 1, 1]],
   [-1, 1, Conv, [64, 3, 1]],
   [-1, 1, Conv, [64, 3, 1]],
   [-1, 1, Conv, [64, 3, 1]],
   [-1, 1, Conv, [64, 3, 1]],
   [[-1, -3, -5, -6], 1, Concat, [1]],
   [-1, 1, Conv, [256, 1, 1]],  # 11
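
To read the configuration above, it helps to resolve the relative "from" indices by hand. The following is a minimal sketch of that bookkeeping. The helper `trace` is hypothetical (it is not the repository's model parser), and the 128 input channels are an assumption about the layer that feeds this block.

```python
# The ELAN rows from the listing above: (from, repeats, module, args)
ELAN_ROWS = [
    (-1, 1, "Conv", [64, 1, 1]),
    (-2, 1, "Conv", [64, 1, 1]),
    (-1, 1, "Conv", [64, 3, 1]),
    (-1, 1, "Conv", [64, 3, 1]),
    (-1, 1, "Conv", [64, 3, 1]),
    (-1, 1, "Conv", [64, 3, 1]),
    ([-1, -3, -5, -6], 1, "Concat", [1]),
    (-1, 1, "Conv", [256, 1, 1]),
]


def trace(rows, in_channels):
    """Resolve relative 'from' indices and track output channels per layer.

    Index 0 stands for the block input; row i becomes local layer i + 1.
    """
    channels = [in_channels]
    sources = []
    for src, _repeats, module, args in rows:
        cur = len(channels)  # local index this row will receive
        src_list = src if isinstance(src, list) else [src]
        abs_src = [cur + s for s in src_list]  # offsets are relative to this row
        if module == "Concat":
            out = sum(channels[i] for i in abs_src)  # channels add up
        else:
            out = args[0]  # first Conv argument is the output channel count
        sources.append(abs_src)
        channels.append(out)
    return sources, channels


sources, channels = trace(ELAN_ROWS, in_channels=128)  # 128 is an assumption
for row, src, ch in zip(ELAN_ROWS, sources, channels[1:]):
    print(row[2], "reads from", src, "and outputs", ch, "channels")
```

The first two convolutions both read the block input (index 0), so the block has two parallel branches. The Concat gathers four 64-channel tensors (the two branch heads plus the outputs after every second 3x3 convolution), giving 256 channels before the final 1x1 convolution.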

3: MPConv Hybrid Convolution

[Figure: MPConv structure]
The corresponding configuration in yolov7.yaml:

   [-1, 1, Conv, [256, 1, 1]],  # 11

   # MPConv
   [-1, 1, MP, []],
   [-1, 1, Conv, [128, 1, 1]],
   [-3, 1, Conv, [128, 1, 1]],
   [-1, 1, Conv, [128, 3, 2]],
   [[-1, -3], 1, Concat, [1]],  # 16-P3/8
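
The two downsampling paths in this fragment can be checked with simple shape arithmetic. The sketch below assumes that MP is a 2x2 max pool with stride 2 and that Conv pads by half the kernel size; both are assumptions not shown in the listing, and the helper names `out_size` and `mpconv_shapes` are made up for this example.

```python
def out_size(size, kernel, stride, padding):
    """Spatial output size of a conv or pooling layer (dilation 1, floor rounding)."""
    return (size + 2 * padding - kernel) // stride + 1


def mpconv_shapes(size):
    """Trace (channels, size) through both MPConv branches and their concat."""
    # Pooling branch: MP (assumed 2x2 max pool, stride 2), then Conv [128, 1, 1].
    pooled = out_size(size, kernel=2, stride=2, padding=0)
    pool_branch = (128, out_size(pooled, kernel=1, stride=1, padding=0))
    # Convolution branch: Conv [128, 1, 1], then Conv [128, 3, 2] (assumed padding 1).
    reduced = out_size(size, kernel=1, stride=1, padding=0)
    conv_branch = (128, out_size(reduced, kernel=3, stride=2, padding=1))
    # Concat along the channel axis; the spatial sizes must agree for this to be valid.
    concat = (pool_branch[0] + conv_branch[0], pool_branch[1])
    return pool_branch, conv_branch, concat


pool_branch, conv_branch, concat = mpconv_shapes(160)
print(pool_branch, conv_branch, concat)  # (128, 80) (128, 80) (256, 80)
```

With a 160x160 input feature map, both branches halve the resolution to 80x80 and the concat restores 256 channels, which is consistent with the P3/8 comment in the listing. Under this arithmetic an odd input size such as 161 would give 80 for the pooling branch and 81 for the convolution branch, so even feature-map sizes are needed for the concat to line up.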

4: SPPCSPC

[Figure: SPPCSPC structure]

The corresponding code in common.py:

class SPPCSPC(nn.Module):
    # CSP https://github.com/WongKinYiu/CrossStagePartialNetworks
    def __init__(self, c1, c2, n=1, shortcut=False, g=1, e=0.5, k=(5, 9, 13)):
        super(SPPCSPC, self).__init__()
        c_ = int(2 * c2 * e)  # hidden channels
        self.cv1 = Conv(c1, c_, 1, 1)
        self.cv2 = Conv(c1, c_, 1, 1)
        self.cv3 = Conv(c_, c_, 3, 1)
        self.cv4 = Conv(c_, c_, 1, 1)
        self.m = nn.ModuleList([nn.MaxPool2d(kernel_size=x, stride=1, padding=x // 2) for x in k])
        self.cv5 = Conv(4 * c_, c_, 1, 1)
        self.cv6 = Conv(c_, c_, 3, 1)
        self.cv7 = Conv(2 * c_, c2, 1, 1)

    def forward(self, x):
        x1 = self.cv4(self.cv3(self.cv1(x)))
        y1 = self.cv6(self.cv5(torch.cat([x1] + [m(x1) for m in self.m], 1)))
        y2 = self.cv2(x)
        return self.cv7(torch.cat((y1, y2), dim=1))
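
The pooling step inside `forward` is the part that is easiest to misread, so here is a small standalone sketch of it. It isolates only the three max-pool layers and the concatenation; the tensor `x1` is a random stand-in for the output of `cv4`, and its size (8 channels, 20x20) is an arbitrary assumption.

```python
import torch
import torch.nn as nn

# Same pooling setup as SPPCSPC.m: stride 1 with padding k // 2 keeps H and W unchanged.
kernel_sizes = (5, 9, 13)
pools = nn.ModuleList(
    [nn.MaxPool2d(kernel_size=k, stride=1, padding=k // 2) for k in kernel_sizes]
)

x1 = torch.randn(1, 8, 20, 20)  # stand-in for the output of cv4
pooled = [m(x1) for m in pools]  # three views with growing receptive fields
stacked = torch.cat([x1] + pooled, dim=1)  # unpooled input plus three pooled copies

print([tuple(p.shape) for p in pooled])
print(tuple(stacked.shape))
```

Because every pooled tensor keeps the input's spatial size, the four tensors can be concatenated along the channel axis, which is why `cv5` takes `4 * c_` input channels.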

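5: ImpConv Implicit Knowledge Learning

[Figure: implicit knowledge learning]

This part is inherited directly from the explicit and implicit knowledge learning in YOLOR. Conventionally, the shallow features of a neural network are called explicit knowledge and the deep features are called implicit knowledge. The authors of YOLOR (who are also the authors of YOLOv7) instead define explicit knowledge as what the network ultimately obtains from observation, and implicit knowledge as what cannot be observed and is independent of the observation. In other words, implicit knowledge cannot be learned from the input; it is initialized first and then updated through training.

The file models/common.py defines two kinds of implicit knowledge, ImplicitA and ImplicitM, which are added to and multiplied with their input, respectively: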

IDetect

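def forward(self, x):
    # Simplified view of one IDetect output branch (attribute names are illustrative)
    x = self.conv_1x1(self.implicitA(x))  # add implicit knowledge, then apply the 1x1 conv
    x = self.implicitM(x)  # scale the result by implicit knowledge
    return x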

ImplicitA

class ImplicitA(nn.Module):
    def __init__(self, channel, mean=0., std=.02):
        super(ImplicitA, self).__init__()
        self.channel = channel
        self.mean = mean
        self.std = std
        self.implicit = nn.Parameter(torch.zeros(1, channel, 1, 1))
        nn.init.normal_(self.implicit, mean=self.mean, std=self.std)

    def forward(self, x):
        return self.implicit + x

ImplicitM

class ImplicitM(nn.Module):
    def __init__(self, channel, mean=1., std=.02):  # multiplicative factor starts near 1
        super(ImplicitM, self).__init__()
        self.channel = channel
        self.mean = mean
        self.std = std
        self.implicit = nn.Parameter(torch.ones(1, channel, 1, 1))
        nn.init.normal_(self.implicit, mean=self.mean, std=self.std)

    def forward(self, x):
        return self.implicit * x
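
Since ImplicitA is added before the 1x1 convolution in the IDetect forward pass and ImplicitM is multiplied after it, the combined effect at one pixel can be written out as a short derivation. The symbols below are introduced here for illustration only: $x$ is the input feature vector, $a$ the ImplicitA parameter, $W$ and $b$ the weight and bias of the 1x1 convolution, and $m$ the ImplicitM parameter. The derivation assumes a plain 1x1 convolution with bias and no activation in between.

```latex
% Symbols W, b, a, m, x are assumptions introduced for this sketch.
\begin{aligned}
y &= m \odot \bigl( W (x + a) + b \bigr) \\
  &= \underbrace{\operatorname{diag}(m)\, W}_{W'}\, x
     \;+\; \underbrace{m \odot (W a + b)}_{b'}
\end{aligned}
```

Under these assumptions, the additive term only shifts the bias and the multiplicative term only rescales the output channels, so after training both could in principle be folded into a single 1x1 convolution with weight $W'$ and bias $b'$.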