目标检测 - IoU和GIoU作为边框回归的损失和代码实现

最新推荐文章于 2024-01-11 20:46:08 发布

西笑生

最新推荐文章于 2024-01-11 20:46:08 发布

阅读量982

点赞数 2

分类专栏：目标检测文章标签： IoU GIoU General-IOU bounding Box giou loss

本文链接：https://blog.csdn.net/flyfish1986/article/details/110005818

版权

目标检测专栏收录该内容

60 篇文章 118 订阅

订阅专栏

本文介绍了IoU（Intersection over Union）和GIoU（Generalized Intersection over Union）在目标检测任务中作为边框回归损失的原理，包括计算方法和代码实现。GIoU通过最小封闭框的概念改进了IoU，提供了更精确的评估。作者还提供了不同类型的IoU Loss代码示例，如UnitBox的IoU和GIoU版本。

摘要由CSDN通过智能技术生成

目标检测 - IoU和GIoU作为边框回归的损失和代码实现

flyfish

GIoU
=General-IOU
=Generalized Intersection over Union

论文《Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression》

IoU和GIoU作为边框回归的损失

GIoU as Loss for Bounding Box Regression
算法过程如下

输入: 预测的边框 $B^p$ 和 GT边框 $B^g$ 的坐标
$B^p = (x^p_1,y^p_1,x^p_2,y^p_2)$ , $B^g = (x^g_1,y^g_1,x^g_2,y^g_2)$
输出: $\mathcal{L}_{IoU}$ , $\mathcal{L}_{GIoU}$

对于预测的边框 $B^p$ , 确保 $x^p_2>x^p_1$ ， $y^p_2>y^p_1$ :
$\hat{x}^p_1 = \min(x^p_1,x^p_2)$ ,
$\hat{x}^p_2 = \max(x^p_1,x^p_2)$ ,
$\hat{y}^p_1 = \min(y^p_1,y^p_2)$ ,
$\hat{y}^p_2 = \max(y^p_1,y^p_2)$
计算 $B^g$ 的面积: $A^g = (x^g_2 - x^g_1)\times(y^g_2 - y^g_1)$
计算 $B^p$ 的面积: $A^p = (\hat{x}^p_2 - \hat{x}^p_1)\times(\hat{y}^p_2 - \hat{y}^p_1)$
在 $B^p$ 和 $B^g$ 之间计算交集 $\mathcal{I}$ :
$x^{\mathcal{I}}_1 = \max(\hat{x}^p_1,x^g_1)$ ,
$x^{\mathcal{I}}_2 = \min(\hat{x}^p_2,x^g_2)$ ,
$y^{\mathcal{I}}_1 = \max(\hat{y}^p_1,y^g_1)$ ,
$y^{\mathcal{I}}_2 = \min(\hat{y}^p_2,y^g_2)$ ,
$\mathcal{I} = \begin{cases} (x^{\mathcal{I}}_2 - x^{\mathcal{I}}_1) \times (y^{\mathcal{I}}_2 - y^{\mathcal{I}}_1) & \text{if} \quad x^{\mathcal{I}}_2 > x^{\mathcal{I}}_1, y^{\mathcal{I}}_2 > y^{\mathcal{I}}_1, \\ 0 & \text{otherwise} \end{cases}$
寻找最小封闭框（smallest enclosing box，可看下图更清楚）的坐标 $B^c$ :
$x^{c}_1 = \min(\hat{x}^p_1,x^g_1)$ ,
$x^{c}_2 = \max(\hat{x}^p_2,x^g_2)$ ,
$y^{c}_1 = \min(\hat{y}^p_1,y^g_1)$ ,
$y^{c}_2 = \max(\hat{y}^p_2,y^g_2)$
计算 $B^c$ 的面积: $A^c = (x^c_2 - x^c_1)\times(y^c_2 - y^c_1)$
$\displaystyle IoU = \frac{\mathcal{I}}{\mathcal{U}}$ , where $\mathcal{U} = A^p+A^g-\mathcal{I}$
$\displaystyle GIoU = IoU - \frac{A^c-\mathcal{U}}{A^c}$
$\mathcal{L}_{IoU} = 1 - IoU$ , $\mathcal{L}_{GIoU} = 1 - GIoU$

用图说明的更清楚
IoU=Jaccard Index
IoU
最小的封闭框是如何计算的
在这里插入图片描述

在 $B^p$ 和 $B^g$ 之间计算黄色的交集 $\mathcal{I}$ ，绿色边框表示最小的封闭框 $B^c$ 。
最小封闭框=C
在这里插入图片描述

代码实现

关于IoU Loss

根据论文UnitBox和论文GIoU对与IoU Loss处理不同的方法
UnitBox的是-ln(IoU) ln是以e为底的对数
在这里插入图片描述
图上的坐标tblr的表示方式是这里的第三种Center-Size coordinates
$y_{i} = \log_{e} (x_{i})$
输出是 $x_{i}$ 输出是 $y_{i}$

GIoU里的是1-IoU
所以代码实现的时候，可以同时实现三个Loss
参考
UnitBox: An Advanced Object Detection Network

代码中的坐标表示方法采用了这里中的第一种boundary coordinates (x_min, y_min, x_max, y_max)
0,1,2,3下标可表示left，top，right，bottom
x2>x1,y2>y1

import torch
import torch.nn as nn
class IoULoss(nn.Module):
    """
    Intersetion Over Union (IoU) loss 支持三种不同的loss计算方法:
    * IoU(UnitBox paper)
    * Linear IoU(GIoU paper)
    * gIoU
    * 类型支持：iou,linear_iou,giou
    """
    def __init__(self, loc_loss_type='giou'):
        super(IoULoss, self).__init__()
        self.loc_loss_type = loc_loss_type

    def forward(self, pred, gt, weight=None):
        """
        Args:
            pred: Nx4 predicted bounding boxes, Each row is (x1, y1, x2, y2).
            gt: Nx4 gt bounding boxes
        """
        pred_x1 = pred[:, 0]
        print(pred_x1)
        pred_y1 = pred[:, 1]
        pred_x2 = pred[:, 2]
        pred_y2 = pred[:, 3]

        gt_x1 = gt[:, 0]
        gt_y1 = gt[:, 1]
        gt_x2 = gt[:, 2]
        gt_y2 = gt[:, 3]
        #如果再严谨些,代码确保x2>x1,y2>y1，下标0和下标2，谁小谁是x1

        gt_aera = (gt_x1 + gt_x2) * (gt_y1 + gt_y2) #对应算法第2步
        pred_aera = (pred_x1 + pred_x2) * (pred_y1 + pred_y2)#对应算法第3步

        I_x1 = torch.max(pred_x1, gt_x1)
        I_x2 = torch.min(pred_x2, gt_x2)
        I_y1 = torch.max(pred_y1, gt_y1)
        I_y2 = torch.min(pred_x2, gt_x2)
        area_intersect=(I_x2 - I_x1)*(I_y2-I_y1)#交集 对应算法第4步

        C_x1 = torch.min(pred_x1, gt_x1)
        C_x2 = torch.max(pred_x2, gt_x2)
        C_y1 = torch.min(pred_y1, gt_y1)
        C_y2 = torch.max(pred_x2, gt_x2)
        ac =(C_x2 - C_x1) * (C_y2 - C_y1)#最小封闭框 #对应算法第5步

        U = gt_aera + pred_aera - area_intersect#并集

        ious = (area_intersect ) / (U.clamp(min=1e-10))#分母不为0
        gious = ious - (ac - U) / ac.clamp(min=1e-10)
        if self.loc_loss_type == 'iou':
            losses = -torch.log(ious)
        elif self.loc_loss_type == 'linear_iou':
            losses = 1 - ious
        elif self.loc_loss_type == 'giou':
            losses = 1 - gious
        else:
            raise NotImplementedError

        if weight is not None:
            return (losses * weight).sum()
        else:
            return losses.sum()