SSD损失函数与训练分析

最新推荐文章于 2024-04-11 00:34:32 发布

Gallant Hu

最新推荐文章于 2024-04-11 00:34:32 发布

阅读量1.1k

点赞数 2

分类专栏：计算机视觉

本文链接：https://blog.csdn.net/weixin_42108090/article/details/109274829

版权

计算机视觉专栏收录该内容

43 篇文章 1 订阅

订阅专栏

匹配策略

Matching strategy During training we need to determine which default boxes correspond to a ground truth detection and train the network accordingly. For each ground truth box we are selecting from default boxes that vary over location, aspect ratio, and scale. We begin by matching each ground truth box to the default box with the best jaccard overlap (as in MultiBox [7]). Unlike MultiBox, we then match default boxes to any ground truth with jaccard overlap higher than a threshold (0.5). This simplifies the learning problem, allowing the network to predict high scores for multiple overlapping default boxes rather than requiring it to pick only the one with maximum overlap.

训练目的

The SSD training objective is derived from the MultiBox objective[7,8] but is extended to handle multiple object categories. Let $x_{ij}^p= \{ 1,0\}$ be an indicator for matching the $i$ -th default box to the $j$ -th ground truth box of category p. In the matching strategy above, we can have $\sum_ix_{ij}^p\ge1$ .
The overall objective loss function is a weighted sum of the localization loss (loc) and the confidence loss (conf):
$L(x,c,l,g)=\frac{1}{N}(L_{conf}(x,c)+\alpha L_{loc}(x,l,g))$

SSD 损失函数由两部分组成，一部分是目标框的位置损失，另一部分是类别置信度损失。 $l, g$ 分别为预测框和真实框的位置参数。
where N is the number of matched default boxes. If $N = 0$ , we set the loss to 0. The localization loss is a Smooth L1 loss between the predicted box $(l)$ and the ground truth box $(g)$ parameters. Similar to Faster R-CNN, we regress to offsets for the center $(c x, c y)$ of the default bounding box (d) and for its width (w) and height(h).