RecentNotes待进一步整理

NeXT_Vision

已于 2022-05-06 16:51:36 修改

阅读量320

点赞数

分类专栏：知识体系文章标签：机器学习深度学习

于 2020-08-26 09:55:37 首次发布

本文链接：https://blog.csdn.net/next_voyager/article/details/108234020

版权

知识体系专栏收录该内容

4 篇文章 0 订阅

订阅专栏

文章目录

evaluation metrics for DL tasks

*** CS230 Section 7 (Week 7): Advanced Evaluation Metrics

Warmup: Classification and the F1 Score
- Accuracy
- Confusion Matrix
- Precision, Recall, and the F1 Score
Object Detection: IoU, AP, and mAP
- Intersection over Union (IoU)
- Average Precision (AP): the Area Under Curve (AUC)
- Mean Average Precision (mAP)
Evaluation Metrics for NLP Tasks
Evaluations Metrics for GANs

Deep Neural Networks for Regression Problems

Deep Neural Networks for Regression Problems | by Mohammed AL-Ma’amari | Towards Data Science 20180929
Neural Networks for Regression (Part 1)—Overkill or Opportunity? - MissingLink.ai

anchor的尺寸计算

if we have pooled our image from 800 px to 50px, the sub_sample equals 16; the sub_sample corresponding to {C1, C2, C3, C4, C5} will be {2, 4, 8, 16, 32}; the sub_sample corresponding to {P2, P3, P4, P5, P6} will be {4, 8, 16, 32, 64};
由
$\times w = (base\_size \times scale)^2, \frac{h}{w} = ratio$
得：
$base\_size \times scale \times \sqrt{ratio}$
$base\_size \times scale \times \frac{1}{\sqrt{ratio}}$

strides=[4, 8, 16, 32, 64], # The strides of anchors in multiple feature levels. This is consistent with the FPN feature strides. The strides will be taken as base_sizes if base_sizes is not set.
ratios=[0.5, 1.0, 2.0], # The ratio between height and width.
scales=[8], # Basic scale of the anchor in a single level, the area of the anchor in one position of a feature map will be scale * base_sizes.
base_sizes (list[int] | None): The basic sizes of anchors in multiple levels. If None is given, strides will be used as base_sizes. (If strides are non square, the shortest stride is taken.)

anchor_generator=dict(
	type='AnchorGenerator',
	scales=[8],
	ratios=[0.5, 1.0, 2.0],
	strides=[4, 8, 16, 32, 64]),

损失函数

Focal Loss

RetinaNet中是使用Binary Cross Entropy Function来处理the multi-class case;
RetinaNet论文中P3页脚注提到"1Extending the focal loss to the multi-class case is straightforward and works well; for simplicity we focus on the binary loss in this work."

mmdet/models/losses/focal_loss.py

# This method is only for debugging
def py_sigmoid_focal_loss(pred,
                          target,
                          weight=None,
                          gamma=2.0,
                          alpha=0.25,
                          reduction='mean',
                          avg_factor=None):
    """PyTorch version of `Focal Loss <https://arxiv.org/abs/1708.02002>`_.

    Args:
        pred (torch.Tensor): The prediction with shape (N, C), C is the
            number of classes
        target (torch.Tensor): The learning label of the prediction.
        weight (torch.Tensor, optional): Sample-wise loss weight.
        gamma (float, optional): The gamma for calculating the modulating
            factor. Defaults to 2.0.
        alpha (float, optional): A balanced form for Focal Loss.
            Defaults to 0.25.
        reduction (str, optional): The method used to reduce the loss into
            a scalar. Defaults to 'mean'.
        avg_factor (int, optional): Average factor that is used to average
            the loss. Defaults to None.
    """
    pred_sigmoid = pred.sigmoid()
    target = target.type_as(pred)
    pt = (1 - pred_sigmoid) * target + pred_sigmoid * (1 - target)
    focal_weight = (alpha * target + (1 - alpha) *
                    (1 - target)) * pt.pow(gamma)
    loss = F.binary_cross_entropy_with_logits(
        pred, target, reduction='none') * focal_weight
    loss = weight_reduce_loss(loss, weight, reduction, avg_factor)
    return loss

卷积操作的输入输出尺寸计算

nn.Conv2d的尺寸计算
$H_{out} = \lfloor \frac{H_{in} + 2 \times padding[0] - dilation[0] \times (kernel\_size[0] - 1) }{stride[0]} + 1 \rfloor$
其中, dilation[0]是卷积核之间的间距(默认为1);

IoU w.r.t. [t,b,l,r] of pred的导数

$IoU=\frac{G \bigcap P}{G \bigcup P}=\frac{G \bigcap P}{G + P - G \bigcap P}=\frac{I}{U}$
$\frac{\partial IoU}{\partial p}=\frac{\partial I/\partial p \cdot U - \partial U/\partial p \cdot I}{U^2}, p\in[t,b,l,r]$
$\frac{\partial U}{\partial p}=\frac{\partial (G + P - I)}{\partial p}=\partial P/\partial p - \partial I/\partial p, p\in[t,b,l,r]$
$I b o x$ 指的是G与P的交集box,
$\frac{\partial I}{\partial t} =\frac{\partial ((Ibox.r-Ibox.l) \cdot (Ibox.b-Ibox.t))}{\partial t}= \begin{cases} (Ibox.r-Ibox.l), & \text{if $pred.t$ > $gt.t$} \\ 0, & \text{else} \end{cases}$
$\frac{\partial I}{\partial b} =\frac{\partial ((Ibox.r-Ibox.l) \cdot (Ibox.b-Ibox.t))}{\partial t}= \begin{cases} (Ibox.r-Ibox.l), & \text{if $pred.b$ < $gt.b$} \\ 0, & \text{else} \end{cases}$
$\frac{\partial I}{\partial l} =\frac{\partial ((Ibox.r-Ibox.l) \cdot (Ibox.b-Ibox.t))}{\partial t}= \begin{cases} - (Ibox.b-Ibox.t), & \text{if $pred.l$ > $gt.l$} \\ 0, & \text{else} \end{cases}$
$\frac{\partial I}{\partial r} =\frac{d((Ibox.r-Ibox.l) \cdot (Ibox.b-Ibox.t))}{\partial t}= \begin{cases} (Ibox.b-Ibox.t), & \text{if $pred.r$ < $gt.r$} \\ 0, & \text{else} \end{cases}$

DIoU w.r.t. [x,y,w,h] of pred的导数

$\frac{\rho ^2(G, P)}{c^2}$
$\frac{\partial DIoU}{\partial x}=\frac{\partial IoU}{\partial x} - \frac{2\rho}{c}\cdot \frac{(\partial \rho/\partial x \cdot c - \partial c/\partial x \cdot \rho)}{c^2}, x\in[ctr_x,ctr_y,w,h]$
$\leftarrow l - \eta \cdot \nabla l = l - \eta \cdot \frac{\partial IoU}{\partial l}$
$\leftarrow r - \eta \cdot \nabla r = r - \eta \cdot \frac{\partial IoU}{\partial r}$
由于 $x=\frac{(l + r)}{2}$ ，故得，
$\leftarrow x - \eta \cdot \nabla x = x - \eta \cdot \frac{(\nabla l + \nabla r)}{2}$

例：已知 $z = f (l, r, t, b) = g (x, y, w, h)$ , $\frac{\partial z}{\partial l}=L$ , $\frac{\partial z}{\partial r}=R$ , $x=\frac{l+r}{2}$ , 求 $\frac{\partial z}{\partial x}=?$
此处，z与l,r之间的关系以及 z与x之间的关系是分别各由两个函数 $f(\cdot)$ 和 $g(\cdot)$ 确定的，因此不符合复合函数求导的链式法则，也就不存在以下关系： $\frac{\partial z}{\partial l}=\frac{\partial z}{\partial x}\cdot \frac{\partial x}{\partial l}, \frac{\partial z}{\partial r}=\frac{\partial z}{\partial x}\cdot \frac{\partial x}{\partial r}$ 。