[Paper Reading] Non-Maximum Suppression

TwT520Ly

Article link: https://blog.csdn.net/TwT520Ly/article/details/79485182


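1. Non-Maximum Suppression
Non-maximum suppression (NMS) suppresses every element that is not a maximum, which amounts to searching for local maxima. YOLO and other object detection algorithms generate many candidate boxes, and each candidate box is passed to a classifier to obtain a confidence score. Because neighboring candidate boxes (for example, those produced as a window slides across the image) overlap heavily, they have to be filtered so that each object ends up with a single box.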
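
To make "overlap" concrete: a common measure is intersection over union (IoU), the intersection area divided by the union area of two boxes. The sketch below is a minimal illustration, assuming boxes are given as (x1, y1, x2, y2) with inclusive pixel coordinates (hence the `+ 1`, matching the convention used in the code of section 3). The function name `iou` is introduced here for illustration only.

```python
def iou(box_a, box_b):
    """IoU of two boxes given as (x1, y1, x2, y2), inclusive pixel coordinates."""
    # Corners of the intersection rectangle
    ix1 = max(box_a[0], box_b[0])
    iy1 = max(box_a[1], box_b[1])
    ix2 = min(box_a[2], box_b[2])
    iy2 = min(box_a[3], box_b[3])

    # Width and height are clamped to 0 when the boxes do not intersect
    iw = max(0.0, ix2 - ix1 + 1)
    ih = max(0.0, iy2 - iy1 + 1)
    inter = iw * ih

    area_a = (box_a[2] - box_a[0] + 1) * (box_a[3] - box_a[1] + 1)
    area_b = (box_b[2] - box_b[0] + 1) * (box_b[3] - box_b[1] + 1)
    return inter / (area_a + area_b - inter)


print(iou((0, 0, 9, 9), (0, 0, 9, 9)))      # identical boxes: 1.0
print(iou((0, 0, 9, 9), (5, 0, 14, 9)))     # half overlap: 50 / 150, about 0.333
print(iou((0, 0, 9, 9), (20, 20, 29, 29)))  # disjoint boxes: 0.0
```

An IoU of 1.0 means the boxes coincide and 0.0 means they do not touch, so a threshold somewhere in between decides when two boxes count as the same detection.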

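2. Algorithm Flow
(1) Take the bounding box (bbox) with the highest confidence as the target, and compute the overlap between it and each remaining bbox. The code in section 3 measures overlap as IoU (intersection area divided by union area) rather than as the raw intersection area.
(2) If the overlap is greater than the chosen threshold, remove that bbox from the remaining set (the two candidate boxes contain largely the same information, so only the one with the higher confidence is kept).
(3) Once all comparisons are done, take the bbox with the highest confidence among those still remaining as the new target, and repeat the process until no boxes are left.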
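
The three steps can be sketched as a plain Python loop before looking at a vectorized version. This is a minimal sketch, assuming each box is a tuple (x1, y1, x2, y2, score) with inclusive pixel coordinates and that overlap is measured as IoU. The names `box_iou` and `greedy_nms` are hypothetical helpers introduced for this illustration.

```python
def box_iou(a, b):
    """IoU of two boxes (x1, y1, x2, y2, ...), inclusive pixel coordinates."""
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]) + 1)
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]) + 1)
    inter = iw * ih
    area_a = (a[2] - a[0] + 1) * (a[3] - a[1] + 1)
    area_b = (b[2] - b[0] + 1) * (b[3] - b[1] + 1)
    return inter / (area_a + area_b - inter)


def greedy_nms(boxes, thresh):
    """Return the indices of the boxes that survive NMS, best score first."""
    # Indices sorted by confidence, highest first
    remaining = sorted(range(len(boxes)), key=lambda k: boxes[k][4], reverse=True)
    keep = []
    while remaining:
        # Step (1): the most confident remaining box becomes the target
        best = remaining.pop(0)
        keep.append(best)
        # Step (2): drop every box that overlaps the target too much
        remaining = [k for k in remaining
                     if box_iou(boxes[best], boxes[k]) < thresh]
        # Step (3): the loop repeats with the next most confident survivor
    return keep


boxes = [
    (0, 0, 9, 9, 0.9),      # target
    (1, 1, 10, 10, 0.8),    # IoU with box 0 is 81 / 119, so it is suppressed
    (50, 50, 59, 59, 0.7),  # far away from box 0, so it is kept
]
print(greedy_nms(boxes, 0.3))  # [0, 2]
```

The loop is quadratic in the number of boxes in the worst case, which is usually acceptable because detectors tend to prune low-confidence boxes before NMS.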

3. Code Implementation

import numpy as np

bboxs = np.array([
    [204, 102, 358, 250, 0.5],
    [257, 118, 380, 250, 0.7],
    [280, 135, 400, 250, 0.6],
    [255, 118, 360, 235, 0.7]
])

thresh = 0.3


def nms(bboxs, thresh):
    x1 = bboxs[:, 0]
    y1 = bboxs[:, 1]
    x2 = bboxs[:, 2]
    y2 = bboxs[:, 3]
    scores = bboxs[:, 4]
    areas = (x2 - x1 + 1) * (y2 - y1 + 1)
    order = scores.argsort()[::-1]

    keep = []
    while order.size > 0:
        i = order[0]
        keep.append(i)

        # Compute the intersection corners (x1, y1, x2, y2) against all remaining boxes at once
        xx1 = np.maximum(x1[i], x1[order[1:]])
        yy1 = np.maximum(y1[i], y1[order[1:]])
        xx2 = np.minimum(x2[i], x2[order[1:]])
        yy2 = np.minimum(y2[i], y2[order[1:]])

        # The boxes may not intersect at all, so clamp width and height to 0
        w = np.maximum(0.0, xx2 - xx1 + 1)
        h = np.maximum(0.0, yy2 - yy1 + 1)
        inter = w * h

        ovr = inter / (areas[i] + areas[order[1:]] - inter)
        ids = np.where(ovr < thresh)[0]
        # ovr is indexed relative to order[1:], so shift by one to index into order
        # (position 0 of order is the target box itself)
        order = order[ids + 1]
    return keep


if __name__ == '__main__':
    keep = nms(bboxs, thresh)
    print(keep)
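
The index shift `ids + 1` in the loop is easy to get wrong, so here is a small isolated illustration. The `ovr` values below are made up for the example and are not computed from the boxes in the listing.

```python
import numpy as np

order = np.array([3, 1, 2, 0])   # box indices sorted by score, best first
ovr = np.array([0.8, 0.1, 0.5])  # hypothetical IoU of order[0] against order[1:]
thresh = 0.3

# np.where returns positions inside ovr, i.e. positions relative to order[1:]
ids = np.where(ovr < thresh)[0]
print(ids)             # [1]

# Adding 1 converts those positions back into positions inside order
print(order[ids + 1])  # [2]
```

Only the entry with IoU 0.1 is below the threshold. It sits at position 1 of `ovr`, which corresponds to position 2 of `order`, so box 2 is the only one carried into the next iteration.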