Clustering Notes (Default Boxes)

Latest recommended article published on 2022-06-23 14:07:05

hi我是大嘴巴

Article link: https://blog.csdn.net/weixin_38740463/article/details/88928261

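This article discusses the use of cluster analysis in object detection. It focuses on using cluster centers to optimize the default boxes so that their IOU with the ground-truth boxes is maximized, and on adjusting the algorithm to avoid coordinate bias. In practice, k-means clustering can run into this problem in its early stage.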

Summary generated by CSDN using AI technology.

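1) $(x_j,y_j)$ is the center of a box, $(w_j,h_j)$ is its width and height, and N is the total number of annotated boxes. These are the ground truth (gt) boxes.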

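2) For a default box (i.e., a predicted box), we want its IOU with the gt to be as large as possible and the offset between its center and the gt center to be as small as possible (ideally the two coincide). We therefore place both boxes at the same center and take d = 1 - IOU as the distance to a cluster center (d is the distance, not the center itself): $d=1-IOU\left [ (x_j,y_j,w_j,h_j),(x_j,y_j,W_i,H_i) \right ],j\in\{1,2,...,N\},i\in\{1,2,...,k\}$, where $(W_i,H_i)$ is the width and height of the i-th cluster center.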

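3) Finally, each cluster center is updated to the mean width and height of the boxes assigned to it: $W_i^{'}=\frac{1}{N_i}\sum_{j\in C_i} w_{j},H_i^{'}=\frac{1}{N_i}\sum_{j\in C_i} h_{j}$, where $C_i$ denotes the set of gt boxes assigned to cluster i and $N_i$ is the number of boxes in it.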
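
Because both boxes in step 2) share the center $(x_j,y_j)$, the IOU depends only on widths and heights. A short worked form, assuming axis-aligned boxes (the overlap is then the smaller width times the smaller height):

$$IOU=\frac{\min(w_j,W_i)\,\min(h_j,H_i)}{w_j h_j+W_i H_i-\min(w_j,W_i)\,\min(h_j,H_i)}$$

For example, with $w_j=h_j=12$ and $W_i=H_i=10$, the overlap is $100$ and the union is $144+100-100=144$, so $IOU=100/144\approx 0.694$ and $d\approx 0.306$.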

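Key point: in the early stage of k-means clustering, a coordinate bias easily appears, which is why the distance above compares the two boxes at the same center.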
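
Steps 2) and 3) can be sketched as a single clustering iteration. This is a minimal sketch, assuming the boxes are given as an (N, 2) NumPy array of widths and heights. The names `iou_wh` and `kmeans_step` are hypothetical and do not come from the original code. Letting an empty cluster keep its previous center is also an assumption, one possible choice among several.

```python
import numpy as np


def iou_wh(boxes, centroids):
    """IoU between (w, h) pairs, assuming each pair of boxes shares a center.

    boxes: array-like of shape (N, 2); centroids: array-like of shape (k, 2).
    Returns an (N, k) array of IoU values.
    """
    boxes = np.asarray(boxes, dtype=float)
    centroids = np.asarray(centroids, dtype=float)
    # With a shared center, the overlap is min(width) * min(height).
    inter = (np.minimum(boxes[:, None, 0], centroids[None, :, 0])
             * np.minimum(boxes[:, None, 1], centroids[None, :, 1]))
    area_boxes = (boxes[:, 0] * boxes[:, 1])[:, None]
    area_centroids = (centroids[:, 0] * centroids[:, 1])[None, :]
    return inter / (area_boxes + area_centroids - inter)


def kmeans_step(boxes, centroids):
    """One assignment + update step using d = 1 - IoU as the distance."""
    boxes = np.asarray(boxes, dtype=float)
    centroids = np.asarray(centroids, dtype=float)
    dist = 1.0 - iou_wh(boxes, centroids)   # shape (N, k)
    assign = dist.argmin(axis=1)            # nearest center for each box
    new_centroids = centroids.copy()
    for i in range(len(centroids)):
        members = boxes[assign == i]
        if len(members) > 0:  # assumption: an empty cluster keeps its old center
            new_centroids[i] = members.mean(axis=0)
    return assign, new_centroids
```

Repeating `kmeans_step` until `assign` stops changing would give the k default-box sizes.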

from os import listdir
from os.path import isfile, join
import argparse
# import cv2
import numpy as np
import sys
import os
import shutil
import random
import math


def IOU(x, centroids):
    '''
    :param x: the (w, h) of a single ground truth box
    :param centroids: the set of anchor (w, h) pairs [(w,h),(),...], k in total
    :return: list of IoU values between the single ground truth box and all k anchor boxes
    '''
    IoUs = []
    w, h = x  # w, h of the ground truth
    for centroid in centroids:
        c_w, c_h = centroid  # w, h of the anchor
        if c_w >= w and c_h >= h:  # anchor encloses the ground truth
            iou = w * h / (c_w * c_h)
        elif c_w >= w and c_h <= h:  # anchor is wider and shorter
            iou = w * c_h / (w * h + (c_w - w) * c