[Deep Learning] Understanding Hard Negative Mining (with a walkthrough of the key passages from the papers)

旅途中的宽~

Article link: https://blog.csdn.net/wzk4869/article/details/127348343

The approach comes from the paper "Rich feature hierarchies for accurate object detection and semantic segmentation" (R-CNN), the founding work of two-stage object detection.

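An image contains only a few ground-truth boxes, so positive samples are scarce and the positive and negative samples can easily become imbalanced. Hard negative mining is used to help training under these conditions.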

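The name says it all. "Negative" means negative samples; "hard" means difficult ones. Hard negatives are the negative samples that get a large loss when classified (the prediction is far from the label), in other words the negatives that the model tends to mistake for positives.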

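Hard negative mining means finding more of these hard negatives and adding them to the negative set used for training. This works better than a negative set made up of easy negatives.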
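To make "hard" concrete, here is a minimal sketch that ranks negatives by their loss and keeps the worst ones. It is only an illustration: the function names, the choice of binary cross-entropy as the loss, and the scores are all assumptions made for this example, not something taken from the R-CNN paper.

```python
import math

def binary_cross_entropy(p_positive, label):
    """Log loss for one sample; p_positive is the predicted probability of 'object'."""
    eps = 1e-12  # clamp to avoid log(0)
    p = min(max(p_positive, eps), 1.0 - eps)
    return -math.log(p) if label == 1 else -math.log(1.0 - p)

def rank_hard_negatives(neg_scores, top_k):
    """Return the indices of the top_k negatives (label 0) with the largest loss."""
    losses = [binary_cross_entropy(p, 0) for p in neg_scores]
    order = sorted(range(len(neg_scores)), key=lambda i: losses[i], reverse=True)
    return order[:top_k]

# Hypothetical predicted "object" probabilities for five background samples.
neg_scores = [0.05, 0.92, 0.10, 0.70, 0.30]
hard_idx = rank_hard_negatives(neg_scores, top_k=2)
print(hard_idx)  # [1, 3]
```

The two background samples scored 0.92 and 0.70 are the ones the model most confidently mistakes for objects, so they are the hard negatives; the sample scored 0.05 is an easy negative and contributes almost no loss.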

For hard negative mining, the R-CNN paper cites two papers:

Object detection with discriminatively trained part based models
Example-based learning for view-based human face detection

Let us look at how each of them defines the method.

First, the first paper:

The key passage is:

Bootstrapping methods train a model with an initial subset of negative examples, and then collect negative examples that are incorrectly classified by this initial model to form a set of hard negatives. A new model is trained with the hard negative examples, and the process may be repeated a few times.

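Put simply: train on an initial subset of the negatives, collect the negatives that this first model gets wrong as the hard negatives, train a new model with them, and repeat as needed.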

Next, the second paper:

The key passage is:

we use the following “bootstrap” strategy that incrementally selects only those “nonface” patterns with high utility value:

1. Start with a small set of “nonface” examples in the training database.
2. Train the MLP classifier with the current database of examples.
3. Run the face detector on a sequence of random images. Collect all the “nonface” patterns that the current system wrongly classifies as “faces” (see Fig. 5b). Add these “nonface” patterns to the training database as new negative examples.
4. Return to Step 2.

Hard negative mining in R-CNN uses this same bootstrap approach:

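1. Train the classifier on the initial positive and negative samples. (To keep the data balanced, the negatives used at this point are only a subset of all negatives.)

2. Run the classifier trained in step 1 over the samples and add the negatives it misclassifies (the hard negatives) to the negative subset.

3. Continue training the classifier.

4. Repeat until a stopping condition is met, for example when the classifier's performance stops improving.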
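The loop above can be sketched on toy data. This is a hedged illustration only: the 1-D "features", the midpoint-threshold classifier (a stand-in for the classifier the pipeline would really train), and every function name are assumptions made for this example, and the data are assumed to be separable.

```python
def train_threshold(positives, negatives):
    """Toy 1-D max-margin classifier: the threshold sits halfway between the
    lowest positive and the highest training negative (assumes separable data)."""
    return (min(positives) + max(negatives)) / 2.0

def predict_positive(threshold, x):
    """Classify x as positive when it lies above the threshold."""
    return x > threshold

def mine_hard_negatives(positives, negative_pool, initial_negatives, max_rounds=10):
    """Bootstrap loop: train, collect false positives from the pool, retrain."""
    negatives = list(initial_negatives)
    threshold = train_threshold(positives, negatives)       # step 1
    rounds_with_additions = 0
    for _ in range(max_rounds):
        # Step 2: hard negatives are pool negatives the current model calls positive.
        hard = [x for x in negative_pool
                if predict_positive(threshold, x) and x not in negatives]
        if not hard:                                         # step 4: stopping condition
            break
        negatives.extend(hard)
        threshold = train_threshold(positives, negatives)   # step 3
        rounds_with_additions += 1
    return threshold, negatives, rounds_with_additions

# Hypothetical data: positives on the right, a pool of negatives of varying difficulty.
positives = [3.0, 3.5, 4.0]
negative_pool = [-3.0, -2.5, -2.0, 0.5, 1.5, 2.0, 2.5]
initial_negatives = negative_pool[:3]  # start from an easy subset only

threshold, negatives, rounds = mine_hard_negatives(positives, negative_pool, initial_negatives)
print(threshold)  # 2.75
print(negatives)  # [-3.0, -2.5, -2.0, 1.5, 2.0, 2.5]
print(rounds)     # 1
```

Trained on the easy subset alone, the threshold lands at 0.5, so the negatives at 1.5, 2.0 and 2.5 are wrongly called positive. Adding them moves the threshold to 2.75, after which no pool negative is misclassified and the loop stops. The negative at 0.5 is never added, because the model already classifies it correctly.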

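In other words, hard negative mining in R-CNN is like keeping a notebook of mistakes for the model: in each round it records the examples it got wrong and adds them to the next round of training, until the model's performance stops improving.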