cascade R-CNN算法详解

Cascade R-CNN是针对目标检测问题的优化算法,通过级联多个检测器,逐步提高IoU阈值,减少噪声样本,避免过拟合,提升检测质量。文章详细介绍了算法背景、结构、边界框回归、分类和级联损失函数,并通过实验展示了其优越性。
摘要由CSDN通过智能技术生成

算法背景

论文全称:Cascade R-CNN: Delving into High Quality Object Detection
论文链接:https://arxiv.org/abs/1712.00726
论文时间:2017.12.3

本文是在R-CNN算法上的拓展,针对目标检测问题,我们利用IoU(intersection over union)来定于正样本与负样本,IoU的阈值的定义是非常重要的。IoU阈值太低,eg.0.5,会产生很多带有噪声的样本。

问题

但随着阈值的升高也会导致检测表现下降:

  • IoU阈值太高,会导致正样本的数量太少,从而导致过拟合(overfit)。正样本数量会呈指数下降。
  • 训练与推断使用不一样的阈值会导致不匹配现象。

解决方案

本文提出了一个多阶段的目标检测模型。核心就是利用不断提高的阈值,在保证样本数不减少的情况下训练出高质量的检测器,通过级联检测网络来达到由于预测结果的目的。优化的假设的重采样确保所有的检测器都有一个相当尺寸的正样本集,从而减少过拟合问题。

算法介绍

目标检测问题可以被分为分类与定位两个复杂问题,定位问题由于近似误分类为正样本( “close” false positives)问题而变得难以解决。这些边界框“close but not correct”。

现阶段的提取候选区域的目标检测算法都是两阶段的

In object detection, an intersection over union (IoU) threshold is required to define positives and negatives. An object detector, trained with low IoU threshold, e.g. 0.5, usually produces noisy detections. However, detection per- formance tends to degrade with increasing the IoU thresh- olds. Two main factors are responsible for this: 1) over- fitting during training, due to exponentially vanishing pos- itive samples, and 2) inference-time mismatch between the IoUs for which the detector is optimal and those of the in- put hypotheses. A multi-stage object detection architecture, the Cascade R-CNN, is proposed to address these prob- lems. It consists of a sequence of detectors trained with increasing IoU thresholds, to be sequentially more selec- tive against close false positives. The detectors are trained stage by stage, leveraging the observation that the out- put of a detector is a good distribution for training the next higher quality detector. The resampling of progres- sively improved hypotheses guarantees that all detectors have a positive set of examples of equivalent size, reduc- ing the overfitting problem. The same cascade procedure is applied at inference, enabling a closer match between the hypotheses and the detector quality of each stage. A simple implementation of the Cascade R-CNN is shown to surpass all single-model object detectors on the challeng- ing COCO dataset. Experiments also show that the Cas- cade R-CNN is widely applicable across detector architec- tures, achieving consistent gains independently of the base- line detector strength. The code will be made available at https://github.com/zhaoweicai/cascade-rcnn.
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值