【YOLOX】《YOLOX：Exceeding YOLO Series in 2021》-CSDN博客

本文链接：https://blog.csdn.net/bryant_meng/article/details/131314688

在这里插入图片描述

arXiv-2021

Ge Z, Liu S, Wang F, et al. Yolox: Exceeding yolo series in 2021[J]. arXiv preprint arXiv:2107.08430, 2021.

https://github.com/Megvii-BaseDetection/YOLOX

文章目录

1 Background and Motivation
2 Related Work
3 Advantages / Contributions
4 Method
5 Experiments
6 Conclusion（own）
附录——SimOTA 细节

1 Background and Motivation

目标检测新方向，anchor-based to anchor-free，NMS based to NMS free，static label assignment to 各种新的 label assignment

这些技术没有应用在 yolo 家族上，本文把上述目标检测发展的新技术应用在 yolov3 上，提出 yolo X，效果可观

在这里插入图片描述

2 Related Work

anchor free
NMS free
label assignment

3 Advantages / Contributions

基于 yolov3 提出 yolox
公开数据集验证速度精度有提升
won the 1st Place on Streaming Perception Challenge (Workshop on Autonomous Driving at CVPR 2021) using a single YOLOX-L model.

4 Method

YOLOX-DarkNet53

4.1 Implementation details

BCE Loss for training cls and obj branch, and IoU Loss for training reg branch.

在这里插入图片描述

来自深入浅出Yolo系列之Yolox核心基础完整讲解

在这里插入图片描述

4.2 decoupled head

采用了 decouple head 的形式，可以明显提升收敛速度，如下图所示

在这里插入图片描述

解耦头会收敛更快，精度也会更高，但会增加运算的复杂度
在这里插入图片描述
来自深入浅出Yolo系列之Yolox核心基础完整讲解

定量分析看，提点也很明显
在这里插入图片描述

end-to-end YOLO 采用了 NMS-free 的技术，介绍如下
在这里插入图片描述

Zhou Q, Yu C. Object detection made simpler by eliminating heuristic NMS[J]. IEEE Transactions on Multimedia, 2023, 25: 9254-9262.

4.3 Strong data augmentation

引入了 Mosaic and MixUp，close it for the last 15 epochs

加了强数据增强后，发现 ImageNet pre-train is no more beneficial，作者都 train from scratch 了

4.4 anchor-free

anchor based 的方法的缺点

需要根据数据集先聚类得到 anchor，缺乏泛化性
引入了更多的计算量

anchor based 是 $3 * （ 20 * 20 + 40 * 40 + 80 * 80 ） * 85$

anchor free 是 $1 * （ 20 * 20 + 40 * 40 + 80 * 80 ） * 85$

anchor-free，采用的是 FCOS 的那套

在这里插入图片描述

Tian Z, Shen C, Chen H, et al. FCOS: A simple and strong anchor-free object detector[J]. IEEE transactions on pattern analysis and machine intelligence, 2020, 44(4): 1922-1933.

center location of each object as the positive sample and pre-define a scale range