NIPS2018 - Reducing Network Agnostophobia

CarolSu2055

已于 2024-03-13 14:11:56 修改

阅读量1.2k

点赞数

文章标签：人工智能深度学习计算机视觉

于 2022-09-01 14:41:56 首次发布

本文链接：https://blog.csdn.net/qq_38712865/article/details/126620091

版权

ABSTRACT
Agnostophobia, the fear of the unknown, can be experienced by deep learning engineers while applying their networks to real-world applications. Unfortunately, network behavior is not well defined for inputs far from a networks training set. In an uncontrolled environment, networks face many instances that are not of interest to them and have to be rejected in order to avoid a false positive. This problem has previously been tackled by researchers by either a) thresholding softmax, which by construction cannot return none of the known classes, or b) using an additional background or garbage class. In this paper, we show that both of these approaches help, but are generally insufficient when previously unseen classes are encountered. We also introduce a new evaluation metric that focuses on comparing the performance of multiple approaches in scenarios where such unseen classes or unknowns are encountered. Our major contributions are simple yet effective Entropic Open-Set and Objectosphere losses that train networks using negative samples from some classes. These novel losses are designed to maximize entropy for unknown inputs while increasing separation in deep feature space by modifying magnitudes of known and unknown samples. Experiments on networks trained to classify classes from MNIST and CIFAR-10 show that our novel loss functions are significantly better at dealing with unknown inputs from datasets such as Devanagari, NotMNIST, CIFAR-100, and SVHN.

图像分类任务，作为计算机视觉中最基础的任务，看似简单，却在实际应用中常面临一些不足。鲁棒性(Robustness)、开集问题(Open-set)、类别不均衡(Class imbalance) 这些都是在学术数据集上很少考虑，而实际中常见且直接影响算法效果的问题。这篇文章讨论的是开集问题。简言之，开集问题是在测试时如何针对训练集不包含的类别数据进行预测/分类。（题外话，人脸识别就是典型开集识别问题，训练集中 ID 和应用场景中 ID 往往有较大差异，采用度量学习方法）不考虑开集问题，在实际中会造成大量 false positive，影响使用体感。

文章中以手写数字识别为例(0-9，10分类问题)，将 Devanagari 数据集作为开集数据。采用 LeNet++ 作为backbone，将图片映射到 2 维特征空间，进而预测 0-9 类别。

符号表

符号	含义
Y	所有类别空间
C	已知类别，known classes (1…C)
U	未知类别，unknown classes
B	未知类别子集1，background, garbage, or known unknown classes.
A	未知类别子集2，unknown unknown classes
D	测试数据集
D’	训练数据集