Paper notes: A brief introduction to weakly supervised learning - 2017

This paper reviews research progress on weakly supervised learning, focusing on three typical types of weak supervision: incomplete supervision (only part of the training data is labeled), inexact supervision (only coarse-grained labels are provided), and inaccurate supervision (the given labels may not be the ground truth). It mainly discusses active learning and semi-supervised learning as techniques for coping with incomplete supervision, together with the cluster assumption and the manifold assumption underlying semi-supervised learning, and introduces multi-instance learning for handling inexact supervision.

2017 A brief introduction to weakly supervised learning

Zhou Zhihua (Nanjing University), National Science Review (IF 17.3), 2017 (Citations: 815)

ABSTRACT

This article reviews research progress on weakly supervised learning, focusing on three typical types of weak supervision (see the Introduction for a more detailed explanation):

  • incomplete supervision, where only a subset of training data is given with labels;
  • inexact supervision, where the training data are given with only coarse-grained labels;
  • inaccurate supervision, where the given labels are not always ground-truth.

INTRODUCTION

Typically, there are three types of weak supervision.

  • incomplete supervision, i.e. only a (usually small) subset of training data is given with labels while the other data remain unlabeled.

For example, in image categorization the ground-truth labels are given by human annotators; it is easy to get a huge number of images from the Internet, whereas only a small subset of images can be annotated due to the human cost.

  • inexact supervision, i.e. only coarse-grained labels are given.

It is desirable to have every object in the images annotated; however, usually we only have image-level labels rather than object-level labels.

  • inaccurate supervision, i.e. the given labels are not always ground-truth.

Such a situation occurs, e.g. when the image annotator is careless or weary, or some images are difficult to categorize.

INCOMPLETE SUPERVISION

Incomplete supervision concerns the situation in which we are given a small amount of labeled data, which is insufficient to train a good learner, while abundant unlabeled data are available.

Formally, the task is to learn $f: \mathcal{X} \mapsto \mathcal{Y}$ from a training data set $D = \{(x_1, y_1), \ldots, (x_l, y_l), x_{l+1}, \ldots, x_m\}$.
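As a concrete illustration of this data layout (a minimal sketch, not from the paper; the toy arrays and the use of `-1` as an "unlabeled" marker are my own assumptions, following the common scikit-learn convention):

```python
import numpy as np

# Hypothetical toy data set D with m = 6 instances, of which only l = 2 are labeled.
X = np.array([[0.0, 0.1], [0.9, 1.0],   # x_1, x_2 (labeled)
              [0.1, 0.0], [1.0, 0.9],   # x_3 ... x_6 (unlabeled)
              [0.2, 0.1], [0.8, 1.1]])
y = np.array([0, 1,                     # y_1, y_2
              -1, -1, -1, -1])          # -1 marks the unlabeled instances x_{l+1}, ..., x_m

l = int(np.sum(y != -1))   # number of labeled examples
m = len(X)                 # total number of instances
print(f"{l} labeled, {m - l} unlabeled")
```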

There are two major techniques for this purpose:
$$\text{incomplete supervision}\left\{\begin{aligned}&\text{active learning (with human intervention)}\\&\text{semi-supervised learning (without human intervention)}\end{aligned}\right.$$

  • active learning;

Active learning assumes that there is an 'oracle', such as a human expert, that can be queried to get ground-truth labels for selected unlabeled instances.

$$\text{selection criteria of active learning}\left\{\begin{aligned}&\text{informativeness}\\&\text{representativeness}\end{aligned}\right.$$
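Informativeness measures how much labeling an instance would reduce the uncertainty of the current model, while representativeness measures how well an instance represents the structure of the unlabeled pool. The snippet below is a minimal sketch of the informativeness criterion via uncertainty sampling; the toy data, the choice of logistic regression, the `oracle` stand-in function, and the query budget are my own illustrative assumptions, not the paper's method:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)

# Hypothetical pool: a few labeled points plus a large unlabeled pool.
X_labeled = np.array([[0.0, 0.0], [1.0, 1.0]])
y_labeled = np.array([0, 1])
X_pool = rng.uniform(0, 1, size=(100, 2))   # unlabeled instances

def oracle(x):
    # Stand-in for the human expert: labels by a hidden rule for this toy example.
    return int(x.sum() > 1.0)

for _ in range(10):                          # query budget of 10 labels
    clf = LogisticRegression().fit(X_labeled, y_labeled)
    proba = clf.predict_proba(X_pool)[:, 1]
    # Uncertainty sampling: query the instance whose predicted probability is
    # closest to 0.5, i.e. the one the current model is least sure about.
    uncertainty = 1.0 - 2.0 * np.abs(proba - 0.5)
    i = int(np.argmax(uncertainty))
    X_labeled = np.vstack([X_labeled, X_pool[i]])
    y_labeled = np.append(y_labeled, oracle(X_pool[i]))
    X_pool = np.delete(X_pool, i, axis=0)

print("labeled set size after querying:", len(X_labeled))
```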

  • semi-supervised learning.

In contrast, semi-supervised learning attempts to automatically exploit unlabeled data in addition to labeled data to improve learning performance, where no human intervention is assumed.

$$\text{semi-supervised learning}\left\{\begin{aligned}&\text{(pure) semi-supervised learning}\\&\text{transductive learning}\end{aligned}\right.$$

The difference is that pure semi-supervised learning assumes the unlabeled data are not the data to be predicted, whereas transductive learning assumes the unlabeled instances are exactly the test data on which predictions are required.

Actually, in semi-supervised learning there are two basic assumptions, i.e. the cluster assumption and the manifold assumption; both are about data distribution. The former assumes that data have inherent cluster structure, and thus, instances falling into the same cluster have the same class label. The latter assumes that data lie on a manifold, and thus, nearby instances have similar predictions. The essence of both assumptions lies in the belief that similar data points should have similar outputs, whereas unlabeled data can be helpful to disclose which data points are similar.

$$\text{four major categories of semi-supervised learning}\left\{\begin{aligned}&\text{generative methods}\\&\text{graph-based methods}\\&\text{low-density separation methods}\\&\text{disagreement-based methods}\end{aligned}\right.$$
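As a small, hedged sketch of one of these categories, the snippet below uses scikit-learn's LabelSpreading, a graph-based method that propagates labels along a similarity graph and thus exploits the cluster/manifold assumptions described above; the two-moons data and the kernel parameters are my own assumptions, not an implementation from the paper:

```python
import numpy as np
from sklearn.datasets import make_moons
from sklearn.semi_supervised import LabelSpreading

# Toy two-moons data: the cluster/manifold structure is what the method exploits.
X, y_true = make_moons(n_samples=200, noise=0.05, random_state=0)

# Keep labels for only 10 points; -1 marks the unlabeled instances.
y = np.full_like(y_true, -1)
labeled_idx = np.random.default_rng(0).choice(len(X), size=10, replace=False)
y[labeled_idx] = y_true[labeled_idx]

# Graph-based semi-supervised learning: labels spread along a kNN similarity graph.
model = LabelSpreading(kernel="knn", n_neighbors=7).fit(X, y)

unlabeled = (y == -1)
acc = (model.transduction_[unlabeled] == y_true[unlabeled]).mean()
print(f"transductive accuracy on the unlabeled points: {acc:.2f}")
```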

INEXACT SUPERVISION

Formally, the task is to learn $f: \mathcal{X} \mapsto \mathcal{Y}$ from a training data set $D = \{(X_1, y_1), \ldots, (X_m, y_m)\}$, where $X_i = \{x_{i1}, \ldots, x_{i m_i}\} \subseteq \mathcal{X}$ is called a bag, $x_{ij} \in \mathcal{X}$ $(j \in \{1, \ldots, m_i\})$ is an instance, $m_i$ is the number of instances in $X_i$, and $y_i \in \mathcal{Y} = \{Y, N\}$. $X_i$ is a positive bag, i.e. $y_i = Y$, if there exists an $x_{ip}$ that is positive, while $p \in \{1, \ldots, m_i\}$ is unknown. The goal is to predict labels for unseen bags. This is called multi-instance learning.
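A minimal sketch of the standard multi-instance assumption stated above, i.e. a bag is positive if and only if at least one of its instances is positive. Here an instance-level scorer is trained under the naive simplification that every instance inherits its bag's label, and a bag is predicted positive if its maximum instance score crosses a threshold; the synthetic bags, the classifier, and the 0.5 threshold are illustrative assumptions, not the paper's algorithm:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)

def make_bag(positive):
    # A bag is a set of instances; a positive bag contains at least one positive instance.
    n = int(rng.integers(3, 8))
    bag = rng.normal(0.0, 1.0, size=(n, 2))
    if positive:
        bag[rng.integers(n)] += np.array([4.0, 4.0])   # inject one positive instance
    return bag

bags = [make_bag(i % 2 == 0) for i in range(40)]
bag_labels = np.array([1 if i % 2 == 0 else 0 for i in range(40)])

# Naive instance-level training: each instance inherits its bag's label.
X_inst = np.vstack(bags)
y_inst = np.concatenate([[lab] * len(b) for b, lab in zip(bags, bag_labels)])
clf = LogisticRegression().fit(X_inst, y_inst)

# Bag-level prediction: positive if the most positive-looking instance scores above 0.5.
bag_pred = np.array([int(clf.predict_proba(b)[:, 1].max() > 0.5) for b in bags])
print("bag-level training accuracy:", (bag_pred == bag_labels).mean())
```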
