Meta & Few-Shot Learning
1. Basic Concepts
- Meta Learning
- **Few-shot learning** is a kind of **meta learning**.
- Meta learning: **learn to learn**.
- Supervised Learning vs. Few-Shot Learning
- Traditional supervised learning:
- **Test** samples are **never seen before**.
- **Test** samples are from **known classes**.
- Few-shot learning:
- **Query** samples are **never seen before**.
- **Query** samples are from **unknown classes**.
- Terminology
- Training Set:
- Support Set:
- **k-way**: the support set has $k$ classes.
- **n-shot**: every class has $n$ samples.
- 3-way is easier than 6-way;
- 2-shot is easier than 1-shot.
- Query:
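The $k$-way $n$-shot terminology above can be made concrete by sampling an episode from a labeled dataset. A minimal sketch in plain Python (the dataset dict and class names are made-up toy data for illustration):

```python
import random

def sample_episode(dataset, k, n, q=1):
    """Sample a k-way n-shot episode: a support set with k classes and
    n samples per class, plus q held-out query samples per class."""
    classes = random.sample(sorted(dataset), k)   # pick k distinct classes
    support, query = [], []
    for c in classes:
        picks = random.sample(dataset[c], n + q)
        support += [(x, c) for x in picks[:n]]    # n labeled support samples
        query += [(x, c) for x in picks[n:]]      # query samples to classify
    return support, query

# Toy dataset: class name -> list of samples.
data = {"cat": [1, 2, 3], "dog": [4, 5, 6], "fox": [7, 8, 9], "owl": [10, 11, 12]}
support, query = sample_episode(data, k=3, n=2, q=1)
# 3-way 2-shot: 3 classes x 2 samples = 6 support pairs, plus 3 query pairs.
```

Fewer ways and more shots make the episode easier, which is exactly why 3-way beats 6-way and 2-shot beats 1-shot.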
- Idea: Learn a Similarity Function
- Basic Idea:
- Learn a similarity function: $\mathrm{sim}(x, x^*)$.
- Ideally, $\mathrm{sim}(x_1, x_2) = 1$, $\mathrm{sim}(x_1, x_3) = 0$, and $\mathrm{sim}(x_2, x_3) = 0$, where $x_1$ and $x_2$ belong to the same class and $x_3$ to a different class.
- Step:
- First, learn a similarity function from a large-scale training dataset.
- Then, apply the similarity function for prediction.
- Compare the **query** with every sample in the **support set**.
- Find the sample with the highest similarity score.
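The two prediction steps can be sketched as follows. Here a plain cosine similarity on feature vectors stands in for the learned similarity function (in practice it would be a trained network):

```python
import math

def cosine_sim(u, v):
    # Cosine similarity between two feature vectors.
    dot = sum(a * b for a, b in zip(u, v))
    return dot / (math.sqrt(sum(a * a for a in u)) *
                  math.sqrt(sum(b * b for b in v)))

def predict(query, support):
    """Compare the query with every (sample, label) pair in the support set
    and return the label of the sample with the highest similarity score."""
    best_label, best_score = None, -float("inf")
    for x, label in support:
        s = cosine_sim(query, x)
        if s > best_score:
            best_label, best_score = label, s
    return best_label

support = [((1.0, 0.0), "cat"), ((0.0, 1.0), "dog")]
print(predict((0.9, 0.1), support))  # most similar to the "cat" sample
```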
- Datasets
- Omniglot
- Official website: https://github.com/brendenlake/omniglot/
- TensorFlow: https://www.tensorflow.org/datasets/catalog/omniglot
2. Siamese Network
2.1 Learning Pairwise Similarity Scores
Ref:
- Bromley et al. Signature verification using a Siamese time delay neural network. In NIPS. 1994.
- Koch, Zemel, & Salakhutdinov. Siamese neural networks for one-shot image recognition. In ICML, 2015.
- Data for Training set
Each time, select two training samples and label the pair: 1 if they are from the same class, 0 otherwise.
- CNN for Feature Extraction
- Training Siamese Network
- Forward Pass
- Backward Pass
Update the parameters of the CNN.
- One-shot Prediction
The training data (for the Siamese network) contains neither the support-set classes nor the query.
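The forward pass on one pair can be sketched as follows. A toy `embed` function stands in for the shared CNN, and the dense layer that maps the feature difference to a scalar is collapsed to a simple sum with a fixed bias; in the real network all of these are learned parameters updated in the backward pass:

```python
import math

def embed(x):
    # Stand-in for the shared CNN feature extractor f(x).
    return [xi * 0.5 for xi in x]

def pair_score(x1, x2):
    """Siamese forward pass: embed both inputs with the SAME network,
    take the elementwise absolute difference of the features, and map
    it to a similarity score in (0, 1) with a sigmoid."""
    f1, f2 = embed(x1), embed(x2)
    z = sum(abs(a - b) for a, b in zip(f1, f2))  # toy dense layer: plain sum
    return 1.0 / (1.0 + math.exp(z - 1.0))       # toy bias of 1.0

def bce_loss(score, target):
    # target = 1 for a positive (same-class) pair, 0 for a negative pair.
    return -(target * math.log(score) + (1 - target) * math.log(1 - score))
```

A same-class pair gets a higher score (and, with target 1, a lower loss) than a pair of dissimilar inputs.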
2.2 Triplet Loss
Ref:
- Schroff, Kalenichenko, & Philbin. Facenet: A unified embedding for face recognition and clustering. In CVPR, 2015.
- Data for Training set
Each time, select three training samples: an anchor, a positive sample from the same class as the anchor, and a negative sample from a different class.
- CNN for Feature Extraction
All three inputs (anchor, positive, negative) share the same CNN for feature extraction.
- Triplet Loss
- One-Shot Prediction
2.3 Basic Idea of Few-Shot Learning
- Train a **Siamese network** on a large-scale training set.
- Given a **support set** of $k$-way $n$-shot.
- $k$-way means $k$ classes.
- $n$-shot means every class has $n$ samples.
- The training set does not contain the $k$ classes.
- Given a **query**, predict its class.
- Use the Siamese network to compute similarity or distance.
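The prediction step for a $k$-way $n$-shot support set can be sketched as follows. A toy `embed` function stands in for the trained Siamese CNN, and one common variant is assumed: average each class's $n$ embeddings and assign the query to the nearest class mean:

```python
def embed(x):
    # Stand-in for the trained Siamese feature extractor.
    return list(x)

def class_means(support):
    """Average the embeddings of the n samples of each of the k classes."""
    sums, counts = {}, {}
    for x, label in support:
        f = embed(x)
        acc = sums.setdefault(label, [0.0] * len(f))
        for i, v in enumerate(f):
            acc[i] += v
        counts[label] = counts.get(label, 0) + 1
    return {c: [v / counts[c] for v in s] for c, s in sums.items()}

def predict(query, support):
    # Assign the query to the class whose mean embedding is nearest.
    means = class_means(support)
    fq = embed(query)
    return min(means, key=lambda c: sum((a - b) ** 2
                                        for a, b in zip(fq, means[c])))

# 2-way 2-shot toy support set.
support = [((0, 0), "cat"), ((0, 2), "cat"), ((4, 4), "dog"), ((4, 6), "dog")]
print(predict((0.5, 1.0), support))  # nearest class mean is "cat" at (0, 1)
```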
3. Pretraining and Fine Tuning
- Cosine Similarity
- Softmax Function
- Softmax Classifier (fully connected layer + softmax function)
Here, $k$ is the number of classes, and $d$ is the number of features.
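These three building blocks fit together as follows. A minimal sketch, where the classifier's weight matrix $W$ has one $d$-dimensional row per class and each logit is the cosine similarity between a row and the feature vector:

```python
import math

def cosine_sim(u, v):
    # Cosine similarity: dot product divided by the two norms.
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv)

def softmax(z):
    m = max(z)                         # subtract max for numerical stability
    e = [math.exp(v - m) for v in z]
    s = sum(e)
    return [v / s for v in e]

def softmax_classifier(W, x):
    """Fully connected layer + softmax: one cosine score per class row of W,
    normalized into a probability distribution over the k classes."""
    return softmax([cosine_sim(w, x) for w in W])

W = [[1.0, 0.0], [0.0, 1.0]]           # k = 2 classes, d = 2 features
p = softmax_classifier(W, [0.9, 0.1])  # higher probability for class 0
```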
3.1 Few-Shot Prediction Using Pretrained CNN
Reference:
- Dhillon, Chaudhari, Ravichandran, & Soatto. A baseline for few-shot image classification. In ICLR, 2020.
- Chen, Wang, Liu, Xu, & Darrell. A New Meta-Baseline for Few-Shot Learning. arXiv, 2020.
- Pretraining
- Pretrain a CNN for **feature extraction** (aka embedding).
- The CNN can be pretrained using **standard supervised learning** or a **Siamese network**.
- Deal with the Support set
- Making Few-Shot Prediction
Here $q$ denotes the query.
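The prediction step can be sketched as follows, assuming the baseline in the cited papers: normalize and average each class's pretrained features to get a mean vector $\mu_k$, stack the normalized means into a matrix $M$, and classify the query feature $q$ with $\mathrm{softmax}(Mq)$:

```python
import math

def normalize(v):
    # Scale a feature vector to unit length.
    n = math.sqrt(sum(x * x for x in v))
    return [x / n for x in v]

def softmax(z):
    m = max(z)
    e = [math.exp(x - m) for x in z]
    s = sum(e)
    return [x / s for x in e]

def few_shot_predict(support_features, q):
    """support_features: {class: [feature vectors from the pretrained CNN]}.
    q: feature vector of the query. Returns (classes, probabilities)."""
    classes = sorted(support_features)
    M = []
    for c in classes:
        feats = [normalize(f) for f in support_features[c]]
        mean = [sum(col) / len(feats) for col in zip(*feats)]
        M.append(normalize(mean))      # mu_k: normalized class mean
    logits = [sum(a * b for a, b in zip(mu, normalize(q))) for mu in M]
    return classes, softmax(logits)

# Toy 2-way 2-shot support features and a query feature.
feats = {"cat": [[1.0, 0.0], [0.9, 0.1]], "dog": [[0.0, 1.0], [0.1, 0.9]]}
classes, p = few_shot_predict(feats, [0.8, 0.2])  # "cat" gets the higher probability
```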
- Summary
3.2 Benefit of Fine Tuning
Reference:
- Chen, Liu, Kira, Wang, & Huang. A Closer Look at Few-shot Classification. In ICLR, 2019.
- Dhillon, Chaudhari, Ravichandran, & Soatto. A baseline for few-shot image classification. In ICLR, 2020.
- Chen, Wang, Liu, Xu, & Darrell. A New Meta-Baseline for Few-Shot Learning. arXiv, 2020.
- Fine-tuning is an improved algorithm for few-shot prediction using a pretrained CNN.
- The process of few-shot prediction using a pretrained CNN: the index $j$ of $x_j$ refers to the **query**'s feature vector, and the classifier parameters $W$ and $b$ are obtained from the support set.
- Trick 1: A Good Initialization
We can train $W$ and $b$ on the support set (fine-tuning).
- Trick 2: Entropy Regularization
- Trick 3: Cosine Similarity + Softmax Classifier
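The fine-tuning objective combining tricks 1 and 2 can be sketched as: cross-entropy on the labeled support set plus an entropy regularizer on the (unlabeled) query predictions. This is only the loss computation; the cited papers then minimize it over $W$ and $b$ by gradient descent, with trick 1 providing their initialization:

```python
import math

def softmax(z):
    m = max(z)
    e = [math.exp(x - m) for x in z]
    s = sum(e)
    return [x / s for x in e]

def predict(W, b, x):
    # Softmax classifier: p = softmax(W x + b).
    return softmax([sum(wi * xi for wi, xi in zip(w, x)) + bi
                    for w, bi in zip(W, b)])

def cross_entropy(p, label):
    return -math.log(p[label])

def entropy(p):
    return -sum(pi * math.log(pi) for pi in p if pi > 0)

def fine_tune_loss(W, b, support, queries, lam=0.1):
    """support: [(feature vector, label index)], queries: [feature vector].
    Trick 2: low entropy on query predictions means confident,
    well-separated class boundaries."""
    ce = sum(cross_entropy(predict(W, b, x), y) for x, y in support)
    ent = sum(entropy(predict(W, b, x)) for x in queries)
    return ce / len(support) + lam * ent / len(queries)
```

A classifier that separates the support classes more confidently gets a lower value of this objective, which is what the fine-tuning steps drive toward.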
- Summary