- These are notes for Hung-yi Lee's 2021 ML course
Life Long Learning, Continuous Learning, Never Ending Learning, Incremental Learning
Life Long Learning
What people think about AI …
- The idea is to keep strengthening the model's capability by having it continually learn new tasks
→ Even though I have learned task 2, I do not forget task 1.
Life Long Learning in real-world applications
Catastrophic Forgetting
- In reality, however, the model gradually forgets previously learned tasks during training…
- Example:
The network has enough capacity to learn both tasks.
Multi-task training
- Multi-task training can be considered as the upper bound of LLL. But can multi-task training solve the problem?
- No: multi-task training requires keeping the data of all previous tasks and retraining on all of it whenever a new task arrives, which does not scale in either storage or computation.
Evaluation
- First of all, we need a sequence of tasks. (The tasks used in current LLL research are still fairly simple.)
- e.g. a "permutation" here means scrambling the digit images according to some fixed rule; each permutation defines one task (see the sketch below)
- $R_{i,j}$: performance on task $j$ after training on task $i$. If $i>j$: after training on task $i$, has task $j$ been forgotten? If $i<j$: can the skill learned on task $i$ transfer to task $j$?
- Two common evaluation metrics (computed in the sketch below):
- (1) Accuracy: $\frac{1}{T}\sum_{i=1}^{T} R_{T,i}$
- (2) Backward Transfer (usually negative): $\frac{1}{T-1}\sum_{i=1}^{T-1}\left(R_{T,i}-R_{i,i}\right)$
Research Directions
Selective Synaptic Plasticity (Regularization-based Approach)
Selective Synaptic Plasticity: only some of the connections (synapses) between neurons in the network are allowed to stay plastic; the rest must be consolidated.
Why Catastrophic Forgetting?
- Basic Idea: some parameters in the model are important to the previous tasks, so only change the unimportant ones ($\theta$ should be close to $\theta^b$ in certain directions). The regularized loss is $L'(\theta)=L(\theta)+\lambda\sum_i b_i\left(\theta_i-\theta_i^b\right)^2$, where $\theta^b$ denotes the parameters learned on previous tasks and each guard $b_i$ measures how important $\theta_i$ is to them.
- If $b_i=0$, there is no constraint on $\theta_i$ $\Rightarrow$ Catastrophic Forgetting
- If $b_i=\infty$, $\theta_i$ would always be equal to $\theta_i^b$ $\Rightarrow$ Intransigence
SGD denotes ordinary training, which leads to catastrophic forgetting; L2 sets every $b_i$ to 1, which leads to intransigence
- How do we know whether a parameter matters to a given task? The rough idea: after training a model on that task, check whether perturbing a parameter changes the loss a lot. If it does, that parameter $\theta_i$ is considered important, and its corresponding $b_i$ can be set to a large value. After finishing each task, the newly estimated $b_i$ is accumulated onto the running total to give the final $b_i$:
Different methods for computing $b_i$ (a sketch of the EWC case follows this list):
- Elastic Weight Consolidation (EWC)
- Synaptic Intelligence (SI)
- Memory Aware Synapses (MAS)
- RWalk
- Sliced Cramer Preservation (SCP)
Gradient Episodic Memory (GEM)
- paper: Gradient Episodic Memory for Continual Learning
- GEM is also a selective-synaptic-plasticity-style method, but its idea is to use gradients computed on previous tasks' data to adjust the new gradient, so it has to store a small amount of training data from earlier tasks:
Additional Neural Resource Allocation
Progressive Neural Networks
- paper: Progressive Neural Networks
- Each new task adds a new column of parameters; the previous columns are frozen, and their hidden activations are fed into the new column as extra inputs, so old tasks are never overwritten.
PackNet
Each task uses only a subset of the parameters
Compacting, Picking, and Growing (CPG)
- paper: Compacting, Picking and Growing for Unforgetting Continual Learning
- Combines the ideas of Progressive Neural Networks and PackNet
Memory Replay
- paper:
Idea: Generating Data
- Generate pseudo-data for previous tasks using a generative model (see the sketch below)
To learn more…
Adding new classes
Different tasks may involve different numbers of classes
Curriculum Learning
- What is the proper learning order?