Continual Learning: "Selfless Sequential Learning" (ICLR 2019)



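This post discusses the challenge of sequential learning under a fixed model capacity. Drawing on how the mammalian brain learns, it introduces Sparse coding through Local Neural Inhibition and Discounting (SLNID), which aims to produce efficient, non-interfering representations, reduce overfitting, and avoid catastrophic forgetting.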


Abstract

Sequential learning, also called lifelong learning, incremental learning, or continual learning, is studied here in the scenario with a fixed model capacity. The learning process should account for future tasks that will be added and therefore leave enough capacity for them (that is, it should not be selfish).

Introduction

The challenging setting is learning a sequence of tasks without access to any previous or future task data, restricted to a fixed model capacity. The motivation is explained through neuroscience, specifically how the mammalian brain learns tasks. First, only a small number of neurons are activated to represent a piece of information; then the activated neurons reduce the activity of the neurons around them (lateral inhibition). This creates a powerful, decorrelated, and compact representation with minimum interference between different input patterns in the brain (Yu et al., 2014).
"Reducing Overfitting in Deep Networks by Decorrelating Representations" (arXiv 2015) shows that when the amount of overfitting in a neural network is reduced, the representation correlation is also reduced.
Sparsity can be imposed on the parameters or on the representation.
A few concepts need to be kept straight. Disentangled representation: a disentangled representation is less prone to catastrophic forgetting.
Sparse and decorrelated representation: these notes treat a decorrelated representation as equivalent to a disentangled representation.
Related prior work: EWC (Elastic Weight Consolidation) and MAS (Memory Aware Synapses).
The paper then turns to sparsity: the main idea of the regularizer is to penalize neurons that are active at the same time.

Method

Sparse coding through Local Neural Inhibition and Discounting (SLNID). The paper introduces a new regularizer that encourages sparsity in the activations of each layer.
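To make the idea of penalizing neurons that are active at the same time more concrete, here is a minimal sketch in plain Python. It is an illustration under stated assumptions, not the paper's exact formulation: the function name `local_coactivation_penalty`, the Gaussian weighting over neuron index distance (parameter `sigma`), and the exponential discount by a per-neuron `importance` score are all choices made for this example.

```python
import math


def local_coactivation_penalty(activations, sigma=1.0, importance=None):
    """Penalize pairs of nearby neurons that fire on the same sample.

    activations: list of samples, each a list of non-negative activations
                 for one layer (all samples have the same length).
    sigma:       assumed width of the Gaussian locality window over neuron
                 indices; closer neurons inhibit each other more strongly.
    importance:  assumed per-neuron importance from earlier tasks; pairs of
                 important neurons are discounted so they are not pushed apart.
    """
    num_samples = len(activations)
    num_neurons = len(activations[0])
    if importance is None:
        importance = [0.0] * num_neurons

    penalty = 0.0
    for i in range(num_neurons):
        for j in range(num_neurons):
            if i == j:
                continue  # a neuron does not inhibit itself
            # Locality: weight decays with the distance between neuron indices.
            locality = math.exp(-((i - j) ** 2) / (2.0 * sigma ** 2))
            # Discounting: less penalty when both neurons matter for old tasks.
            discount = math.exp(-(importance[i] + importance[j]))
            # Co-activation: large when both neurons fire on the same samples.
            coactivation = sum(sample[i] * sample[j] for sample in activations)
            penalty += locality * discount * coactivation
    return penalty / num_samples


if __name__ == "__main__":
    one_active = [[1.0, 0.0, 0.0, 0.0], [0.0, 0.0, 1.0, 0.0]]
    neighbors_active = [[1.0, 1.0, 0.0, 0.0], [0.0, 0.0, 1.0, 1.0]]
    print(local_coactivation_penalty(one_active))
    print(local_coactivation_penalty(neighbors_active))
```

In this sketch a representation where only one neuron fires per sample incurs no penalty, while neighboring neurons firing together are penalized. In training, a term like this would be added to the task loss for each layer's activations.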

Conclusion

Sparsity should be imposed at the level of the representation rather than at the level of the network parameters.
The motivation for the proposed method comes from lateral inhibition in the mammalian brain. Concretely, it is a new regularizer that decorrelates nearby active neurons.
Learning a new task selflessly means leaving capacity for future tasks; forgetting of previous tasks is avoided by taking neuron importance into account (a similar insight to parameter importance in earlier work).
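The parameter-importance idea from earlier work such as EWC and MAS can be sketched as a quadratic penalty that discourages changes to parameters that mattered for previous tasks. The snippet below is a simplified illustration over flat lists of numbers; the function name `importance_weighted_penalty` and the `strength` parameter are hypothetical, and how the importance values are estimated (which differs between methods) is left out.

```python
def importance_weighted_penalty(params, old_params, importance, strength=1.0):
    """Sum of importance-weighted squared changes to the parameters.

    params:     current parameter values while learning the new task.
    old_params: parameter values stored after the previous task.
    importance: non-negative importance score per parameter (assumed given).
    strength:   assumed trade-off weight between the new-task loss and this term.
    """
    if not (len(params) == len(old_params) == len(importance)):
        raise ValueError("params, old_params and importance must have equal length")
    return strength * sum(
        weight * (new - old) ** 2
        for new, old, weight in zip(params, old_params, importance)
    )


if __name__ == "__main__":
    old = [1.0, 2.0]
    importance = [10.0, 0.0]  # first parameter matters, second is free to change
    print(importance_weighted_penalty([1.5, 2.0], old, importance))
    print(importance_weighted_penalty([1.0, 9.0], old, importance))
```

Parameters with zero importance can move freely, which is the capacity left over for future tasks; a sparse representation leaves more such unimportant parameters available.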

Key points: the motivation of this paper is strong; the way it frames the method with a neuroscience explanation is worth learning from.