2022 VeLO: Training Versatile Learned Optimizers by Scaling Up

文三路张同学

Article link: https://blog.csdn.net/qq_36160277/article/details/128687013

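VeLO is an optimizer designed around the idea of meta-learning. Trained at large scale, it adapts to a wide range of tasks without hyperparameter tuning. Its architecture combines LSTMs with hypernetwork MLPs: each LSTM controls the parameters of multiple MLPs, and the LSTMs work together through a global context. During meta-training, the optimizer takes parameter values and gradients as input and outputs parameter updates, with the goal of finding an update rule that effectively optimizes a specified objective.

Summary generated automatically by CSDN.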

VeLO: Training Versatile Learned Optimizers by Scaling Up

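The idea is to train a general-purpose optimizer by scaling up.

By design, the optimizer follows the meta-learning approach: it learns from experience on related tasks in order to help learn the target task.

Compared with transfer learning, meta-learning puts more emphasis on acquiring meta-knowledge, that is, general knowledge about a class of tasks that can be generalized to further tasks.

Building on this idea, VeLO takes in gradients and automatically outputs parameter updates. It requires no hyperparameter tuning and adapts to the various tasks being optimized.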

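Architecturally, the learned optimizer consists of LSTMs (long short-term memory networks) and hypernetwork MLPs (multilayer perceptrons).

Each LSTM is responsible for setting the parameters of multiple MLPs, and the LSTMs cooperate with one another through global context information.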
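To make the division of labor concrete, here is a minimal sketch of the "controller sets the weights of a per-parameter network" pattern described above. It is not the VeLO architecture itself. Several things are assumptions made for illustration: a single tanh recurrent unit stands in for each LSTM, a single linear unit stands in for each MLP, the tensor-level feature is the mean absolute gradient, the global context is the average of the controllers' hidden states, and all names (`TensorController`, `optimizer_step`, and so on) are hypothetical.

```python
import math


class TensorController:
    """Stand-in for a per-tensor LSTM: one tanh recurrent unit (a simplification)."""

    def __init__(self, w_in=1.0, w_rec=0.5, w_ctx=0.5):
        self.w_in, self.w_rec, self.w_ctx = w_in, w_rec, w_ctx
        self.hidden = 0.0

    def step(self, grads, global_context):
        # Assumed tensor-level feature: mean absolute gradient of this tensor.
        feature = sum(abs(g) for g in grads) / len(grads)
        self.hidden = math.tanh(
            self.w_in * feature
            + self.w_rec * self.hidden
            + self.w_ctx * global_context
        )
        return self.hidden

    def mlp_weights(self):
        # Hypernetwork output: the weights of the per-parameter network
        # (a single linear unit here) depend on the controller's hidden state.
        return {"w_grad": 0.1 * (1.0 + self.hidden), "w_param": 0.0}


def per_parameter_update(param, grad, weights):
    # The small per-parameter network turns (grad, param) into a step.
    step = weights["w_grad"] * grad + weights["w_param"] * param
    return param - step


def optimizer_step(tensors, grads, controllers, global_context):
    hiddens = [c.step(g, global_context) for c, g in zip(controllers, grads)]
    new_tensors = [
        [per_parameter_update(p, g, c.mlp_weights()) for p, g in zip(t, gs)]
        for t, gs, c in zip(tensors, grads, controllers)
    ]
    # Assumed global context: average hidden state, shared at the next step.
    new_context = sum(hiddens) / len(hiddens)
    return new_tensors, new_context


# Two "tensors" with toy loss sum(p^2), so the gradient of each entry is 2p.
tensors = [[1.0, -2.0], [0.5]]
controllers = [TensorController() for _ in tensors]
context = 0.0
for _ in range(5):
    grads = [[2 * p for p in t] for t in tensors]
    tensors, context = optimizer_step(tensors, grads, controllers, context)
```

In this toy, every entry of a tensor shares the weights produced by that tensor's controller, and the controllers only see each other through the shared context value.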

For training, the learned optimizer is meta-trained: it takes parameter values and gradients as input and outputs the parameter updates.

Introduction

What is the problem with meta-training?

In meta-learning, the dataset (that is, a large collection of tasks) is hard to collect, unlike image or text data: "In meta-learning, a large training dataset corresponds to a large set of tasks, which are representative of the tasks a practitioner might want to optimize. Unlike image and text data that can be gathered from the internet, there is no standardized or automated way to collect these tasks."

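What is a learned update rule?

Recall SGD, where the optimizer's update has a fixed form: next parameter = previous parameter - gradient × learning rate (the learning rate need not be constant, but it is always a hyperparameter). In a learned update rule, we instead parameterize this update function, turning it into a learnable neural network with meta-parameters $\theta$ that takes gradient information as input: $U(g, \ldots; \theta)$.

Beyond the gradient, other information such as the loss and the current parameter values can also be passed in as inputs.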
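The contrast between the two kinds of rule can be sketched in a few lines. The example below is only an illustration under stated assumptions: the learned rule is a tiny one-hidden-layer network applied to a single scalar parameter, the layout of `theta` is hypothetical, and its values are picked by hand rather than meta-trained.

```python
import math


def sgd_update(param, grad, lr=0.1):
    """Fixed-form rule: next = current - lr * grad, with lr set by hand."""
    return param - lr * grad


def learned_update(param, grad, loss, theta):
    """Toy learned rule U(g, ...; theta) for one scalar parameter.

    Inputs are the gradient, the current parameter value, and the loss.
    theta holds the meta-parameters; this layout is hypothetical.
    """
    features = [grad, param, loss]
    hidden = [
        math.tanh(sum(w * f for w, f in zip(row, features)) + b)
        for row, b in zip(theta["w1"], theta["b1"])
    ]
    step = sum(w * h for w, h in zip(theta["w2"], hidden)) + theta["b2"]
    return param - step


# Hand-picked meta-parameters, for illustration only (not meta-trained).
# With these values the rule reduces to: step = 0.1 * tanh(grad).
theta = {
    "w1": [[1.0, 0.0, 0.0], [0.0, 0.0, 0.0]],
    "b1": [0.0, 0.0],
    "w2": [0.1, 0.0],
    "b2": 0.0,
}

# Toy problem: loss(x) = x^2, so grad = 2x.
x_sgd = x_learned = 1.0
for _ in range(20):
    x_sgd = sgd_update(x_sgd, 2 * x_sgd)
    x_learned = learned_update(x_learned, 2 * x_learned, x_learned ** 2, theta)
```

Both rules move `x` toward the minimum at 0. The difference is where the behavior comes from: `sgd_update` is fixed apart from `lr`, while `learned_update` is determined entirely by the contents of `theta`.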

What is meta-training?

Meta-training is the process of finding the (meta-)parameters θ of the update rule U(·; θ) such that the resulting optimizer performs well on some specified meta-objective.
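The definition above can be illustrated with a deliberately small example. This is a sketch under assumptions, not the paper's procedure: the update rule is just U(g; θ) = θ · g with a single scalar meta-parameter, the tasks are one-dimensional quadratics, the meta-objective is the average loss after a fixed number of inner steps, and the meta-gradient is estimated with central finite differences purely for simplicity.

```python
def inner_loss(x, curvature):
    """Task loss: a 1-D quadratic with the given curvature."""
    return 0.5 * curvature * x * x


def meta_objective(theta, tasks, steps=10):
    """Average final loss after applying U(g; theta) = theta * g on each task."""
    total = 0.0
    for curvature in tasks:
        x = 1.0
        for _ in range(steps):
            grad = curvature * x
            x = x - theta * grad  # the update rule being meta-trained
        total += inner_loss(x, curvature)
    return total / len(tasks)


def meta_train(theta, tasks, meta_lr=0.01, meta_steps=200, eps=1e-4):
    """Gradient descent on the meta-objective with a finite-difference gradient."""
    for _ in range(meta_steps):
        meta_grad = (
            meta_objective(theta + eps, tasks) - meta_objective(theta - eps, tasks)
        ) / (2 * eps)
        theta -= meta_lr * meta_grad
    return theta


tasks = [0.5, 1.0, 2.0]  # hypothetical task set: curvatures of the quadratics
theta0 = 0.01            # initial meta-parameter
theta = meta_train(theta0, tasks)
```

Here the outer loop adjusts `theta` so that the inner optimization ends with a lower loss across all tasks, which is the sense in which the optimizer "performs well on the meta-objective".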
