WarmupLinearSchedule为什么先升后降

最新推荐文章于 2022-06-13 15:07:25 发布

想念@思恋

最新推荐文章于 2022-06-13 15:07:25 发布

阅读量1.3k

点赞数

分类专栏： python编程 pytorch 文章标签： python

本文链接：https://blog.csdn.net/tailonh/article/details/120392713

版权

python编程同时被 2 个专栏收录

139 篇文章 10 订阅

订阅专栏

pytorch

47 篇文章 2 订阅

订阅专栏

#num_train_optimization_steps = int(len(train_data) / args.train_batch_size / args.gradient_accumulation_steps) * args.num_train_epochs

class WarmupLinearSchedule(_LRSchedule):
    """
    Linearly increases learning rate from 0 to 1 over `warmup` fraction of training steps.
    Linearly decreases learning rate from 1. to 0. over remaining `1 - warmup` steps.
    """
    warn_t_total = True

    def get_lr_(self, progress):
    	# progerss从0开始
        if progress < self.warmup:
            return progress / self.warmup
        return max((progress - 1.) / (self.warmup - 1.), 0.)

设self.warmup=0.3，表示前30%的epoch学习率逐步上升，后70%逐步下降。
progress=0.1时， progress / self.warmup=1/3
progress=0.2时， progress / self.warmup=2/3
progress=0.3时， progress / self.warmup=3/3
progress=0.4时， (progress - 1.) / (self.warmup - 1.)=6/7
progress=0.5时， (progress - 1.) / (self.warmup - 1.)=5/7
progress=0.6时， (progress - 1.) / (self.warmup - 1.)=4/7
progress=0.7时， (progress - 1.) / (self.warmup - 1.)=3/7
progress=0.8时， (progress - 1.) / (self.warmup - 1.)=2/7
progress=0.9时， (progress - 1.) / (self.warmup - 1.)=1/7
progress / self.warmup和(progress - 1.) / (self.warmup - 1.)相当于控制学习率增长的一个系数。

# 在BertAdam中，group['schedule'].get_lr(state['step'])就会调用WarmupLinearSchedule函数，前提是使用线性的Schedule
lr_scheduled = group['lr']
lr_scheduled *= group['schedule'].get_lr(state['step'])

想念@思恋

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
WarmupLinearSchedule为什么先升后降

#num_train_optimization_steps = int(len(train_data) / args.train_batch_size / args.gradient_accumulation_steps) * args.num_train_epochsclass WarmupLinearSchedule(_LRSchedule): """ Linearly increases learning rate from 0 to 1 over `warmup` fractio
复制链接

扫一扫

专栏目录