随机权值平均优化器SWA(Stochastic Weight Averaging)简介

最新推荐文章于 2024-04-08 08:54:05 发布

Gallant Hu

最新推荐文章于 2024-04-08 08:54:05 发布

阅读量2.1k

点赞数 2

分类专栏：机器学习二

本文链接：https://blog.csdn.net/weixin_42108090/article/details/108541516

版权

机器学习二专栏收录该内容

18 篇文章 1 订阅

订阅专栏

SWA is a simple procedure that improves generalization in deep learning over Stochastic Gradient Descent (SGD) at no additional cost, and can be used as a drop-in replacement for any other optimizer in PyTorch. SWA has a wide range of applications and features:
SWA（随机权值平均）是一种通过梯度下降改善深度学习泛化能力的方法，而且不会要求额外的计算量，可以用到Pytorch的优化器中。

from torchcontrib.optim import SWA

...
...

# training loop
base_opt = torch.optim.SGD(model.parameters(), lr=0.1)
opt = torchcontrib.optim.SWA(base_opt, swa_start=10, swa_freq=5, swa_lr=0.05)
for _ in range(100):
     opt.zero_grad()
     loss_fn(model(input), target).backward()
     opt.step()
opt.swap_swa_sgd()

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

Gallant Hu

关注关注

2
点赞
踩
8

收藏

觉得还不错? 一键收藏
打赏
1
评论
随机权值平均优化器SWA(Stochastic Weight Averaging)简介

SWA is a simple procedure that improves generalization in deep learning over Stochastic Gradient Descent (SGD) at no additional cost, and can be used as a drop-in replacement for any other optimizer in PyTorch. SWA has a wide range of applications and feat
复制链接

扫一扫