【Pytorch】tensor初始化方法

最新推荐文章于 2023-12-13 17:41:24 发布

hello_dear_you

最新推荐文章于 2023-12-13 17:41:24 发布

阅读量3.7k

点赞数

分类专栏： # pytorch 文章标签： pytorch init

本文链接：https://blog.csdn.net/hello_dear_you/article/details/102483176

版权

pytorch 专栏收录该内容

8 篇文章 1 订阅

订阅专栏

1. 导入常用初始化方法

from torch.nn.init import xavier_uniform_, xavier_normal_
from torch.nn.init import kaiming_uniform_, kaiming_normal_

2. 各种初始化方法分析

xavier_uniform_(tensor, gain=1.0)

Note: 以均匀分布的值初始化输入tensor. 方法根据《Understanding the difficulty of training deep feedforward neural networks - Glorot, X. & Bengio, Y. (2010)》论文实现。最终得到的Tesor值取样于U(−a,a) ，

其中： $a = gain \ast \sqrt{6 \div fanin + fanout}$ \

参数：

gain: 缩放因素(optional)

xavier_normal_(tensor, gain=1.0)

Note: 以正太分布的值初始化输入tensor. 方法根据《Understanding the difficulty of training deep feedforward neural networks - Glorot, X. & Bengio, Y. (2010)》论文实现。最终得到的Tesor值取样于 $N(0, std^{2})$ ,

其中： $std = gain \ast \sqrt{2 \div fanin + fanout}$

kaiming_uniform_(tensor, a=0, mode='fan_in', nonlinearity='leaky_relu')

Note: 以均匀分布的值初始化输入tensor. 方法根据《Delving deep into rectifiers: Surpassing human-level performance on ImageNet classification - He, K. et al. (2015)》论文实现。最终得到的Tesor值取样于U(−bound,bound) ，

其中： $bound = \sqrt{6 \div (1 + a^{2}) * fanin}$

参数：a:

mode: "fan_in" 或 "fan_out". 选择“fan_in" 在前向传播中保存权重方差的幅度， ”fan_out" 在后向传播中保存幅度。

nonlinearity: 非线性函数。推荐"relu" or "leaky_relu".

kaiming_normal_(tensor, a=0, mode='fan_in', nonlinearity='leaky_relu')

Note: 以正太分布的值初始化输入tensor. 方法根据《Delving deep into rectifiers: Surpassing human-level performance on ImageNet classification - He, K. et al. (2015)》论文实现。最终得到的Tesor值取样于 $N(0, std^{2})$ ，

其中： $std = \sqrt{2 \div fanin × (1 + a^{2})}$

hello_dear_you

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
【Pytorch】tensor初始化方法

1. 导入常用初始化方法from torch.nn.init import xavier_uniform_, xavier_normal_from torch.nn.init import kaiming_uniform_, kaiming_normal_2. 各种初始化方法分析xavier_uniform_(tensor,gain=1.0)Note: 以均匀分布的值初始化输入...
复制链接

扫一扫

专栏目录