深度学习的Xavier初始化方法

最新推荐文章于 2022-08-19 16:28:36 发布

路虽远在路上

最新推荐文章于 2022-08-19 16:28:36 发布

阅读量1.6w

点赞数 2

分类专栏：机器学习

本文链接：https://blog.csdn.net/u010185894/article/details/71104387

版权

机器学习专栏收录该内容

12 篇文章 0 订阅

订阅专栏

在tensorflow中，有一个初始化函数：tf.contrib.layers.variance_scaling_initializer。Tensorflow 官网的介绍为：

variance_scaling_initializer(
    factor=2.0,
    mode='FAN_IN',
    uniform=False,
    seed=None,
    dtype=tf.float32
)

Returns an initializer that generates tensors without scaling variance.

When initializing a deep network, it is in principle advantageous to keep the scale of the input variance constant, so it does not explode or diminish by reaching the final layer. This initializer use the following formula:

  if mode='FAN_IN': # Count only number of input connections.
    n = fan_in
  elif mode='FAN_OUT': # Count only number of output connections.
    n = fan_out
  elif mode='FAN_AVG': # Average number of inputs and output connections.
    n = (fan_in + fan_out)/2.0

    truncated_normal(shape, 0.0, stddev=sqrt(factor / n))

这段话可以理解为，通过使用这种初始化方法，我们能够保证输入变量的变化尺度不变，从而避免变化尺度在最后一层网络中爆炸或者弥散。

这个方法就是 Xavier 初始化方法，可以从以下这两篇论文去了解这个方法：

或者可以通过这些文章去了解：

路虽远在路上

关注

2
点赞
踩
13

收藏

觉得还不错? 一键收藏
4
评论
深度学习的Xavier初始化方法

在tensorflow中，有一个初始化函数：tf.contrib.layers.variance_scaling_initializer。Tensorflow 官网的介绍为：variance_scaling_initializer( factor=2.0, mode='FAN_IN', uniform=False, seed=None, dtype=tf.fl
复制链接

扫一扫

专栏目录