All you need is a good init. If you can’t find the good init, use Batch Normalization.
训练过程--对付梯度弥散/梯度消失
最新推荐文章于 2022-12-19 16:12:10 发布
All you need is a good init. If you can’t find the good init, use Batch Normalization.