Deeplearning.ai - 浅层神经网络

最新推荐文章于 2024-06-12 08:36:39 发布

纫秋兰以为佩

最新推荐文章于 2024-06-12 08:36:39 发布

阅读量351

点赞数

分类专栏： Deeplearning.ai

本文链接：https://blog.csdn.net/sinat_27421407/article/details/79627449

版权

32 篇文章 0 订阅

订阅专栏

15 篇文章 0 订阅

订阅专栏

神经网络和深度学习
吴恩达 Andrew Ng

这里写图片描述

这里写图片描述

圆括号表示样本，方括号表示层数
Horizontal(横向), the matrix A goes over different training examples. Vertically(竖向), the different hidden unit.

$\sigma(z)$ 常用于二元分类问题
$tanh(z)=\frac{e^z-e^{-z}}{e^z+e^{-z}}$ （效果常常比 $\sigma(z)$ 好）
z 很大或者很小时， $、tanh(z)、\sigma(z)$ 的斜率接近于0，会拖慢梯度下降法（梯度弥散）
不同层的激活函数可以不同
ReLU(z)=max(0,z)
- 在 0 处不可导，但在实验中 z 极少取到 0
- 负数区域时神经元不会训练， $Leaky\ ReLU()$ 可避免这一问题
- 通常学习速度优于前两个激活函数
- $Leaky\ ReLU(z)=max(0.01z,z)$ （系数可以自定义）
identity activation function 恒等激活函数 $g(z)=z$
If using linear activation functions, then the neural network is just outputting a linear function of the input.
No matter how many layers your neural network has, it is just computing a linear activation function. There is no need to have the hidden layers.
两个线性函数的组合仍然是线性函数
线性函数通常只在输出层使用，隐藏层几乎都使用非线性函数