Class1-Week4-Deep Neural Network

Compute Process

Forward Propagation

Layer-l:

  • Input: $A^{[l-1]}$
  • Compute Process:
    $Z^{[l]} = W^{[l]} A^{[l-1]} + b^{[l]}$
    $A^{[l]} = g(Z^{[l]})$
  • Output: $A^{[l]}$
  • Cache: $Z^{[l]}, W^{[l]}, b^{[l]}$
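
A minimal NumPy sketch of this step (the function names and the ReLU choice of $g$ are illustrative assumptions, not fixed by the notes). $A^{[l-1]}$ is cached as well, since $dW^{[l]}$ needs it during backward propagation:

```python
import numpy as np

def relu(Z):
    """Example activation g; sigmoid, tanh, etc. fit the same pattern."""
    return np.maximum(0, Z)

def forward_step(A_prev, W, b, g=relu):
    """Forward propagation for layer l: Z = W A_prev + b, A = g(Z)."""
    Z = W @ A_prev + b         # (n_l, m)
    A = g(Z)                   # (n_l, m)
    cache = (Z, W, b, A_prev)  # A_prev cached too: dW needs it in backprop
    return A, cache
```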

Backward Propagation

Layer-l:

  • Input: $dA^{[l]}$
  • Compute Process:
    $dZ^{[l]} = dA^{[l]} * g'(Z^{[l]})$
    $dW^{[l]} = \frac{1}{m}\, dZ^{[l]} A^{[l-1]T}$
    $db^{[l]} = \frac{1}{m}\, \text{np.sum}(dZ^{[l]},\ \text{axis=1},\ \text{keepdims=True})$
    $dA^{[l-1]} = W^{[l]T} dZ^{[l]}$
  • Output: $dA^{[l-1]}$
  • Update:
    $W^{[l]} = W^{[l]} - \alpha\, dW^{[l]}$
    $b^{[l]} = b^{[l]} - \alpha\, db^{[l]}$
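
A matching NumPy sketch, assuming ReLU as $g$ (so $g'(Z)$ is $1$ where $Z > 0$ and $0$ elsewhere) and the cache layout from the forward sketch above:

```python
import numpy as np

def backward_step(dA, cache, alpha):
    """Backward propagation for layer l plus the gradient-descent update."""
    Z, W, b, A_prev = cache
    m = A_prev.shape[1]
    dZ = dA * (Z > 0)                                 # dA * g'(Z) for ReLU
    dW = (1 / m) * dZ @ A_prev.T                      # (n_l, n_{l-1})
    db = (1 / m) * np.sum(dZ, axis=1, keepdims=True)  # (n_l, 1)
    dA_prev = W.T @ dZ                                # (n_{l-1}, m)
    W = W - alpha * dW                                # W := W - alpha dW
    b = b - alpha * db                                # b := b - alpha db
    return dA_prev, W, b
```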

Matrix Dimensions

Layer-l:

Each gradient has the same shape as the quantity it differentiates:

$$
\begin{aligned}
dW^{[l]} = W^{[l]} &: (n^{[l]}, n^{[l-1]}) \\
db^{[l]} = b^{[l]} &: (n^{[l]}, 1) \\
dZ^{[l]} = Z^{[l]} &: (n^{[l]}, m) \\
dA^{[l]} = A^{[l]} &: (n^{[l]}, m)
\end{aligned}
$$
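
These shapes can be verified with a quick check (the sizes $n^{[l-1]}=4$, $n^{[l]}=3$, $m=5$ below are arbitrary):

```python
import numpy as np

n_prev, n_l, m = 4, 3, 5             # arbitrary: n^{[l-1]}, n^{[l]}, batch size m
A_prev = np.random.randn(n_prev, m)  # (n^{[l-1]}, m)
W = np.random.randn(n_l, n_prev)     # (n^{[l]}, n^{[l-1]})
b = np.zeros((n_l, 1))               # (n^{[l]}, 1), broadcast across the m columns
Z = W @ A_prev + b
assert Z.shape == (n_l, m)           # (n^{[l]}, m); A = g(Z) keeps the same shape
```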


Parameters vs Hyperparameters

Definition

In machine learning, a hyperparameter is a parameter whose value is set before the learning process begins. By contrast, the values of other parameters are derived via training.
1. Parameters: $W$, $b$
2. Hyperparameters:
- Learning rate $\alpha$ – a suitable learning rate can be chosen by plotting cost against iterations for several candidate rates and comparing the curves (see the sketch under "Tuning Hyperparameters" below)
- Number of iterations
- Network architecture
- Activation functions
- …

Tuning Hyperparameters
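
A minimal sketch of the cost-vs-iterations comparison mentioned above, using a toy logistic-regression problem (the synthetic data and the candidate rates 0.01/0.1/1.0 are arbitrary choices for illustration):

```python
import numpy as np
import matplotlib.pyplot as plt

# Toy logistic-regression problem (synthetic data) just to produce cost curves.
rng = np.random.default_rng(0)
X = rng.standard_normal((2, 200))                   # 2 features, m = 200 examples
Y = (X[0] + X[1] > 0).astype(float).reshape(1, -1)  # labels in {0, 1}

def train(alpha, iterations=500):
    """Gradient descent with learning rate alpha; returns the cost per iteration."""
    W, b = np.zeros((1, 2)), 0.0
    costs = []
    for _ in range(iterations):
        A = 1 / (1 + np.exp(-(W @ X + b)))          # sigmoid forward pass
        costs.append(float(np.mean(-(Y * np.log(A + 1e-8)
                                     + (1 - Y) * np.log(1 - A + 1e-8)))))
        dZ = A - Y                                  # gradient of cost w.r.t. Z
        W = W - alpha * dZ @ X.T / X.shape[1]
        b = b - alpha * float(np.mean(dZ))
    return costs

# Plot the curves; pick the rate whose cost decreases quickly and smoothly.
for alpha in (0.01, 0.1, 1.0):
    plt.plot(train(alpha), label=f"alpha = {alpha}")
plt.xlabel("iterations")
plt.ylabel("cost")
plt.legend()
plt.show()
```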
