deeplearning course-01-03 Shallow Neural Network
@(Study Notes)
These are my notes on Andrew Ng's DeepLearning course. Links: Coursera link, NetEase Cloud Classroom link.
I personally recommend the Coursera version: it is paid, but it has exercises you can work through.
PS: if you spot any errors, corrections are welcome!
1. Network representation:
$$z_1^{[1]} = w_1^{[1]T} x + b_1^{[1]}$$
Here, the superscript [1] is the layer index, and the subscript 1 denotes the first neuron of that layer.
![figure](//img-blog.csdn.net/20180315144652564?watermark/2/text/Ly9ibG9nLmNzZG4ubmV0L01PTktFWTMyMzM=/font/5a6L5L2T/fontsize/400/fill/I0JBQkFCMA==/dissolve/70)
A single neuron: the circle represents two computations (the linear step and the activation).
Vectorization: columns index training examples, rows index neurons.
![figure](//img-blog.csdn.net/20180315144807818?watermark/2/text/Ly9ibG9nLmNzZG4ubmV0L01PTktFWTMyMzM=/font/5a6L5L2T/fontsize/400/fill/I0JBQkFCMA==/dissolve/70)
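A minimal NumPy sketch of this vectorized layout; the sizes `n_x`, `n_h`, `m` here are made up for illustration:

```python
import numpy as np

# Columns of X index training examples; rows of Z1 index neurons.
n_x, n_h, m = 3, 4, 5          # input features, hidden neurons, examples (hypothetical)
X  = np.random.randn(n_x, m)
W1 = np.random.randn(n_h, n_x)
b1 = np.zeros((n_h, 1))

Z1 = W1 @ X + b1               # broadcasting adds b1 to every column
print(Z1.shape)                # (4, 5): 4 neurons x 5 examples
```

The broadcast of the `(n_h, 1)` bias across all `m` columns is what lets one matrix product replace the per-example loop.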
2. Activation functions
sigmoid function: used for binary classification; its output lies in (0, 1)
$$\mathrm{sigmoid}(x) = \frac{1}{1 + e^{-x}}$$
![figure](//img-blog.csdn.net/20180315144830273?watermark/2/text/Ly9ibG9nLmNzZG4ubmV0L01PTktFWTMyMzM=/font/5a6L5L2T/fontsize/400/fill/I0JBQkFCMA==/dissolve/70)
tanh function:
$$\tanh(x) = \frac{e^{x} - e^{-x}}{e^{x} + e^{-x}}$$
ReLU function:
$$\mathrm{relu}(x) = \begin{cases} 0 & x < 0 \\ x & x \ge 0 \end{cases}$$
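The three activations above can be sketched in a few lines of NumPy:

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))   # output in (0, 1)

def tanh(x):
    return np.tanh(x)                 # output in (-1, 1), zero-centered

def relu(x):
    return np.maximum(0, x)           # 0 for x < 0, x otherwise

x = np.array([-2.0, 0.0, 2.0])
print(sigmoid(0.0))   # 0.5
print(relu(x))        # [0. 0. 2.]
```

Because tanh is zero-centered it is usually preferred over sigmoid for hidden layers, while sigmoid is kept for the output layer in binary classification.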
Single-hidden-layer network formulas:
Logistic regression did not work well on the “flower dataset”. You are going to train a Neural Network with a single hidden layer.
Mathematically:
For one example $x^{(i)}$:
$$z^{[1](i)} = W^{[1]} x^{(i)} + b^{[1]} \tag{1}$$
$$a^{[1](i)} = \tanh\left(z^{[1](i)}\right) \tag{2}$$
$$z^{[2](i)} = W^{[2]} a^{[1](i)} + b^{[2]} \tag{3}$$
$$\hat{y}^{(i)} = a^{[2](i)} = \sigma\left(z^{[2](i)}\right) \tag{4}$$
$$y^{(i)}_{\text{prediction}} = \begin{cases} 1 & \text{if } a^{[2](i)} > 0.5 \\ 0 & \text{otherwise} \end{cases} \tag{5}$$
Given the predictions on all the examples, you can also compute the cost $J$ as follows:
$$J = -\frac{1}{m} \sum_{i=1}^{m} \left( y^{(i)} \log a^{[2](i)} + \left(1 - y^{(i)}\right) \log\left(1 - a^{[2](i)}\right) \right) \tag{6}$$
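The forward pass (1)–(5) and the cost can be sketched for a whole batch at once; the network sizes and random data below are illustrative, not the course's actual flower dataset:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

np.random.seed(1)
n_x, n_h, m = 2, 4, 6                      # hypothetical sizes
X = np.random.randn(n_x, m)                # X: (n_x, m), columns are examples
Y = (np.random.rand(1, m) > 0.5).astype(float)  # labels: (1, m)

W1 = np.random.randn(n_h, n_x) * 0.01
b1 = np.zeros((n_h, 1))
W2 = np.random.randn(1, n_h) * 0.01
b2 = np.zeros((1, 1))

Z1 = W1 @ X + b1                           # equation (1)
A1 = np.tanh(Z1)                           # equation (2)
Z2 = W2 @ A1 + b2                          # equation (3)
A2 = sigmoid(Z2)                           # equation (4)
pred = (A2 > 0.5).astype(int)              # equation (5)
# cross-entropy cost, averaged over the m examples
cost = -np.mean(Y * np.log(A2) + (1 - Y) * np.log(1 - A2))
print(A2.shape, pred.shape)                # (1, 6) (1, 6)
```

With near-zero initial weights, `A2` is close to 0.5 everywhere and the cost is close to $\log 2 \approx 0.693$, which is a handy sanity check before training.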
Notes:
1. Keep dimensions

    db1 = np.sum(dZ1, axis=1, keepdims=True) / m

I previously omitted the `keepdims` argument, which led to a wrong result in the computation.
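A quick demonstration of why `keepdims` matters: without it, `np.sum` drops the summed axis and the column-vector shape of the gradient is lost. The `dZ1` array below is a made-up stand-in for the real gradient:

```python
import numpy as np

dZ1 = np.ones((4, 6))                     # hypothetical gradient: 4 neurons x 6 examples
a = np.sum(dZ1, axis=1)                   # shape (4,)  -- 1-D, breaks later broadcasting
b = np.sum(dZ1, axis=1, keepdims=True)    # shape (4, 1) -- stays a column vector
print(a.shape, b.shape)                   # (4,) (4, 1)
```

A 1-D `(4,)` array broadcasts against a `(4, m)` matrix row-wise instead of column-wise, which silently corrupts the parameter update.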
2. Dimension formulas
$$W^{[l]}: \left(n^{[l]}, n^{[l-1]}\right)$$
$$b^{[l]}: \left(n^{[l]}, 1\right)$$
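These shape rules can be checked mechanically when initializing parameters; the 2-4-1 layer sizes below are an arbitrary example:

```python
import numpy as np

layer_sizes = [2, 4, 1]   # n[0]=2 inputs, n[1]=4 hidden units, n[2]=1 output (hypothetical)
params = {}
for l in range(1, len(layer_sizes)):
    # W[l]: (n[l], n[l-1]); b[l]: (n[l], 1)
    params[f"W{l}"] = np.random.randn(layer_sizes[l], layer_sizes[l - 1]) * 0.01
    params[f"b{l}"] = np.zeros((layer_sizes[l], 1))

for l in range(1, len(layer_sizes)):
    print(f"W{l}: {params[f'W{l}'].shape}  b{l}: {params[f'b{l}'].shape}")
# W1: (4, 2)  b1: (4, 1)
# W2: (1, 4)  b2: (1, 1)
```

Printing the shapes like this before training catches most dimension bugs early, since every matrix product in the forward pass depends on these rules holding.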