ML Notes: Week 4 - Neural Networks: Representation

1. Model representation

1.1 Neural network model

[Figure: a biological neuron]
A typical neuron has input wires called dendrites and an output wire called an axon. The nucleus can be viewed as the computational unit. We can simplify this into the following model:
[Figure: a single artificial neuron]
Terms: a neuron (artificial neuron) with a sigmoid (logistic) activation function.

1.2 Some notations in the neural networks

[Figure: a three-layer neural network]
Layer 1: input layer
Layer 2: hidden layer (every layer that is neither the input layer nor the output layer is called a hidden layer)
Layer 3: output layer


  • $x$ is the vector of inputs and $\Theta$ is the matrix of parameters; $\Theta$ is also called the weights.
  • $\Theta^{(j)}$ = matrix of weights mapping from layer $j$ to layer $j+1$. If the network has $s_j$ units in layer $j$ and $s_{j+1}$ units in layer $j+1$, then $\Theta^{(j)}$ has dimension $s_{j+1} \times (s_j + 1)$. For example, with $s_1 = 3$ input units and $s_2 = 3$ hidden units, $\Theta^{(1)}$ is a $3 \times 4$ matrix (the $+1$ accounts for the bias unit).
  • $a_i^{(j)}$ = "activation" of unit $i$ in layer $j$.

$$
\begin{aligned}
a_1^{(2)} &= g(\Theta_{10}^{(1)}x_0 + \Theta_{11}^{(1)}x_1 + \Theta_{12}^{(1)}x_2 + \Theta_{13}^{(1)}x_3) \\
a_2^{(2)} &= g(\Theta_{20}^{(1)}x_0 + \Theta_{21}^{(1)}x_1 + \Theta_{22}^{(1)}x_2 + \Theta_{23}^{(1)}x_3) \\
a_3^{(2)} &= g(\Theta_{30}^{(1)}x_0 + \Theta_{31}^{(1)}x_1 + \Theta_{32}^{(1)}x_2 + \Theta_{33}^{(1)}x_3) \\
h_\Theta(x) = a_1^{(3)} &= g(\Theta_{10}^{(2)}a_0^{(2)} + \Theta_{11}^{(2)}a_1^{(2)} + \Theta_{12}^{(2)}a_2^{(2)} + \Theta_{13}^{(2)}a_3^{(2)})
\end{aligned}
$$
* $g$ is the sigmoid/logistic activation function, $g(z) = \frac{1}{1+e^{-z}}$.

1.3 Forward propagation in a neural network

Computing the activations layer by layer as in the figure above, from the input layer to the hidden layer to the output layer, is called forward propagation.

Now we will vectorize the model. We define
$$
\begin{aligned}
z_1^{(2)} &= \Theta_{10}^{(1)}x_0 + \Theta_{11}^{(1)}x_1 + \Theta_{12}^{(1)}x_2 + \Theta_{13}^{(1)}x_3 \\
z_2^{(2)} &= \Theta_{20}^{(1)}x_0 + \Theta_{21}^{(1)}x_1 + \Theta_{22}^{(1)}x_2 + \Theta_{23}^{(1)}x_3 \\
z_3^{(2)} &= \Theta_{30}^{(1)}x_0 + \Theta_{31}^{(1)}x_1 + \Theta_{32}^{(1)}x_2 + \Theta_{33}^{(1)}x_3
\end{aligned}
$$
we can rewrite this as $z^{(2)} = [z_1^{(2)} \; z_2^{(2)} \; z_3^{(2)}]^T = \Theta^{(1)} x$. If we treat $x$ as $a^{(1)}$, then $z^{(2)} = \Theta^{(1)} a^{(1)}$.
In general, $z^{(j+1)} = \Theta^{(j)} a^{(j)}$, where $a^{(j)}$ includes the bias unit $a_0^{(j)} = 1$.

And $a_1^{(2)} = g(z_1^{(2)}),\; a_2^{(2)} = g(z_2^{(2)}),\; a_3^{(2)} = g(z_3^{(2)})$ can be written as $a^{(2)} = g(z^{(2)})$, where $g$ is applied element-wise.
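As a minimal sketch (not from the original notes), forward propagation for the network above — 3 input units, 3 hidden units, 1 output unit — could be written in NumPy as follows; the weight values are arbitrary placeholders:

```python
import numpy as np

def sigmoid(z):
    # g(z) = 1 / (1 + e^{-z}), applied element-wise
    return 1.0 / (1.0 + np.exp(-z))

def forward_propagate(x, Theta1, Theta2):
    # x: input vector, shape (3,); Theta1: shape (3, 4); Theta2: shape (1, 4)
    a1 = np.concatenate(([1.0], x))             # add bias unit x_0 = 1 -> a^{(1)}, shape (4,)
    z2 = Theta1 @ a1                            # z^{(2)} = Theta^{(1)} a^{(1)}, shape (3,)
    a2 = np.concatenate(([1.0], sigmoid(z2)))   # a^{(2)} = g(z^{(2)}) plus bias a_0^{(2)} = 1
    z3 = Theta2 @ a2                            # z^{(3)} = Theta^{(2)} a^{(2)}, shape (1,)
    return sigmoid(z3)                          # h_Theta(x) = a^{(3)}

# Example with arbitrary (illustrative) weights:
Theta1 = np.random.randn(3, 4)
Theta2 = np.random.randn(1, 4)
print(forward_propagate(np.array([1.0, 0.5, -2.0]), Theta1, Theta2))
```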

For the neural network above, if we cover up the input layer, what remains looks just like logistic regression.
[Figure: the output layer viewed as logistic regression on the hidden-layer activations]
Logistic regression: $h_\theta(x) = g(\theta_0 + \theta_1 x_1 + \theta_2 x_2)$
The simplified neural network model: $h_\Theta(x) = g(\Theta_{10}^{(2)}a_0^{(2)} + \Theta_{11}^{(2)}a_1^{(2)} + \Theta_{12}^{(2)}a_2^{(2)} + \Theta_{13}^{(2)}a_3^{(2)})$

1.4 Other network architectures

[Figure: networks with more than one hidden layer]
A network can have several hidden layers; the activations of each layer serve as the inputs to the next layer, and the final layer produces $h_\Theta(x)$.

2. How to compute a complex nonlinear function?

$x_1, x_2 \in \{0, 1\}$

2.1 AND

$y = x_1$ AND $x_2$
[Figure: a single sigmoid unit computing AND]
$\Theta^{(1)} = \begin{bmatrix} -30 & 20 & 20 \end{bmatrix}$
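As a quick numerical check (a NumPy sketch, not part of the original notes), plugging all four input combinations into $g(-30 + 20x_1 + 20x_2)$ reproduces the AND truth table, since $g(z)$ is close to 0 for $z \le -10$ and close to 1 for $z \ge 10$:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

theta = np.array([-30.0, 20.0, 20.0])  # weights of the AND unit

for x1 in (0, 1):
    for x2 in (0, 1):
        h = sigmoid(theta @ np.array([1.0, x1, x2]))  # bias input x_0 = 1
        print(x1, x2, int(h > 0.5))   # prints 0 0 0, 0 1 0, 1 0 0, 1 1 1
```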

2.2 OR

$y = x_1$ OR $x_2$
[Figure: a single sigmoid unit computing OR]
$\Theta^{(1)} = \begin{bmatrix} -10 & 20 & 20 \end{bmatrix}$

2.3 NOT

$y =$ NOT $x_1$
[Figure: a single sigmoid unit computing NOT]
$\Theta^{(1)} = \begin{bmatrix} 10 & -20 \end{bmatrix}$

2.4 (NOT $x_1$) AND (NOT $x_2$)

[Figure: a single sigmoid unit computing (NOT $x_1$) AND (NOT $x_2$)]
$\Theta^{(1)} = \begin{bmatrix} 10 & -20 & -20 \end{bmatrix}$

2.5 XNOR

$y = (x_1$ AND $x_2)$ OR ((NOT $x_1$) AND (NOT $x_2$))
[Figure: a two-layer network computing XNOR from the AND, (NOT $x_1$) AND (NOT $x_2$), and OR units]

* We can put these pieces together to build more complex nonlinear functions, as the sketch below shows.
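A small sketch (assuming NumPy; the weight vectors are the ones listed above) of how the AND, (NOT $x_1$) AND (NOT $x_2$), and OR units compose into XNOR:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def unit(theta, inputs):
    # one sigmoid unit: g(theta^T [1; inputs]), with bias input 1 prepended
    return sigmoid(theta @ np.concatenate(([1.0], inputs)))

theta_and      = np.array([-30.0,  20.0,  20.0])   # x1 AND x2
theta_not_both = np.array([ 10.0, -20.0, -20.0])   # (NOT x1) AND (NOT x2)
theta_or       = np.array([-10.0,  20.0,  20.0])   # a1 OR a2

def xnor(x1, x2):
    a1 = unit(theta_and,      np.array([x1, x2]))  # hidden unit 1
    a2 = unit(theta_not_both, np.array([x1, x2]))  # hidden unit 2
    return unit(theta_or, np.array([a1, a2]))      # output unit

for x1 in (0, 1):
    for x2 in (0, 1):
        print(x1, x2, int(xnor(x1, x2) > 0.5))     # 1 when x1 == x2, else 0
```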


3. Multi-class Classification

The output $y_i$ will be one of $\begin{bmatrix} 1 \\ 0 \\ 0 \\ 0 \end{bmatrix}$, $\begin{bmatrix} 0 \\ 1 \\ 0 \\ 0 \end{bmatrix}$, $\begin{bmatrix} 0 \\ 0 \\ 1 \\ 0 \end{bmatrix}$, $\begin{bmatrix} 0 \\ 0 \\ 0 \\ 1 \end{bmatrix}$, depending on which class the corresponding input $X_i$ belongs to. The output layer therefore has one unit per class, and in this way we can implement multi-class classification.
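For illustration, a minimal sketch (helper names are made up) of the one-hot target encoding and of reading off the predicted class from the network's 4-dimensional output:

```python
import numpy as np

NUM_CLASSES = 4

def one_hot(label):
    # label in {0, 1, 2, 3} -> 4-dimensional target vector y_i
    y = np.zeros(NUM_CLASSES)
    y[label] = 1.0
    return y

def predict_class(h):
    # h: 4-dimensional output of the network, one sigmoid unit per class;
    # the predicted class is the unit with the largest activation
    return int(np.argmax(h))

print(one_hot(2))                                      # [0. 0. 1. 0.]
print(predict_class(np.array([0.1, 0.05, 0.8, 0.2])))  # 2
```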
