Machine Learning by Andrew Ng
💡 Study notes for Week 5 of Andrew Ng's Machine Learning course
🐠 Collected edition of all my study notes for this course
✓ Course page: Stanford Machine Learning
🍭 Reference resources
Outline
Cost Function and Back-propagation
Notation
- $L$ = total number of layers in the network
- $S_l$ = number of units (not counting the bias unit) in layer $l$
- $K$ = number of output units/classes
We denote $h_\Theta(x)_k$ as the hypothesis that results in the $k$-th output: given an input $x$, $h_\Theta(x)_k$ is the $k$-th element of the output vector.
1.Cost Function
Recall the cost function for regularized logistic regression:

$$J(\theta) = -\frac{1}{m} \sum_{i=1}^{m} \left[ y^{(i)} \log h_\theta(x^{(i)}) + (1 - y^{(i)}) \log\left(1 - h_\theta(x^{(i)})\right) \right] + \frac{\lambda}{2m} \sum_{j=1}^{n} \theta_j^2$$

For a neural network, the cost function generalizes this over all $K$ output units, and the regularization term sums over every non-bias weight in every layer:

$$J(\Theta) = -\frac{1}{m} \sum_{i=1}^{m} \sum_{k=1}^{K} \left[ y_k^{(i)} \log\left(h_\Theta(x^{(i)})\right)_k + (1 - y_k^{(i)}) \log\left(1 - \left(h_\Theta(x^{(i)})\right)_k\right) \right] + \frac{\lambda}{2m} \sum_{l=1}^{L-1} \sum_{i=1}^{S_l} \sum_{j=1}^{S_{l+1}} \left(\Theta_{j,i}^{(l)}\right)^2$$
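As a concrete illustration, here is a minimal NumPy sketch of this cost for a three-layer network (input, hidden, output). The function name, the sigmoid activation, and the one-hot label matrix `Y` are assumptions made for the example; the course exercises themselves use Octave.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def nn_cost(Theta1, Theta2, X, Y, lam):
    """Regularized cost for a 3-layer network.

    X   : (m, n) inputs, one example per row
    Y   : (m, K) one-hot labels
    lam : regularization strength lambda
    """
    m = X.shape[0]

    # Forward propagation (prepend bias units).
    A1 = np.hstack([np.ones((m, 1)), X])
    A2 = np.hstack([np.ones((m, 1)), sigmoid(A1 @ Theta1.T)])
    H = sigmoid(A2 @ Theta2.T)            # (m, K): h_Theta(x) for every example

    # Unregularized cross-entropy cost, summed over all K output units.
    J = -np.sum(Y * np.log(H) + (1 - Y) * np.log(1 - H)) / m

    # Regularization: skip the bias column (first column) of each Theta.
    J += lam / (2 * m) * (np.sum(Theta1[:, 1:] ** 2) + np.sum(Theta2[:, 1:] ** 2))
    return J
```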
Here, we define $\delta_j^{(l)}$ as the "error" of node $a_j^{(l)}$, the activation of unit $j$ in layer $l$.
2.Back-propagation Algorithm
As with other machine learning algorithms, our goal is to minimize the cost function. Therefore, we need to compute:
- $J(\Theta)$
- $\frac{\partial}{\partial \Theta_{i,j}^{(l)}} J(\Theta)$
Back-propagation is the algorithm we use to compute $\frac{\partial}{\partial \Theta_{i,j}^{(l)}} J(\Theta)$. The whole algorithm proceeds as follows:
Steps 1 and 2: run forward propagation to compute the activations $a^{(l)}$ for every layer (with $a^{(1)} = x$).
Step 3: compute the output-layer error $\delta^{(L)} = a^{(L)} - y$.
Step 4: compute $\delta^{(L-1)}, \delta^{(L-2)}, \dots, \delta^{(2)}$ by propagating the error backwards, $\delta^{(l)} = \left(\Theta^{(l)}\right)^T \delta^{(l+1)} \mathbin{.*} a^{(l)} \mathbin{.*} \left(1 - a^{(l)}\right)$ (there is no $\delta^{(1)}$, since the input has no error).
Step 5: accumulate the gradient, $\Delta_{i,j}^{(l)} := \Delta_{i,j}^{(l)} + a_j^{(l)} \delta_i^{(l+1)}$, over all $m$ examples, then average (and regularize) to obtain $\frac{\partial}{\partial \Theta_{i,j}^{(l)}} J(\Theta)$.
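For reference, here is a vectorized NumPy sketch of these five steps for a three-layer network. The function name, the sigmoid activation, and the one-hot labels `Y` are assumptions made for this example, not notation from the course.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def backprop(Theta1, Theta2, X, Y, lam):
    """One pass of back-propagation over all m examples (vectorized).

    Returns the gradients of the regularized cost w.r.t. Theta1 and Theta2.
    """
    m = X.shape[0]

    # Steps 1-2: forward propagation, keeping every activation.
    A1 = np.hstack([np.ones((m, 1)), X])              # (m, n+1)
    Z2 = A1 @ Theta1.T
    A2 = np.hstack([np.ones((m, 1)), sigmoid(Z2)])    # (m, S2+1)
    A3 = sigmoid(A2 @ Theta2.T)                       # (m, K), output layer

    # Step 3: error of the output layer.
    D3 = A3 - Y                                       # (m, K)

    # Step 4: propagate the error back through the hidden layer, dropping
    # the bias column; g'(z) = g(z)(1 - g(z)) for the sigmoid.
    D2 = (D3 @ Theta2)[:, 1:] * sigmoid(Z2) * (1 - sigmoid(Z2))  # (m, S2)

    # Step 5: accumulate and average the gradients, then add the
    # regularization term (bias columns are not regularized).
    Grad1 = D2.T @ A1 / m
    Grad2 = D3.T @ A2 / m
    Grad1[:, 1:] += lam / m * Theta1[:, 1:]
    Grad2[:, 1:] += lam / m * Theta2[:, 1:]
    return Grad1, Grad2
```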
3.Back-propagation Intuition
Back-propagation in Practice
1.Implementation Note: Unrolling Parameters
Optimization routines expect the parameters as a single flat vector, so we "unroll" the weight matrices $\Theta^{(1)}, \Theta^{(2)}, \dots$ into one long vector and reshape them back inside the cost function (see the sketch below). Use gradient checking to make sure everything still works correctly.
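A minimal sketch of unrolling and re-rolling, assuming the 400-25-10 architecture from the course exercises; the variable names are illustrative:

```python
import numpy as np

Theta1 = np.random.rand(25, 401)   # hidden layer: 25 units, 400 inputs + bias
Theta2 = np.random.rand(10, 26)    # output layer: 10 classes, 25 units + bias

# Unroll both matrices into one long vector for the optimizer.
params = np.concatenate([Theta1.ravel(), Theta2.ravel()])

# Recover the matrices inside the cost function.
Theta1_back = params[:25 * 401].reshape(25, 401)
Theta2_back = params[25 * 401:].reshape(10, 26)
assert np.array_equal(Theta1, Theta1_back) and np.array_equal(Theta2, Theta2_back)
```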
2.Gradient Checking
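Gradient checking numerically approximates each partial derivative with a two-sided difference, $\frac{\partial}{\partial \theta_i} J(\theta) \approx \frac{J(\theta + \epsilon e_i) - J(\theta - \epsilon e_i)}{2\epsilon}$, and compares the result against the back-propagation gradient. A minimal sketch follows; the function names are assumptions, and $\epsilon = 10^{-4}$ is the value suggested in the course.

```python
import numpy as np

def numerical_gradient(cost, params, eps=1e-4):
    """Two-sided difference approximation of dJ/d(params[i]) for each i.

    cost : function mapping a flat parameter vector to the scalar J
    """
    grad = np.zeros_like(params)
    for i in range(params.size):
        bump = np.zeros_like(params)
        bump[i] = eps
        grad[i] = (cost(params + bump) - cost(params - bump)) / (2 * eps)
    return grad

# Usage sketch: compare against the back-propagation gradient once, then
# disable checking before training (it is far too slow to run every step).
# assert np.allclose(numerical_gradient(cost, params), backprop_grad, atol=1e-7)
```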
3.Random Initialization
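Initializing all weights to zero fails for neural networks: every hidden unit then computes the same function and receives the same update (symmetry). Instead, each $\Theta_{i,j}^{(l)}$ is initialized to a random value in $[-\epsilon_{init}, \epsilon_{init}]$. A minimal sketch, where the function name and $\epsilon_{init} = 0.12$ are illustrative choices:

```python
import numpy as np

def rand_init(l_in, l_out, eps_init=0.12):
    """Random weights for a layer with l_in inputs (+1 bias), drawn
    uniformly from [-eps_init, eps_init] to break symmetry."""
    return np.random.rand(l_out, l_in + 1) * 2 * eps_init - eps_init

Theta1 = rand_init(400, 25)   # example layer sizes; any architecture works
Theta2 = rand_init(25, 10)
```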
4.Putting It Together
The 6 steps to train a network:
1. Randomly initialize the weights.
2. Implement forward propagation to get $h_\Theta(x^{(i)})$ for any $x^{(i)}$.
3. Implement the cost function $J(\Theta)$.
4. Implement back-propagation to compute the partial derivatives.
5. Use gradient checking to confirm the back-propagation gradients, then disable it.
6. Use gradient descent or a built-in optimization function to minimize $J(\Theta)$.
Ideally, we want $h_\Theta(x^{(i)}) \approx y^{(i)}$. But remember that $J(\Theta)$ is not a convex function, so we can end up in a local minimum instead of the global one.
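To make the pipeline concrete, here is a tiny end-to-end sketch wiring the six steps together with plain batch gradient descent on synthetic data. The architecture, data, learning rate, and iteration count are all illustrative choices, not values from the course.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

rng = np.random.default_rng(0)
X = rng.random((100, 4))                   # 100 examples, 4 features
Y = np.eye(3)[rng.integers(0, 3, 100)]     # one-hot labels, 3 classes
lam, alpha, m = 1.0, 1.0, X.shape[0]

# Step 1: randomly initialize the weights (symmetry breaking).
Theta1 = rng.uniform(-0.12, 0.12, (5, 5))  # 4 inputs + bias -> 5 hidden units
Theta2 = rng.uniform(-0.12, 0.12, (3, 6))  # 5 hidden + bias -> 3 outputs

for it in range(500):
    # Step 2: forward propagation to get h_Theta(x).
    A1 = np.hstack([np.ones((m, 1)), X])
    Z2 = A1 @ Theta1.T
    A2 = np.hstack([np.ones((m, 1)), sigmoid(Z2)])
    H = sigmoid(A2 @ Theta2.T)

    # Step 3: compute the regularized cost J(Theta).
    J = (-np.sum(Y * np.log(H) + (1 - Y) * np.log(1 - H)) / m
         + lam / (2 * m) * (np.sum(Theta1[:, 1:]**2) + np.sum(Theta2[:, 1:]**2)))

    # Step 4: back-propagation for the partial derivatives.
    D3 = H - Y
    D2 = (D3 @ Theta2)[:, 1:] * sigmoid(Z2) * (1 - sigmoid(Z2))
    Grad1 = D2.T @ A1 / m
    Grad2 = D3.T @ A2 / m
    Grad1[:, 1:] += lam / m * Theta1[:, 1:]
    Grad2[:, 1:] += lam / m * Theta2[:, 1:]

    # Step 5 (gradient checking) would be run once here, then disabled.
    # Step 6: gradient descent on J(Theta).
    Theta1 -= alpha * Grad1
    Theta2 -= alpha * Grad2
```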
Application of Neural Networks
1.Autonomous Driving
skip.
Review
skip. Extra reading.