Optimizing techniques in Deep Learning
- Jacobian Matrix
represents the first-order partial derivatives of a MIMO (vector-valued) function f: R^n → R^m, with entries J_{i,j} = ∂f_i/∂x_j.
- Hessian Matrix
the matrix of second derivatives. For a function f: R^n → R, H(f)(x)_{i,j} = ∂²f(x)/(∂x_i ∂x_j). When the second partial derivatives are continuous, the Hessian matrix is real and symmetric.
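As a quick sketch of the definition above, the Hessian can be estimated numerically with central differences; the function `hessian` and the test function below are illustrative choices, not from the text:

```python
import numpy as np

def hessian(f, x, eps=1e-4):
    """Estimate the Hessian of f at x via central finite differences:
    H[i, j] ≈ ∂²f/∂x_i∂x_j."""
    n = x.size
    H = np.zeros((n, n))
    for i in range(n):
        for j in range(n):
            ei = np.zeros(n); ei[i] = eps
            ej = np.zeros(n); ej[j] = eps
            # Four-point stencil for the mixed second partial derivative
            H[i, j] = (f(x + ei + ej) - f(x + ei - ej)
                       - f(x - ei + ej) + f(x - ei - ej)) / (4 * eps**2)
    return H

# f(x, y) = x^2 + 3xy has constant Hessian [[2, 3], [3, 0]]
f = lambda v: v[0]**2 + 3 * v[0] * v[1]
H = hessian(f, np.array([1.0, 2.0]))
```

Note that the numerical estimate comes out symmetric, matching the symmetry property stated above.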
First-order optimization algorithms: use only first derivatives (the gradient / Jacobian).
Second-order optimization algorithms: also make use of the Hessian Matrix. Example: Newton's Method, which assumes the function is locally well approximated by a positive definite quadratic. It is not suitable around saddle points, where the Hessian is not positive definite.
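A minimal sketch of one Newton step, x ← x − H⁻¹∇f(x), on a hypothetical quadratic objective (the function and its derivatives below are my own example, chosen so the method converges in a single step):

```python
import numpy as np

def newton_step(grad, hess, x):
    # Solve H p = ∇f(x) and move against p; this is a descent
    # direction only when H is positive definite
    return x - np.linalg.solve(hess(x), grad(x))

# Minimize f(x) = (x0 - 1)^2 + 2*(x1 + 3)^2, minimum at (1, -3)
grad = lambda x: np.array([2 * (x[0] - 1), 4 * (x[1] + 3)])
hess = lambda x: np.array([[2.0, 0.0], [0.0, 4.0]])

x = np.array([10.0, 10.0])
x = newton_step(grad, hess, x)  # one step is exact for a quadratic
```

Because the quadratic approximation here is exact, a single step lands on the minimizer; on a general function Newton's method iterates this update.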
KL Divergence
Measures how different two distributions P(x) and Q(x) are. It is not symmetric, so it is not a true distance metric. Note that the KL divergence is non-negative, and it is zero if and only if P(x) = Q(x). It is closely related to cross-entropy. See the Deep Learning textbook, pages 76-78, for a detailed explanation.
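A small sketch for the discrete case, D_KL(P‖Q) = Σ_x P(x) log(P(x)/Q(x)); the distributions below are arbitrary examples used to exhibit the asymmetry and non-negativity noted above:

```python
import numpy as np

def kl_divergence(p, q):
    """D_KL(P || Q) = sum_x P(x) * log(P(x) / Q(x)) for discrete distributions."""
    p, q = np.asarray(p, dtype=float), np.asarray(q, dtype=float)
    mask = p > 0  # terms with P(x) = 0 contribute 0 by convention
    return float(np.sum(p[mask] * np.log(p[mask] / q[mask])))

p = [0.5, 0.5]
q = [0.9, 0.1]
d_pq = kl_divergence(p, q)  # != kl_divergence(q, p): not symmetric
d_qp = kl_divergence(q, p)
```

Swapping the arguments gives a different value, which is why KL divergence is not a true distance measure.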