Coursera, Andrew Ng, Machine Learning study notes (7) [Week 3: Regularization]

痞靥

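Category: Machine Learning. Tags: regularization

Copyright notice: This is an original article by the author, licensed under CC 4.0 BY-SA. When reposting, please include a link to the original source and this notice.

Article link: https://blog.csdn.net/u012347642/article/details/80412198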

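Underfitting (high bias): the model does not fit the training data well.
Overfitting (high variance): the model fits the training data very well, but the hypothesis is too complex, with too many features and too little data to constrain it, so it fails to generalize to new examples.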

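Ways to address overfitting:

1. Reduce the number of features
- Manually select which features to keep
- Use a model selection algorithm

2. Regularization
- Keep all the features, but reduce the magnitude of the parameters $\theta_j$
- Works well when there are many features and each one contributes a little to predicting $y$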

The idea behind regularization:
When the parameters $\theta_0, \theta_1, \cdots, \theta_n$ take small values, the hypothesis is simpler and less prone to overfitting.

Regularized cost function for linear regression:

$J(\theta)={1\over2m}[\sum_{i=1}^m (h_\theta(x^{(i)})-y^{(i)})^2+\lambda\sum_{j=1}^n \theta_j^2]$

$\lambda\sum_{j=1}^n \theta_j^2$ is the regularization term (note: it does not include $\theta_0$), and $\lambda$ is the regularization parameter.
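The cost function above can be sketched in NumPy as follows. This is a minimal illustration, not code from the course: the function name `regularized_linear_cost`, the toy data, and the convention that `X` already contains a leading column of ones (so that $x_0=1$) are all assumptions made here.

```python
import numpy as np

def regularized_linear_cost(theta, X, y, lam):
    """Regularized cost J(theta) for linear regression.

    Assumes X has shape (m, n+1) with a first column of ones,
    so theta[0] plays the role of theta_0.
    """
    m = y.shape[0]
    residuals = X @ theta - y                 # h_theta(x^(i)) - y^(i)
    penalty = lam * np.sum(theta[1:] ** 2)    # sum starts at j = 1: theta_0 is skipped
    return (residuals @ residuals + penalty) / (2 * m)

# Hypothetical toy data: y = x exactly, so theta = [0, 1] fits perfectly.
X = np.array([[1.0, 1.0], [1.0, 2.0], [1.0, 3.0]])
y = np.array([1.0, 2.0, 3.0])
theta = np.array([0.0, 1.0])

cost_no_reg = regularized_linear_cost(theta, X, y, lam=0.0)  # 0.0: perfect fit, no penalty
cost_reg = regularized_linear_cost(theta, X, y, lam=3.0)     # 0.5: (0 + 3 * 1^2) / (2 * 3)
```

With a perfect fit the squared-error part is zero, so any remaining cost comes only from the penalty on $\theta_1$.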

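The role of $\lambda$ is to balance two goals:
Goal 1: make the hypothesis fit the training data well;
Goal 2: keep the parameter values small.

Because the goal is to minimize $J(\theta)$, a very large $\lambda$ makes the term $\lambda\sum_{j=1}^n \theta_j^2$ dominate, which drives every $\theta_j$ ($j\ge1$) toward zero. If $\lambda$ is too large, the hypothesis reduces to roughly $h_\theta(x)\approx\theta_0$ and the model underfits.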
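As a small worked illustration of this shrinking effect, consider a simplified case that is assumed here and not taken from the lecture: a single feature $x$, no intercept term, and one parameter $\theta_1$, so that $h_\theta(x)=\theta_1x$. Setting the derivative of $J$ to zero gives a closed form:

${\partial J\over\partial\theta_1}={1\over m}[\sum_{i=1}^m (\theta_1x^{(i)}-y^{(i)})x^{(i)}+\lambda\theta_1]=0 \quad\Rightarrow\quad \theta_1={\sum_{i=1}^m x^{(i)}y^{(i)}\over\sum_{i=1}^m (x^{(i)})^2+\lambda}$

With $\lambda=0$ this is the ordinary least-squares slope. As $\lambda$ grows, the denominator grows while the numerator stays fixed, so $\theta_1\to0$.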

Regularized linear regression

Gradient descent:
Repeat {

$\theta_0:=\theta_0-\alpha{1\over m} \sum_{i=1}^m (h_\theta(x^{(i)})-y^{(i)})x^{(i)}_0$

$\theta_j:=\theta_j-\alpha[{1\over m} \sum_{i=1}^m (h_\theta(x^{(i)})-y^{(i)})x^{(i)}_j+{\lambda\over m}\theta_j] \quad (j=1,\cdots,n)$

}
Rearranging the update rule for $\theta_j$:

$\theta_j:=(1-\alpha{\lambda\over m})\theta_j-\alpha{1\over m} \sum_{i=1}^m (h_\theta(x^{(i)})-y^{(i)})x^{(i)}_j$

(Note: $1-\alpha{\lambda\over m}<1$, so each iteration first shrinks $\theta_j$ slightly and then applies the usual unregularized gradient step.)
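The rearranged update can be sketched in NumPy as shown here. This is an illustrative sketch under stated assumptions: the function name `gradient_descent_regularized`, the learning rate, the iteration count, and the toy data are hypothetical choices, and `X` is assumed to carry a leading column of ones.

```python
import numpy as np

def gradient_descent_regularized(X, y, lam, alpha=0.1, num_iters=2000):
    """Batch gradient descent for regularized linear regression.

    Uses the 'shrink, then step' form of the update. theta_0 is
    left unshrunk because it is not part of the regularization term.
    """
    m, n_plus_1 = X.shape
    theta = np.zeros(n_plus_1)
    shrink = np.ones(n_plus_1)
    shrink[1:] = 1.0 - alpha * lam / m        # factor (1 - alpha * lambda / m) for j >= 1
    for _ in range(num_iters):
        grad = X.T @ (X @ theta - y) / m      # unregularized gradient, from the old theta
        theta = shrink * theta - alpha * grad # simultaneous update of all theta_j
    return theta

# Hypothetical toy data generated by y = 1 + 2x.
X = np.array([[1.0, 0.0], [1.0, 1.0], [1.0, 2.0], [1.0, 3.0]])
y = np.array([1.0, 3.0, 5.0, 7.0])

theta_plain = gradient_descent_regularized(X, y, lam=0.0)   # close to [1, 2]
theta_reg = gradient_descent_regularized(X, y, lam=10.0)    # slope pulled toward zero
```

On this data the regularized run ends with a visibly smaller slope, while the intercept grows to compensate because $\theta_0$ is not penalized.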

Normal equation:
Normal equation for linear regression:

$\theta=(X^TX)^{-1}X^Ty$

Normal equation for regularized linear regression:

$\theta=(X^TX+\lambda L)^{-1}X^Ty$

where $L$ is an $(n+1)\times(n+1)$ matrix of the form

$\left[ \begin{array}{ccccc} 0&&&&\\ &1&&&\\ &&1&&\\ &&&\ddots&\\ &&&&1 \end{array} \right]$
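A minimal NumPy sketch of the regularized normal equation is given here. The function name `normal_equation_regularized` and the toy data are assumptions for illustration, and `X` is assumed to include a leading column of ones. The sketch solves the linear system with `np.linalg.solve` instead of forming the inverse explicitly, which gives the same $\theta$ and is generally the more numerically stable choice.

```python
import numpy as np

def normal_equation_regularized(X, y, lam):
    """Solve (X^T X + lambda * L) theta = X^T y for theta."""
    n_plus_1 = X.shape[1]
    L = np.eye(n_plus_1)
    L[0, 0] = 0.0                  # top-left entry is 0, so theta_0 is not regularized
    return np.linalg.solve(X.T @ X + lam * L, X.T @ y)

# Hypothetical toy data generated by y = 1 + 2x.
X = np.array([[1.0, 0.0], [1.0, 1.0], [1.0, 2.0], [1.0, 3.0]])
y = np.array([1.0, 3.0, 5.0, 7.0])

theta_plain = normal_equation_regularized(X, y, lam=0.0)    # approximately [1, 2]
theta_reg = normal_equation_regularized(X, y, lam=10.0)     # approximately [3, 0.667]
```

With $\lambda=10$ the system becomes $\left[\begin{array}{cc}4&6\\6&24\end{array}\right]\theta=\left[\begin{array}{c}16\\34\end{array}\right]$, whose solution is $\theta=(3, 2/3)$: the slope shrinks from 2 to about 0.667.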

Regularized logistic regression

Cost function:

$J(\theta)=-{1\over m}\sum_{i=1}^m [y^{(i)}\log(h_\theta(x^{(i)}))+(1-y^{(i)})\log(1-h_\theta(x^{(i)}))]+{\lambda \over 2m}\sum_{j=1}^n \theta_j^2$

Gradient descent:
Repeat {

$\theta_0:=\theta_0-\alpha{1\over m} \sum_{i=1}^m (h_\theta(x^{(i)})-y^{(i)})x^{(i)}_0$

$\theta_j:=\theta_j-\alpha[{1\over m} \sum_{i=1}^m (h_\theta(x^{(i)})-y^{(i)})x^{(i)}_j+{\lambda\over m}\theta_j] \quad (j=1,\cdots,n)$

}

The update has the same form as for linear regression, but here the hypothesis is the sigmoid $h_\theta(x)={1\over 1+e^{-\theta^Tx}}$.
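The regularized logistic regression cost and update can be sketched in NumPy as follows. This is a hedged illustration: the names `sigmoid`, `logistic_cost`, and `logistic_gradient_descent`, the learning rate, the iteration count, and the toy data are all assumptions made here, and `X` is assumed to include a leading column of ones.

```python
import numpy as np

def sigmoid(z):
    """Logistic function 1 / (1 + e^(-z))."""
    return 1.0 / (1.0 + np.exp(-z))

def logistic_cost(theta, X, y, lam):
    """Regularized logistic regression cost J(theta)."""
    m = y.shape[0]
    h = sigmoid(X @ theta)
    data_term = -np.mean(y * np.log(h) + (1 - y) * np.log(1 - h))
    penalty = lam / (2 * m) * np.sum(theta[1:] ** 2)   # theta_0 is not penalized
    return data_term + penalty

def logistic_gradient_descent(X, y, lam, alpha=0.5, num_iters=5000):
    """Batch gradient descent with the regularized update rule."""
    m, n_plus_1 = X.shape
    theta = np.zeros(n_plus_1)
    for _ in range(num_iters):
        grad = X.T @ (sigmoid(X @ theta) - y) / m      # same form as linear regression
        grad[1:] += (lam / m) * theta[1:]              # extra term only for j >= 1
        theta = theta - alpha * grad
    return theta

# Hypothetical, linearly separable toy data: label is 1 when x > 0.
X = np.array([[1.0, -2.0], [1.0, -1.0], [1.0, 1.0], [1.0, 2.0]])
y = np.array([0.0, 0.0, 1.0, 1.0])

theta_weak = logistic_gradient_descent(X, y, lam=0.01)   # light regularization
theta_strong = logistic_gradient_descent(X, y, lam=1.0)  # heavier regularization
```

On separable data like this, the unregularized weights would keep growing without bound. The penalty keeps them finite, and the larger $\lambda$ yields the smaller $\theta_1$ while still classifying all four points correctly.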
