线性回归的两种算法实现梯度下降和解析法（西瓜书学习）

最新推荐文章于 2023-05-21 21:21:07 发布

幻一空

最新推荐文章于 2023-05-21 21:21:07 发布

阅读量699

点赞数

文章标签： python 机器学习算法

本文链接：https://blog.csdn.net/hk_bruce/article/details/113815966

版权

线性回归算法实现

规定

d个属性描述的示例
$\begin{pmatrix} x_1\\ ...\\ x_d \end{pmatrix}$
$x_i$ 为第i个属性的取值

即:
$f(x)=w_1x_1+w_2x_2+...+w_dx_d+b$
向量形式:
$f(x)=w^Tx+b, w= \begin{pmatrix} w_1\\ ...\\ w_d \end{pmatrix}$

将w和b一起写为向量形式
即:
$\hat{W}= \begin{pmatrix} w\\ b \end{pmatrix}$
数据集表示为
$D=\{(x_1,y_1),...,(x_m,y_m)\}, x_i= \begin{pmatrix} x_{i1}\\ ...\\ x_{id} \end{pmatrix}, y_i\in R$

数据集转化为矩阵
$\begin{pmatrix} x_{11} & x_{12} & ... & x_{1d} & 1\\ x_{21} & x_{22} & ... & x_{2d} & 1\\ ... & ... & ... & ... & ...\\ x_{m1} & x_{m2} & ... & x_{md} & 1 \end{pmatrix}$
$\begin{pmatrix} y_1\\ ...\\ y_m \end{pmatrix}$
代价函数
$J(\hat{W})=\frac{1}{2m}(Y-X\hat{W})^T(Y-X\hat{W})$

参数估计和解释

$(w,b)=\mathop{\arg\min}\limits_{(w,b)}J(\hat{W})$
使得
$f(x)=X\hat{W}\simeq Y$
均方误差有非常好的几何意义，它对应了常用的欧式距离。基于均方误差最小化进行模型求解的方法加最小二乘法。在线性回归中，最小二乘法就是试图找到一个条直线，使得所有样本到直线上的距离最小

梯度下降

$J(\hat{W})$ 导数为
$\frac{1}{m}X^T(X\hat{W}-Y)$
$\hat{W}-=J^{'}({\hat{W}})$

正规方程解法

公式为
$\hat{W}=(X^TX)^{-1}X^TY =X^{-1}Y$

导入库

import numpy as np
import matplotlib.pyplot as plt

准备数据

# 样本数量
m=100
# 初始化数据
X=np.linspace(0,10,m)
Y=X+2+np.random.randn(m)
# 展示样本数据
plt.scatter(X,Y)
plt.show()
# 将样本数据矩阵化
X=X.reshape((m,1))
X1=np.hstack((np.ones_like(X),X))
Y=Y.reshape((m,1))
# 参数
omiga=np.zeros(2).reshape((2,1))
# 步长
alpha=0.01
precision=1e-3

在这里插入图片描述

代价函数

# 代价函数
def cost(omiga,X,Y):
    return (Y-X@omiga).T @ (Y-X@omiga)

梯度迭代

# 代价列表
costs=[]
while True:
    costs.append(cost(omiga,X1,Y)[0][0])
    omiga -=alpha /m * X1.T @ ( X1 @ omiga - Y )
    if len(costs)>1: 
        if costs[-2]-costs[-1]<=precision:
            break

# 代价函数曲线
plt.plot(np.arange(len(costs)),costs)
plt.show()

在这里插入图片描述

展示结果

plt.plot(X,X1 @ omiga)
plt.scatter(X,Y)
plt.show()

在这里插入图片描述

print(omiga)

[[1.80041713]
 [1.05535637]]

正规方程

# omiga = np.linalg.pinv(X1.T @ X1) @ X1.T @ Y
# print(omiga)
omiga=np.linalg.pinv(X1) @ Y
print(omiga)

[[1.88910625]
 [1.04202106]]

plt.plot(X,X1 @ omiga)
plt.scatter(X,Y)
plt.show()

在这里插入图片描述

幻一空

关注

0
点赞
踩
8

收藏

觉得还不错? 一键收藏
0
评论
线性回归的两种算法实现梯度下降和解析法（西瓜书学习）

线性回归算法实现规定d个属性描述的示例x=(x1...xd)x=\begin{pmatrix}x_1\\...\\x_d\end{pmatrix}x=⎝⎛x1...xd⎠⎞xix_ixi为第i个属性的取值即:f(x)=w1x1+w2x2+...+wdxd+bf(x)=w_1x_1+w_2x_2+...+w_dx_d+bf(x)=w1x1+w2x2+...+wdxd+b向量形式:f(x)=wTx+b,w=(w1...wd)f(x)=w^Tx+b,w
复制链接

扫一扫