Simple Linear Regression
Let the model be a univariate linear function:

$$
y = w_1 x + w_0
$$
Given samples $(x_1, y_1), (x_2, y_2), \dots, (x_n, y_n)$, we fit this linear function to obtain $w_1^*, w_0^*$.
Let $\hat y_i$ be the fitted prediction for $x_i$. Take the sum of squared errors as the loss function $L(w_1^*, w_0^*)$; minimizing it yields $w_1^*, w_0^*$, so the objective is:

$$
\argmin_{w_1^*, w_0^*} L(w_1^*, w_0^*)
$$
$$
L(w_1^*, w_0^*) = \sum_{i=1}^n (\hat y_i - y_i)^2 \tag{1}
$$

$$
\hat y_i = w_1^* x_i + w_0^* \tag{2}
$$
Combining (1) and (2):
$$
\begin{aligned}
L &= \sum_{i=1}^n (w_1^* x_i + w_0^* - y_i)^2\\
&= \sum_{i=1}^n \left((w_1^* x_i)^2 + (w_0^*)^2 + y_i^2 + 2 w_1^* w_0^* x_i - 2 w_1^* x_i y_i - 2 w_0^* y_i\right)
\end{aligned} \tag{3}
$$
Taking the partial derivatives of (3) with respect to $w_1^*, w_0^*$:
$$
\begin{aligned}
\frac{\partial L}{\partial w_0^*} &= \sum_{i=1}^n (2 w_0^* + 2 w_1^* x_i - 2 y_i) = 2 n w_0^* + 2 w_1^* \sum_{i=1}^n x_i - 2 \sum_{i=1}^n y_i\\
\frac{\partial L}{\partial w_1^*} &= \sum_{i=1}^n (2 x_i^2 w_1^* + 2 x_i w_0^* - 2 x_i y_i) = 2 w_1^* \sum_{i=1}^n x_i^2 + 2 w_0^* \sum_{i=1}^n x_i - 2 \sum_{i=1}^n x_i y_i
\end{aligned}
$$
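These analytic partial derivatives can be checked numerically with central finite differences. A quick sketch, assuming NumPy; the data and evaluation point are illustrative:

```python
import numpy as np

x = np.array([0.0, 1.0, 2.0, 3.0])
y = np.array([1.0, 2.9, 5.1, 7.0])

def loss(w1, w0):
    # L = sum of squared errors, Eq. (1)-(2)
    return np.sum((w1 * x + w0 - y) ** 2)

def grad(w1, w0):
    """Analytic partials from the derivation above."""
    dw0 = 2 * len(x) * w0 + 2 * w1 * x.sum() - 2 * y.sum()
    dw1 = 2 * w1 * (x * x).sum() + 2 * w0 * x.sum() - 2 * (x * y).sum()
    return dw1, dw0

# compare against central finite differences at an arbitrary point
w1, w0, h = 1.5, 0.5, 1e-6
num_dw1 = (loss(w1 + h, w0) - loss(w1 - h, w0)) / (2 * h)
num_dw0 = (loss(w1, w0 + h) - loss(w1, w0 - h)) / (2 * h)
```

Since the loss is quadratic in $w_1, w_0$, the central difference agrees with the analytic gradient up to floating-point rounding.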
Setting both partial derivatives to 0:
$$
\left\{
\begin{aligned}
n w_0^* + w_1^* \sum_{i=1}^n x_i - \sum_{i=1}^n y_i &= 0\\
w_1^* \sum_{i=1}^n x_i^2 + w_0^* \sum_{i=1}^n x_i - \sum_{i=1}^n x_i y_i &= 0
\end{aligned}
\right. \tag{4}
$$
Solving system (4) gives:
$$
\left\{
\begin{aligned}
w_0^* &= \frac{\sum_{i=1}^n y_i - w_1^* \sum_{i=1}^n x_i}{n}\\
w_1^* &= \frac{\sum_{i=1}^n x_i y_i - w_0^* \sum_{i=1}^n x_i}{\sum_{i=1}^n x_i^2}
\end{aligned}
\right. \tag{5}
$$
Substituting the two equations of (5) into each other gives:
$$
\left\{
\begin{aligned}
w_0^* &= \frac{\sum_{i=1}^n y_i - \frac{\sum_{i=1}^n x_i \sum_{i=1}^n x_i y_i}{\sum_{i=1}^n x_i^2}}{n - \frac{\left(\sum_{i=1}^n x_i\right)^2}{\sum_{i=1}^n x_i^2}}\\
w_1^* &= \frac{\sum_{i=1}^n x_i y_i - \frac{\sum_{i=1}^n x_i \sum_{i=1}^n y_i}{n}}{\sum_{i=1}^n x_i^2 - \frac{\left(\sum_{i=1}^n x_i\right)^2}{n}}
\end{aligned}
\right. \tag{6}
$$
Once $w_1^*$ is computed from (6), it can be substituted into (5) to compute $w_0^*$ directly.
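The closed-form solution can be sketched in a few lines of Python (NumPy assumed; the function name and data are illustrative):

```python
import numpy as np

def fit_simple_linear(x, y):
    """Closed-form least squares for y = w1*x + w0, following Eqs. (5)-(6)."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    n = len(x)
    sx, sy = x.sum(), y.sum()
    sxx, sxy = (x * x).sum(), (x * y).sum()
    # w1 from Eq. (6): (Σ x_i y_i − Σx_i Σy_i / n) / (Σ x_i² − (Σ x_i)² / n)
    w1 = (sxy - sx * sy / n) / (sxx - sx * sx / n)
    # w0 from Eq. (5): (Σ y_i − w1 Σ x_i) / n
    w0 = (sy - w1 * sx) / n
    return w1, w0

# points lying exactly on y = 2x + 1, so the fit should recover w1 = 2, w0 = 1
w1, w0 = fit_simple_linear([0, 1, 2, 3], [1, 3, 5, 7])
```

Note that the denominator of $w_1^*$ vanishes when all $x_i$ are identical, in which case the slope is not identifiable.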
Multiple Linear Regression (Least Squares)
Extending the method above to $w_2, w_3, \cdots$ gives the solution for multiple linear regression, but deriving each coefficient one by one is tedious.
Instead, write the model directly as a multivariate function:

$$
y = \mathbf x \mathbf w
$$
where $\mathbf w = [w_1, w_2, \dots, w_n]^\mathrm T$, $\mathbf x = [x_1, x_2, \dots, x_n]$, and $x_1 = 1$. The constant term is treated as the coefficient of a dummy feature, which is why $x_1 = 1$.
Now for the multiple linear regression derivation. Let $\hat{\mathbf w}$ be the coefficient vector to be solved for, $\mathbf y = [y_1, y_2, \dots, y_n]^\mathrm T$, and $\mathbf X = [\mathbf x_1, \mathbf x_2, \dots, \mathbf x_n]^\mathrm T$, where each row $\mathbf x_i$ of $\mathbf X$ is one sample. Define the loss function:
$$
L(\hat{\mathbf w}) = \Vert \mathbf y - \mathbf X \hat{\mathbf w} \Vert_2^2
$$
The objective is:

$$
\argmin_{\hat{\mathbf w}} \Vert \mathbf y - \mathbf X \hat{\mathbf w} \Vert_2^2
$$
$$
\begin{aligned}
\Vert \mathbf y - \mathbf X \hat{\mathbf w} \Vert_2^2 &= (\mathbf y - \mathbf X \hat{\mathbf w})^\mathrm T (\mathbf y - \mathbf X \hat{\mathbf w})\\
&= (\mathbf y^\mathrm T - \hat{\mathbf w}^\mathrm T \mathbf X^\mathrm T)(\mathbf y - \mathbf X \hat{\mathbf w})\\
&= \mathbf y^\mathrm T \mathbf y + \hat{\mathbf w}^\mathrm T \mathbf X^\mathrm T \mathbf X \hat{\mathbf w} - \hat{\mathbf w}^\mathrm T \mathbf X^\mathrm T \mathbf y - \mathbf y^\mathrm T \mathbf X \hat{\mathbf w}
\end{aligned}
$$
Using the matrix-calculus identities (in denominator layout):

$$
\begin{aligned}
\frac{\mathrm d\, \mathbf x^\mathrm T \mathbf A \mathbf x}{\mathrm d \mathbf x} &= (\mathbf A + \mathbf A^\mathrm T)\mathbf x\\
\frac{\mathrm d\, \mathbf x^\mathrm T \mathbf A}{\mathrm d \mathbf x} &= \mathbf A\\
\frac{\mathrm d\, \mathbf A \mathbf x}{\mathrm d \mathbf x} &= \mathbf A^\mathrm T
\end{aligned}
$$
differentiate $L(\hat{\mathbf w})$:

$$
\frac{\partial L(\hat{\mathbf w})}{\partial \hat{\mathbf w}} = 2\mathbf X^\mathrm T \mathbf X \hat{\mathbf w} - \mathbf X^\mathrm T \mathbf y - \mathbf X^\mathrm T \mathbf y = 2\mathbf X^\mathrm T \mathbf X \hat{\mathbf w} - 2\mathbf X^\mathrm T \mathbf y
$$
To find the minimum, set the derivative to 0, which gives (assuming $\mathbf X^\mathrm T \mathbf X$ is invertible):

$$
\hat{\mathbf w} = (\mathbf X^\mathrm T \mathbf X)^{-1} \mathbf X^\mathrm T \mathbf y
$$
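A minimal sketch of this normal-equation solution, assuming NumPy; in practice `np.linalg.solve` on $\mathbf X^\mathrm T \mathbf X \hat{\mathbf w} = \mathbf X^\mathrm T \mathbf y$ is numerically preferable to forming the inverse explicitly:

```python
import numpy as np

def fit_ols(X, y):
    """Solve the normal equations X^T X w = X^T y for the least squares fit.

    X must already contain a column of ones for the intercept
    (the x_1 = 1 convention above).
    """
    XtX = X.T @ X
    Xty = X.T @ y
    # solve() avoids explicitly computing (X^T X)^{-1}
    return np.linalg.solve(XtX, Xty)

# design matrix for y = 1 + 2*x over x = 0..3; first column is the bias term
X = np.column_stack([np.ones(4), np.arange(4.0)])
y = np.array([1.0, 3.0, 5.0, 7.0])
w_hat = fit_ols(X, y)
```

For this exactly linear data, `w_hat` recovers $[w_0, w_1] = [1, 2]$. When $\mathbf X^\mathrm T \mathbf X$ is singular or ill-conditioned (e.g. collinear features), a pseudo-inverse based solver such as `np.linalg.lstsq` is the more robust choice.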