Machine Learning: Notes on Multivariate Linear Regression, Model and Implementation

隐形人真忙

Article link: https://blog.csdn.net/u011721501/article/details/49624759

0x00 Cost Function

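In multivariate regression there are several features. In house-price prediction, for example, the price depends not only on the floor area but also on factors such as the number of rooms, the location, and the number of floors. With n features, the hypothesis function is:

$$h_\theta(x) = \theta^T x = \theta_0 x_0 + \theta_1 x_1 + \cdots + \theta_n x_n, \quad x_0 = 1$$

On this basis, the cost function over m training examples is:

$$J(\theta) = \frac{1}{2m} \sum_{i=1}^{m} \left( h_\theta(x^{(i)}) - y^{(i)} \right)^2$$

Following this formula, the cost can be computed in Octave with a single line of code: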

function J = computeCost(X, y, theta)
m = length(y); % number of training examples
J = 1/(2*m) * sum((X*theta - y) .^ 2); % cost function
end

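X is the feature matrix (floor area, number of floors, and so on), whose first column is all ones for the intercept term; y is the vector of training targets (the actual house prices in the training set); and theta is the vector of regression parameters (there are several).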
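
As a quick sanity check of the formula, here is a minimal sketch on a made-up data set of three examples (the data and the variable names `theta_zero`, `theta_fit`, and `cost` are illustrative assumptions, not from the original post). The cost is written inline as an anonymous function so the snippet runs on its own; it uses the same expression as computeCost.

```octave
% Toy data (assumed for illustration): 3 examples, one feature,
% plus the leading column of ones for the intercept term.
X = [1 1; 1 2; 1 3];
y = [1; 2; 3];
m = length(y);

theta_zero = [0; 0];   % predicts 0 for every example
theta_fit  = [0; 1];   % predicts y exactly on this data

% Same expression as computeCost, inlined so the snippet is self-contained.
cost = @(theta) 1/(2*m) * sum((X*theta - y) .^ 2);

J_zero = cost(theta_zero);   % (1^2 + 2^2 + 3^2) / (2*3) = 14/6
J_fit  = cost(theta_fit);    % every residual is zero

% Equivalent inner-product form of the same sum of squares.
err = X*theta_zero - y;
J_inner = (err' * err) / (2*m);

printf('J_zero = %.4f, J_fit = %.4f, J_inner = %.4f\n', J_zero, J_fit, J_inner);
```

The inner-product form `err' * err` is only an alternative way of writing the sum of squared residuals; either version should give the same value.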

0x01 Gradient Descent

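The core of a regression model is finding suitable values for the theta parameters.
The usual way to find them is the gradient descent algorithm, which repeats the following update, where h_θ is the hypothesis function and α is the learning rate:

$$\theta_j := \theta_j - \alpha \frac{1}{m} \sum_{i=1}^{m} \left( h_\theta(x^{(i)}) - y^{(i)} \right) x_j^{(i)}, \quad j = 0, 1, \ldots, n$$

Note that with n+1 parameters, it is the feature x_0 that is fixed at 1, not a parameter, so the update for theta_0 simply has no feature factor. All n+1 parameters, theta_0 included, are updated simultaneously. (Octave indexes from 1, so theta_0 is theta(1) in the code.)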
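
As a brief aside on where the update comes from (this derivation is not in the original post; it assumes the usual squared-error cost with hypothesis $h_\theta(x) = \theta^T x$ and $x_0 = 1$): differentiating the cost with respect to a single $\theta_j$ gives the sum that appears in the update rule.

```latex
\begin{aligned}
J(\theta) &= \frac{1}{2m} \sum_{i=1}^{m} \left( \theta^T x^{(i)} - y^{(i)} \right)^2 \\
\frac{\partial J}{\partial \theta_j}
  &= \frac{1}{2m} \sum_{i=1}^{m} 2 \left( \theta^T x^{(i)} - y^{(i)} \right)
     \frac{\partial}{\partial \theta_j} \left( \theta^T x^{(i)} \right) \\
  &= \frac{1}{m} \sum_{i=1}^{m} \left( h_\theta(x^{(i)}) - y^{(i)} \right) x_j^{(i)}
\end{aligned}
```

For $j = 0$ the factor $x_0^{(i)}$ equals 1, which is why the first update in the code has no `X(:,j)` term.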
For single-variable gradient descent, the Octave code is:

function [theta, J_history] = gradientDescent(X, y, theta, alpha, num_iters)
m = length(y); % number of training examples
J_history = zeros(num_iters, 1); % cost after each iteration
for iter = 1:num_iters
    % simultaneous update: compute both new values from the old theta
    t1 = theta(1) - alpha * (1/m) * sum((X*theta - y));
    t2 = theta(2) - alpha * (1/m) * sum((X*theta - y) .* X(:,2));
    theta(1) = t1;
    theta(2) = t2;
    J_history(iter) = computeCost(X, y, theta);
end
end

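When computing theta(1) and theta(2), two temporary variables are used so that both values are updated simultaneously: both new values are computed from the old theta first, and only then assigned.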
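
To illustrate why the temporaries matter, the following sketch takes one gradient step on made-up data in two ways (the data, `alpha = 0.1`, and the names `sim` and `seq` are assumptions for illustration). The sequential variant, which is not what the update rule asks for, lets theta(2) see the already-changed theta(1) and so lands on a different value.

```octave
% Toy data (assumed for illustration): y = 2*x, leading column of ones.
X = [1 1; 1 2; 1 3];
y = [2; 4; 6];
m = length(y);
alpha = 0.1;
theta = [0; 0];

% Simultaneous update: both gradients are evaluated at the old theta.
err = X*theta - y;
sim = [theta(1) - alpha * (1/m) * sum(err);
       theta(2) - alpha * (1/m) * sum(err .* X(:,2))];

% Sequential update (incorrect for this algorithm): the second line
% already uses the new value of seq(1).
seq = theta;
seq(1) = seq(1) - alpha * (1/m) * sum(X*seq - y);
seq(2) = seq(2) - alpha * (1/m) * sum((X*seq - y) .* X(:,2));

printf('simultaneous: %.4f %.4f\n', sim(1), sim(2));
printf('sequential:   %.4f %.4f\n', seq(1), seq(2));
```

Both variants agree on theta(1), since it is computed first, but differ on theta(2).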
In the multivariate implementation, it is enough to update every theta simultaneously in the same way:

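function [theta, J_history] = gradientDescentMulti(X, y, theta, alpha, num_iters)
m = length(y); % number of training examples
J_history = zeros(num_iters, 1); % cost after each iteration

for iter = 1:num_iters
    temp = zeros(size(theta,1), 1);
    % compute every new theta value from the old theta
    for i = 1:size(theta,1)
      temp(i) = theta(i) - alpha * (1/m) * sum((X*theta - y) .* X(:,i));
    end

    % assign only after all new values have been computed
    for i = 1:size(theta,1)
      theta(i) = temp(i);
    end
    % computeCostMulti is not listed in this post; presumably it has the same
    % body as computeCost, whose formula works for any number of features
    J_history(iter) = computeCostMulti(X, y, theta);
end
end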
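
The inner loop over i can also be collapsed into a single matrix product, because `X' * (X*theta - y)` stacks `sum((X*theta - y) .* X(:,i))` for every i. The sketch below is not from the original post; it runs the vectorized step next to the loop form on made-up data (generated from y = 1 + 2*x1 + 3*x2, with an assumed `alpha = 0.1` and 2000 iterations) to suggest that the two should agree.

```octave
% Toy data (assumed for illustration): y = 1 + 2*x1 + 3*x2,
% first column of X is all ones.
X = [1 0 0; 1 1 0; 1 0 1; 1 1 1];
y = [1; 3; 4; 6];
m = length(y);
alpha = 0.1;
num_iters = 2000;

theta_vec  = zeros(3, 1);
theta_loop = zeros(3, 1);
for iter = 1:num_iters
    % Vectorized step: one matrix product updates all parameters at once,
    % so the update is simultaneous by construction.
    theta_vec = theta_vec - (alpha/m) * (X' * (X*theta_vec - y));

    % Loop form, as in gradientDescentMulti.
    err = X*theta_loop - y;
    temp = zeros(3, 1);
    for i = 1:3
        temp(i) = theta_loop(i) - alpha * (1/m) * sum(err .* X(:,i));
    end
    theta_loop = temp;
end

printf('vectorized: %.4f %.4f %.4f\n', theta_vec(1), theta_vec(2), theta_vec(3));
printf('loop:       %.4f %.4f %.4f\n', theta_loop(1), theta_loop(2), theta_loop(3));
```

On this data both versions should approach the generating parameters [1; 2; 3]. With real features of very different scales, a smaller alpha or feature normalization may be needed for convergence.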