逻辑回归和牛顿法 Logistic Regression and Newton's Method

最新推荐文章于 2021-10-06 21:32:10 发布

woneil

最新推荐文章于 2021-10-06 21:32:10 发布

阅读量1.8k

点赞数 2

分类专栏：深度学习

本文链接：https://blog.csdn.net/ahbbshenfeng/article/details/40269151

版权

深度学习专栏收录该内容

14 篇文章 0 订阅

订阅专栏

英文中文

训练数据：

共80个学生的两门课成绩。根据两门成绩，40名学生可以上大学，另外40名学生不能上大学。训练集由向量X和Y组成。X为80行*2列，每行表示一个学生，第1列表示第1门课成绩，第2例表示第2门课成绩。Y为80行*1列，1列值为1表示可以上大学，值为0表示不能上大学。

% Exercise 4 -- Logistic Regression

clear all; close all; clc

x = load('ex4x.dat'); %x:80*2, 2个特征
y = load('ex4y.dat'); %y:80*1

[m, n] = size(x); %m:80, n:2

% Add intercept term to x
x = [ones(m, 1), x];   %x:80*3, 设X0 = 1

% Plot the training data
% Use different markers for positives and negatives
figure
pos = find(y==1);%y中元素值等于1的所有下标
neg = find(y == 0);
plot(x(pos, 2), x(pos,3), '+')  %分别表示可以上大学的同学第1门成绩、第2门成绩
hold on
plot(x(neg, 2), x(neg, 3), 'o') %分别表示不能上大学的同学第1门成绩、第2门成绩
hold on
xlabel('Exam 1 score')
ylabel('Exam 2 score')


% Initialize fitting parameters
theta = zeros(n+1, 1); % 3*1

% Define the sigmoid function  定义一个内置函数g，参数是z
g = inline('1.0 ./ (1.0 + exp(-z))'); 

% Newton's method  牛顿法
MAX_ITR = 7;  %牛顿法通过迭代次数5-15次就收敛了

J = zeros(MAX_ITR, 1);  %成本函数初始值为0

for i = 1:MAX_ITR
    % Calculate the hypothesis function
    z = x * theta;   % 80*1
    h = g(z);
    
    % Calculate gradient and hessian.
    % The formulas below are equivalent to the summation formulas
    % given in the lecture videos.
    grad = (1/m).*x' * (h-y); %梯度的矢量表示
    H = (1/m).*x' * diag(h) * diag(1-h) * x; %H的矢量表示
    
    % Calculate J (for testing convergence)  成本函数
    J(i) =(1/m)*sum(-y.*log(h) - (1-y).*log(1-h));
    
    theta = theta - H\grad; %theta迭代公式
end
% Display theta
theta

% Calculate the probability that a student with
% Score 20 on exam 1 and score 80 on exam 2 
% will not be admitted  通过这个学生成绩计算概率值, g函数值大于0.5就能上大学
prob = 1- g([1, 20, 80]*theta)  %不能上的概率

% Plot Newton's method result
% Only need 2 points to define a line, so choose two endpoints  线画长一点
plot_x = [min(x(:,2))-2,  max(x(:,2)+2)];
% Calculate the decision boundary line   分界线
%令函数g的值0.5，可推出theta'X=0,满足这个方程的点就是分界线，即：
%theta(1)*1+theta(2)*plot_x+theta(3)*plot_y=0,解出plot_y即可。
%plot_x,plot_y分别表示成绩1、成绩2
plot_y = (-1./theta(3)).*(theta(2).*plot_x +theta(1));
plot(plot_x, plot_y)
legend('Admitted', 'Not admitted', 'Decision Boundary')
hold off

% Plot J
figure
plot(0:MAX_ITR-1, J, 'o--', 'MarkerFaceColor', 'r', 'MarkerSize', 8)
xlabel('Iteration'); ylabel('J')
% Display J
J

结果分析：

采用牛顿法经过5次迭代成本函数J就可以收敛了。第4次和第5次迭代的结果差已经小于 $10^{-7}$