对数几率回归Logistic Regression（Matlab）

最新推荐文章于 2022-07-26 10:01:53 发布

zhifeng687

最新推荐文章于 2022-07-26 10:01:53 发布

阅读量1.5k

点赞数

机器学习专栏收录该内容

5 篇文章 0 订阅

订阅专栏

这里的数据均来源于吴恩达老师机器学习的课程。

上一篇内容是线性回归，利用线性模型进行回归学习，最终结果是找到一组合适的theta值，使代价函数的值最小，可是对于分类任务该如何解决呢？其实也是希望学习到的也是一组满足这种条件的theta。

先从简单的二分类问题考虑，比如从两门考试的成绩x1,x2，判断录取与否。对于结果y只有两种取值，,录取时y=1,不录取y=0.这时候，如果还像之前那怎样直接相乘算出预测值就不太合适了，因为此时需要把预测值的范围控制在[0,1],想要做到这一点，可以利用sigmoid函数，对数几率函数(logistic function)是sigmoid函数的重要代表。也就是说，要找到一组theta，在预测函数是这个样子的条件下，代价函数值最小。

总的来说，方法没有变，只是改了预测函数的形式。

二分类

[plain]view plain copy 
   
 data = load('ex2data1.txt');  
 X = data(:, [1, 2]); y = data(:, 3);  
 figure; hold on;  
 pos=find(y==1);  
 neg=find(y==0);  
 plot(X(pos,1),X(pos,2),'k+','LineWidth',2,'MarkerSize',7);  
 plot(X(neg,1),X(neg,2),'ko','MarkerFaceColor','y','MarkerSize',7);  
 % Labels and Legend  
 xlabel('Exam 1 score')  
 ylabel('Exam 2 score')  
 hold off;  
 [m, n] = size(X);  
 % Add intercept term to x and X_test  
 X = [ones(m, 1) X];  
 % Initialize fitting parameters  
 initial_theta = zeros(n + 1, 1);  
 options = optimset('GradObj', 'on', 'MaxIter', 400);  
 [theta, cost] = fminunc(@(t)(costFunction(t, X, y)), initial_theta, options);  
 %plot decision boundary  
 hold on;  
 if size(X, 2) <= 3  
     % Only need 2 points to define a line, so choose two endpoints  
     plot_x = [min(X(:,2))-2,  max(X(:,2))+2];  
     % Calculate the decision boundary line  
     plot_y = (-1./theta(3)).*(theta(2).*plot_x + theta(1));  
     plot(plot_x, plot_y)  
     % Legend, specific for the exercise  
     legend('Admitted', 'Not admitted', 'Decision Boundary')  
     axis([30, 100, 30, 100])  
 else  
     % Here is the grid range  
     u = linspace(-1, 1.5, 50);  
     v = linspace(-1, 1.5, 50);  
     z = zeros(length(u), length(v));  
     % Evaluate z = theta*x over the grid  
     for i = 1:length(u)  
         for j = 1:length(v)  
             z(i,j) = mapFeature(u(i), v(j))*theta;  
         end  
     end  
     z = z';   
     contour(u, v, z, [0, 0], 'LineWidth', 2)  
 end  
 hold off;  

[plain]view plain copy 
   
 %costFunction.m  
 function [J, grad] = costFunction(theta, X, y)  
 m = length(y); % number of training examples  
 J = 0;  
 grad = zeros(size(theta));  
 h=1.0./(1.0+exp(-1*X*theta));  
 m=size(y,1);  
 J=((-1*y)'*log(h)-(1-y)'*log(1-h))/m;  
 for i=1:size(theta,1),  
     grad(i)=((h-y)'*X(:,i))/m;  
 end  
 end