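This assignment consists of the following files; the ones marked with * are the ones we need to complete.
Then work through them in order, following the provided PDF handout.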
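1. Visualizing the data
The PDF handout gives this code directly; just copy it into plotData.m.
The figure is drawn with plot: X is the data matrix, with one feature per column (here an m*2 matrix, i.e., two features); y is the m*1 vector of labels, each either 0 or 1.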
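% pos holds the row indices where y is 1; neg holds the row indices where y is 0
pos = find(y == 1); neg = find(y == 0);
% Plot Examples: 'k+' is a black +, 'ko' is a black o, LineWidth is the line width,
% MarkerSize is the size of the + or o, MarkerFaceColor 'y' fills the circles yellow
% Note: both plot calls must land on the same axes, so 'hold on' has to be active
% (the plotData.m template is assumed to issue 'figure; hold on;' before this point)
plot(X(pos, 1), X(pos, 2), 'k+', 'LineWidth', 2, 'MarkerSize', 7);
plot(X(neg, 1), X(neg, 2), 'ko', 'MarkerFaceColor', 'y', 'MarkerSize', 7);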
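To make the role of find concrete, here is a minimal sketch on a toy data set. The values are invented for illustration and do not come from ex2data1.txt; the names X_pos, X_neg, and X_pos_logical are hypothetical.

```octave
% Toy data, invented for illustration (not from ex2data1.txt)
X = [1 2; 3 4; 5 6; 7 8];   % 4 examples, 2 features
y = [1; 0; 1; 0];           % labels

pos = find(y == 1);         % row indices of the positive examples
neg = find(y == 0);         % row indices of the negative examples

X_pos = X(pos, :);          % rows of X whose label is 1
X_neg = X(neg, :);          % rows of X whose label is 0

% Logical indexing selects the same rows without calling find
X_pos_logical = X(y == 1, :);
```

Either indexing style works for plotting; find is simply what the handout uses.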
2. sigmoid.m: this one is simple, just the sigmoid (S-shaped) function
function g = sigmoid(z)
%SIGMOID Compute sigmoid function
% g = SIGMOID(z) computes the sigmoid of z.
% You need to return the following variables correctly
g = zeros(size(z));
% ====================== YOUR CODE HERE ======================
% Instructions: Compute the sigmoid of each value of z (z can be a matrix,
% vector or scalar).
g=1./(1+exp(-z));
% =============================================================
end
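A few quick sanity checks can confirm that the function behaves elementwise on scalars, vectors, and matrices. This is only a sketch: the anonymous function below stands in for sigmoid.m so that the snippet runs on its own, and the variable names are hypothetical.

```octave
% Anonymous-function stand-in for sigmoid.m so the snippet is self-contained
sigmoid = @(z) 1 ./ (1 + exp(-z));

s0 = sigmoid(0);                         % midpoint of the curve: 0.5
sv = sigmoid([-100 0 100]);              % saturates toward 0 and 1 at the extremes
sm = sigmoid(zeros(2, 3));               % matrix in, matrix of the same size out
sym_gap = sigmoid(2) + sigmoid(-2) - 1;  % symmetry: sigmoid(z) + sigmoid(-z) = 1
```

The elementwise operator ./ is what lets the same line handle all three input shapes.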
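3. costFunction.m: computes the cost function J(θ) and its gradient, which an optimizer then uses to minimize J(θ) (ex2.m passes them to fminunc rather than running a hand-written gradient descent loop)
In ex2.m, X starts as an m*n matrix and becomes m*(n+1) once the column of ones is prepended; y is an m*1 vector and theta is an (n+1)*1 vector.
The relevant code in ex2.m is: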
data = load('ex2data1.txt');
X = data(:, [1, 2]); y = data(:, 3);
[m, n] = size(X);
X = [ones(m, 1) X];
initial_theta = zeros(n + 1, 1);
[cost, grad] = costFunction(initial_theta, X, y);
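For reference, the two quantities that costFunction has to return can be written as follows. The notation is the standard one for logistic regression and is assumed here rather than quoted from the handout; $x^{(i)}$ denotes the i-th row of X (including the leading 1) and $g$ is the sigmoid function.

```latex
h_\theta(x) = g(\theta^{T} x), \qquad g(z) = \frac{1}{1 + e^{-z}}

J(\theta) = \frac{1}{m} \sum_{i=1}^{m}
  \left[ -y^{(i)} \log h_\theta(x^{(i)}) - (1 - y^{(i)}) \log\bigl(1 - h_\theta(x^{(i)})\bigr) \right]

\frac{\partial J(\theta)}{\partial \theta_j} = \frac{1}{m} \sum_{i=1}^{m}
  \bigl( h_\theta(x^{(i)}) - y^{(i)} \bigr)\, x_j^{(i)}

% Vectorized form, with h = g(X\theta) an m-by-1 vector:
J(\theta) = \frac{1}{m} \left( -y^{T} \log h - (1 - y)^{T} \log(1 - h) \right),
\qquad
\nabla J(\theta) = \frac{1}{m} X^{T} (h - y)
```

The vectorized form is the one that maps directly onto a one-line Octave expression for each of J and grad.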
In costFunction.m:
function [J, grad] = costFunction(theta, X, y)
%COSTFUNCTION Compute cost and gradient for logistic regression
% J = COSTFUNCTION(theta, X, y) computes the cost of using theta as the
% parameter for logistic regression and the gradient of the cost
% w.r.t. to the parameters.
% Initialize some useful values
m = length(y); % number of training examples
% You need to return the following variables correctly
J = 0;
grad = zeros(size(theta));
% ====================== YOUR CODE HERE ======================
% Instructions: Compute the cost of a particular choice of theta.
% You should set J to the cost.
% Compute the partial derivatives and set grad to the partial
% derivatives of the cost w.r.t. each parameter in theta
%
% Note: grad should have the same dimensions as theta
%
h = sigmoid(X * theta); % m*1 vector of predicted probabilities
J = (-y' * log(h) - (1 - y)' * log(1 - h)) / m;
grad = X' * (h - y) / m;
% =============================================================
end
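One way to gain confidence in an analytic gradient like this is to compare it against a central finite-difference approximation of the cost. The sketch below does that on a small data set invented for illustration; cost, gradf, and eps_fd are hypothetical names, and the anonymous functions stand in for sigmoid.m and costFunction.m so that the snippet runs on its own.

```octave
% Stand-ins for sigmoid.m and costFunction.m (cost and gradient split in two)
sigmoid = @(z) 1 ./ (1 + exp(-z));
cost  = @(theta, X, y) (-y' * log(sigmoid(X * theta)) ...
                        - (1 - y)' * log(1 - sigmoid(X * theta))) / length(y);
gradf = @(theta, X, y) X' * (sigmoid(X * theta) - y) / length(y);

% Toy data, invented for illustration; the first column is the intercept term
X = [1 0.5 1.2; 1 -1.0 0.3; 1 2.0 -0.7; 1 0.1 0.1];
y = [1; 0; 1; 0];
theta = [0.1; -0.2; 0.3];

% Central differences: perturb one parameter at a time
eps_fd = 1e-5;
num_grad = zeros(size(theta));
for j = 1:numel(theta)
  step = zeros(size(theta));
  step(j) = eps_fd;
  num_grad(j) = (cost(theta + step, X, y) - cost(theta - step, X, y)) / (2 * eps_fd);
end
max_diff = max(abs(num_grad - gradf(theta, X, y)));   % should be tiny

% With theta = 0 every prediction is 0.5, so the cost is log(2), about 0.693
J_zero = cost(zeros(3, 1), X, y);
```

A max_diff many orders of magnitude below the gradient entries suggests that the cost and gradient expressions agree with each other.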
4. costFunctionReg.m: the regularized J(θ) and grad
function [J, grad] = costFunctionReg(theta, X, y, lambda)
%COSTFUNCTIONREG Compute cost and gradient for logistic regression with regularization
% J = COSTFUNCTIONREG(theta, X, y, lambda) computes the cost of using
% theta as the parameter for regularized logistic regression and the
% gradient of the cost w.r.t. to the parameters.
% Initialize some useful values
m = length(y); % number of training examples
% You need to return the following variables correctly
J = 0;
grad = zeros(size(theta));
% ====================== YOUR CODE HERE ======================
% Instructions: Compute the cost of a particular choice of theta.
% You should set J to the cost.
% Compute the partial derivatives and set grad to the partial
% derivatives of the cost w.r.t. each parameter in theta
h = sigmoid(X * theta); % m*1 vector of predicted probabilities
reg = theta(2:end); % theta(1), the intercept term, is not regularized
J = (-y' * log(h) - (1 - y)' * log(1 - h)) / m + lambda / (2 * m) * (reg' * reg);
grad = X' * (h - y) / m;
grad(2:end) = grad(2:end) + lambda / m * reg;
% =============================================================
end
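The key detail in the regularized version is that the intercept parameter theta(1) is left out of the penalty. The sketch below illustrates this on toy values invented for the example. The names penalized, J_plain, and grad_plain are hypothetical, and the code is inlined rather than calling costFunctionReg so that the snippet runs on its own.

```octave
sigmoid = @(z) 1 ./ (1 + exp(-z));

% Toy data, invented for illustration; the first column is the intercept term
X = [1 0.5 1.2; 1 -1.0 0.3; 1 2.0 -0.7; 1 0.1 0.1];
y = [1; 0; 1; 0];
theta = [0.1; -0.2; 0.3];
lambda = 1;
m = length(y);

% Unregularized cost and gradient
h = sigmoid(X * theta);
J_plain = (-y' * log(h) - (1 - y)' * log(1 - h)) / m;
grad_plain = X' * (h - y) / m;

% Copy of theta with a zero in the intercept slot, so theta(1) adds no penalty
penalized = [0; theta(2:end)];
J_reg = J_plain + lambda / (2 * m) * (penalized' * penalized);
grad_reg = grad_plain + lambda / m * penalized;
```

Here the penalty adds (0.2^2 + 0.3^2) / 8 = 0.01625 to the cost, and the first gradient entry is unchanged. Setting lambda to 0 recovers the unregularized values.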
5. predict.m:
function p = predict(theta, X)
%PREDICT Predict whether the label is 0 or 1 using learned logistic
%regression parameters theta
% p = PREDICT(theta, X) computes the predictions for X using a
% threshold at 0.5 (i.e., if sigmoid(theta'*x) >= 0.5, predict 1)
m = size(X, 1); % Number of training examples
% You need to return the following variables correctly
p = zeros(m, 1);
% ====================== YOUR CODE HERE ======================
% Instructions: Complete the following code to make predictions using
% your learned logistic regression parameters.
% You should set p to a vector of 0's and 1's
%
% Vectorized: predict 1 where the estimated probability is at least 0.5, else 0
p = double(sigmoid(X * theta) >= 0.5);
% =========================================================================
end
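Once predictions are available, training accuracy is the percentage of examples where the prediction matches the label. The sketch below uses a hypothetical theta and toy data invented for illustration; sigmoid is defined inline so that the snippet runs on its own.

```octave
sigmoid = @(z) 1 ./ (1 + exp(-z));

% Toy data and parameters, invented for illustration
X = [1 2; 1 -1; 1 0; 1 3];   % first column is the intercept term
theta = [0; 1];              % hypothetical learned parameters
y = [1; 0; 0; 1];            % true labels

% X * theta = [2; -1; 0; 3]; the third example sits exactly on the 0.5
% threshold, so it is predicted as 1
p = double(sigmoid(X * theta) >= 0.5);

% Fraction of matching predictions, as a percentage
accuracy = mean(double(p == y)) * 100;
```

Because sigmoid(z) >= 0.5 exactly when z >= 0, the same predictions could also be obtained from X * theta >= 0 without evaluating the sigmoid.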