Coursera ML (3): Multivariate Linear Regression in Python

Multivariate Linear Regression and Programming Exercise 1. More at: 李飞阳


Gradient Descent for Multiple Variables

  • Suppose we have n features; set the hypothesis to be:

    $h_\theta(x) = \theta_0 + \theta_1 x_1 + \theta_2 x_2 + \cdots + \theta_n x_n$

  • Cost function:

    $J(\theta) = \frac{1}{2m}\sum_{i=1}^{m}\left(h_\theta(x^{(i)}) - y^{(i)}\right)^2$

  • Gradient descent algorithm:

    $\theta_j := \theta_j - \alpha\frac{1}{m}\sum_{i=1}^{m}\left(h_\theta(x^{(i)}) - y^{(i)}\right)x_j^{(i)}$ (update all $j$ simultaneously)

    Get every feature into approximately [-1, 1], i.e. just normalize all the features :) (see the sketch after this list)

  • Learning rate: not too big (it fails to converge), not too small (it converges too slowly).

  • Polynomial regression: use feature scaling here too (somewhat like normalizing dimensions).
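Concretely, the normalization takes only a few lines. A minimal sketch of mean normalization (my own code, not part of the exercise; note that NumPy's np.std defaults to the population convention, while MATLAB's std uses the sample one):

import numpy as np

def feature_normalize(X):
    # Center each column on its mean and divide by its standard deviation,
    # which puts typical features roughly into [-1, 1].
    mu = X.mean(axis=0)
    sigma = X.std(axis=0)
    return (X - mu) / sigma, mu, sigma

# Example: two features on very different scales.
X = np.array([[2104.0, 3.0], [1600.0, 3.0], [2400.0, 4.0]])
X_norm, mu, sigma = feature_normalize(X)

Remember to keep mu and sigma around: a new input has to be normalized with the training-set statistics before prediction.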

Programming Exercise 1

Download the exercise and data

The official download link for the Stanford Coursera Andrew Ng machine learning programming assignment (Exercise 1) seems to be blocked, so here is a mirror:
http://home.ustc.edu.cn/~mmmwhy/machine-learning-ex1.zip

Re-deriving the formulas:

There are really only two equations here:

  • computeCost

    $h_\theta(x) = \theta_0 + \theta_1 x_1 + \theta_2 x_2 + \theta_3 x_3 + \cdots + \theta_n x_n$

    $J(\theta_0, \theta_1) = \frac{1}{2m}\sum_{i=1}^{m}\left(h_\theta(x^{(i)}) - y^{(i)}\right)^2$

  • gradientDescent

    repeat until convergence: {
        $\theta_0 := \theta_0 - \alpha\frac{1}{m}\sum_{i=1}^{m}\left(h_\theta(x^{(i)}) - y^{(i)}\right)$
        $\theta_1 := \theta_1 - \alpha\frac{1}{m}\sum_{i=1}^{m}\left(h_\theta(x^{(i)}) - y^{(i)}\right)x^{(i)}$
    }
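These two equations map almost one-to-one onto vectorized NumPy code. A minimal sketch with plain ndarrays (the function names here are mine, not from the exercise):

import numpy as np

def compute_cost(X, y, theta):
    # J(theta) = 1/(2m) * sum_i (h(x_i) - y_i)^2
    m = len(y)
    residual = X @ theta - y
    return (residual @ residual) / (2 * m)

def gradient_step(X, y, theta, alpha):
    # One iteration of "repeat until convergence", updating every theta_j at once:
    # theta := theta - alpha/m * X^T (X theta - y)
    m = len(y)
    return theta - (alpha / m) * (X.T @ (X @ theta - y))

The full script below does the same thing with np.mat, matching the original MATLAB code more literally.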

Python implementation of the fit

The original was MATLAB code; I re-implemented it in Python, and the result is the same:

import numpy as np
import pandas as pd
import matplotlib.pyplot as plt


def readfile(path):
    # Each line is "population,profit"; prepend a 1 to every x for the intercept term.
    X = []
    y = []
    with open(path, 'r') as f:
        for line in f:
            population, profit = line.split(',')
            X.append([1, float(population)])
            y.append(float(profit))
    return X, y


def dataplot(X, theta, y):
    # Scatter the training data and overlay the fitted line X * theta.
    # (Takes the full design matrix; the original relied on the global X by accident.)
    plt.plot(X[:, 1], y, 'rx', markersize=10)
    plt.ylabel('Profit in $10,000s')
    plt.xlabel('Population of City in 10,000s')
    plt.plot(X[:, 1], X * theta, '-')
    plt.show()


def computeCost(X, y, theta):
    # J(theta) = 1/(2m) * sum_i (h(x_i) - y_i)^2
    m = len(y)
    J = 0
    for i in range(m):
        J = J + float((X[i] * theta - y[i]) ** 2)
    return J / (2 * m)

def gradientDescent(X, y, theta, alpha, num_iters):
    # Note: the original hard-coded num_iters = 1500 here, silently
    # overriding the argument; that line is removed.
    m = len(y)
    J_history = np.zeros(num_iters)
    for i in range(num_iters):
        # theta := theta - alpha/m * X^T (X*theta - y), all components at once
        S = X.T * (X * theta - np.mat(y).T) / m
        theta = theta - alpha * S
        J_history[i] = computeCost(X, y, theta)
    return theta

if __name__=="__main__":
    theta = np.mat([[0],[0]])
    iterations = 1500
    alpha = 0.01
    iterations = 1500
    path = "C:\Users\wing\Documents\MATLAB\ex1\ex1data1.txt"

    x,y = readfile(path)# 小写的X不是矩阵,是list,大写的X是矩阵。
    X = np.mat(x)
    J = computeCost(X, y, theta)
    theta = gradientDescent(X, y, theta, alpha, iterations)
    dataplot(X[:,1],theta,y)


The output figure is a bit small, but it will do.
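The J_history array inside gradientDescent is also a handy check on the learning-rate advice above: with a well-chosen alpha the cost decreases monotonically, while one that is too large makes it blow up. A sketch, assuming gradientDescent is changed to return theta, J_history:

theta, J_history = gradientDescent(X, y, theta, alpha, iterations)  # assumed variant

plt.plot(range(len(J_history)), J_history, '-')
plt.xlabel('Iteration')
plt.ylabel('Cost J')
plt.show()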
