吴恩达机器学习笔记（五）--多变量线性回归

最新推荐文章于 2023-07-03 03:37:25 发布

步渊comajor

最新推荐文章于 2023-07-03 03:37:25 发布

阅读量280

点赞数

分类专栏：系统学习文章标签：人工智能机器学习学习笔记

本文链接：https://blog.csdn.net/comajor/article/details/87350208

版权

系统学习专栏收录该内容

10 篇文章 0 订阅

订阅专栏

吴恩达机器学习笔记（五）–多变量线性回归

学习基于：吴恩达机器学习.

1. Multiple Features

Linear regression with multiple variables is also known as “multivariate linear regression”.

equation	notation
$x_j^{(i)}$	value of feature j in the i^th training example
$x^{(i)}$	the input (features) of the i^th training example
$m$	the number of training examples
$n$	the number of features

The multivariable form of the hypothesis function accommodating these multiple features is as follows:
$h_\theta(x) = \theta_0x_0 + \theta_1x_1 + \theta_2x_2 + ... + \theta_nx_n$ 　　　　 $x_0 \equiv 1 )$
Using the definition of matrix multiplication, our multivariable hypothesis function can be concisely represented as:
$h_\theta(x) = \left[ \begin{matrix} \theta_0 & \theta_1 & ... & \theta_n \end{matrix} \right]\left[ \begin{matrix} x_0 \\ x_1 \\ ... \\ x_n \end{matrix} \right] = \theta^Tx$

2. Gradient Descent For Multiple Variables

The gradient descent equation itself is generally the same form; we just have to repeat it for our ‘n’ features:

repeat until convergence: {
　 $\theta_{j} := \theta_{j} - \alpha\frac{1}{m}\sum_{i = 1}^{m}(h_{\theta}(x^{i})-y^{i})x_j^{(i)}$
　for $j : = 0 . . . n$
}

1） Feature Scaling

We can speed up gradient descent by having each of our input values in roughly the same range. This is because θ will descend quickly on small ranges and slowly on large ranges, and so will oscillate inefficiently down to the optimum when the variables are very uneven.

The way to prevent this is to modify the ranges of our input variables so that they are all roughly the same. Ideally:
- $\leq x_i \leq 1$

2） Learning Rate

This is the gradient descent algorithm:
- $\theta_{j} := \theta_{j} - \alpha\frac{\partial}{\partial\theta_{j}}J(\theta_{0}, \theta_{1}).$
We need to adjust the value of $\alpha$ so that gradient descent can converge