Question 1: Linear Regression
1.1 Linear Regression with a single variable and Gradient Descent for a single variable
1.2 Linear Regression with multiple variables and Gradient Descent for multiple variables
1.3 Normal Equation
1.4 Compare Gradient Descent and Normal Equation
Gradient Descent
Need to choose the learning rate α.
Needs many iterations.
Works well even when n (the number of features) is large.
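The points above can be seen in a minimal sketch of batch gradient descent for linear regression. The function name gradient_descent, the values alpha=0.1 and num_iters=2000, and the toy data are assumptions chosen for illustration, not values from these notes.

```python
import numpy as np

def gradient_descent(X, y, alpha=0.1, num_iters=2000):
    """Batch gradient descent for linear regression (illustrative sketch).

    X: (m, n+1) design matrix whose first column is all ones.
    y: (m,) target vector.
    alpha: learning rate, which has to be chosen by hand.
    """
    m = X.shape[0]
    theta = np.zeros(X.shape[1])
    for _ in range(num_iters):                # many iterations are needed
        gradient = X.T @ (X @ theta - y) / m  # gradient of the squared-error cost
        theta -= alpha * gradient
    return theta

# Hypothetical toy data generated from y = 1 + 2x, without noise.
x = np.array([0.0, 1.0, 2.0, 3.0])
X = np.column_stack([np.ones_like(x), x])
y = 1.0 + 2.0 * x

theta = gradient_descent(X, y)
print(theta)  # close to [1, 2]
```

If α is too large the updates diverge, and if it is too small convergence becomes very slow, which is why α has to be tuned.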
Normal Equation
No need to choose α.
Don’t need to iterate.
Need to compute $(X^T X)^{-1}$.
Slow if n is very large.
Inverting the n × n matrix $X^T X$ increases the computational complexity (roughly $O(n^3)$) and the memory required.
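As a sketch of the normal equation, θ = $(X^T X)^{-1} X^T y$, the snippet below solves for θ in one step with no learning rate and no iterations. The function name normal_equation and the toy data are assumptions for illustration. It uses np.linalg.solve on the system $(X^T X)θ = X^T y$ instead of forming the inverse explicitly; this gives the same θ when $X^T X$ is invertible and is usually the numerically safer choice.

```python
import numpy as np

def normal_equation(X, y):
    """Closed-form least-squares solution (illustrative sketch).

    Solves (X^T X) theta = X^T y directly, without explicitly inverting X^T X.
    """
    return np.linalg.solve(X.T @ X, X.T @ y)

# Hypothetical toy data generated from y = 1 + 2x, without noise.
x = np.array([0.0, 1.0, 2.0, 3.0])
X = np.column_stack([np.ones_like(x), x])
y = 1.0 + 2.0 * x

theta = normal_equation(X, y)
print(theta)  # close to [1, 2]
```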
What if $X^T X$ is non-invertible?
(1) Redundant features (linearly dependent).
(2) Too many features (e.g. m ≤ n, where m is the number of training examples and n the number of features).
Remedy: delete some features, or use regularization.
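Both remedies can be sketched on a small example with a redundant feature. The toy data, the regularization strength lam = 1e-3, and the variable names are assumptions for illustration. The regularized form used here, θ = $(X^T X + λL)^{-1} X^T y$ with L the identity matrix except for a zero in the intercept position, is one common choice and is not spelled out in these notes.

```python
import numpy as np

# Hypothetical data: the third column is exactly twice the second,
# so the features are linearly dependent.
x = np.array([0.0, 1.0, 2.0, 3.0])
X = np.column_stack([np.ones_like(x), x, 2.0 * x])
y = 1.0 + 2.0 * x

A = X.T @ X
rank = np.linalg.matrix_rank(A, tol=1e-8)  # 2, not 3: A is singular

# Remedy 1: delete the redundant feature, then solve as usual.
X_reduced = X[:, :2]
theta_reduced = np.linalg.solve(X_reduced.T @ X_reduced, X_reduced.T @ y)

# Remedy 2: regularization. Adding lam * L makes the matrix invertible.
lam = 1e-3                 # assumed regularization strength
L = np.eye(X.shape[1])
L[0, 0] = 0.0              # do not penalize the intercept term
theta_reg = np.linalg.solve(A + lam * L, X.T @ y)

print(rank)
print(theta_reduced)       # close to [1, 2]
print(X @ theta_reg)       # predictions stay close to y
```

With regularization the weight on the duplicated feature is shared between the two dependent columns, so the individual coefficients differ from the reduced model even though the predictions are nearly identical.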