Machine Learning Course 4: Model Selection

This lecture discusses the importance of model selection in machine learning. The average error on testing data splits into two kinds of error: one due to bias and one due to variance. Bias reflects how strongly the model simplifies, while variance reflects how sensitive the model is to the training data. Overfitting and underfitting are the key diagnostic questions; bias can be reduced by adding features or using a more complex model, and variance by regularization. The goal of model selection is a model that balances bias and variance, and cross-validation is an effective way to find it.

Course 4

Average Error on testing data

Two main errors:

Error due to bias
Error due to variance

Estimator

Only god knows the best function $\hat f$; from the training data we can only obtain a function $f^*$. We say:

$$f^* \text{ is an estimator of } \hat f$$

and the difference between $f^*$ and $\hat f$ comes from bias and variance.

Bias and Variance of Estimator

Suppose the mean of a variable $x$ is $\mu$ and its variance is $\sigma^2$.

Sample $N$ points: $\{x^1, x^2, ..., x^N\}$

$$m=\frac{1}{N}\sum_n x^n\neq\mu \qquad s^2=\frac{1}{N}\sum_n (x^n-m)^2$$

$$E[m]=E\left[\frac{1}{N}\sum_n x^n\right]=\frac{1}{N}\sum_n E[x^n]=\mu \qquad E[s^2]=\frac{N-1}{N}\sigma^2$$

So $m$ is an unbiased estimator of $\mu$, while $s^2$ is a biased estimator of $\sigma^2$.

$$Var[m]=\frac{\sigma^2}{N}$$

which shows how much $m$ deviates from $\mu$; this variance shrinks as the sample size $N$ grows.
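These properties can be checked numerically. The sketch below is an illustration (not from the course); it assumes a normal distribution, but the bias results hold for any distribution with finite variance:

```python
import numpy as np

# Monte Carlo check of the estimator properties above.
# Assumed setup: x ~ Normal(mu, sigma^2), many independent samples of size N.
rng = np.random.default_rng(0)
mu, sigma, N, trials = 3.0, 2.0, 5, 200_000

samples = rng.normal(mu, sigma, size=(trials, N))
m = samples.mean(axis=1)                          # m = (1/N) * sum_n x^n
s2 = ((samples - m[:, None]) ** 2).mean(axis=1)   # s^2 with the biased 1/N factor

print(m.mean())   # close to mu: E[m] = mu, so m is unbiased
print(s2.mean())  # close to (N-1)/N * sigma^2 = 3.2, so s^2 is biased
print(m.var())    # close to sigma^2 / N = 0.8
```

Averaging over many trials makes the empirical means approach the expectations above; the biased factor $(N-1)/N$ is clearly visible for small $N$.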

The relationship between these parameters is shown below:

[Figure: bias and variance of an estimator]

A simple model has small variance and a complicated model has large variance, since a simpler model is less likely to be influenced by the sampled data.

[Figure: variance of simple vs. complex models]

Diagnosis

  • If your model cannot even fit the training data, you have large bias: this is underfitting
  • If you can fit the training data but get a large error on the testing data, you probably have large variance: this is overfitting

For large bias, redesign your model:

  • Add more features as input
  • A more complex model may be needed

For large variance:

  • More data is needed (very effective, but not always practical)
  • Regularization
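As a concrete toy illustration of this diagnosis (an assumed setup, not from the course), the sketch below fits polynomials of three degrees to noisy samples of a cubic and compares training and test error:

```python
import numpy as np

# Toy under/overfitting demo (assumed setup for illustration):
# the "true" function is a cubic, observed with Gaussian noise.
rng = np.random.default_rng(1)
f_hat = lambda x: x**3 - x

x_train = rng.uniform(-2, 2, 15)
y_train = f_hat(x_train) + rng.normal(0, 1.0, x_train.size)
x_test = rng.uniform(-2, 2, 200)
y_test = f_hat(x_test) + rng.normal(0, 1.0, x_test.size)

results = {}
for degree in (1, 3, 9):
    coeffs = np.polyfit(x_train, y_train, degree)
    train_mse = np.mean((np.polyval(coeffs, x_train) - y_train) ** 2)
    test_mse = np.mean((np.polyval(coeffs, x_test) - y_test) ** 2)
    results[degree] = (train_mse, test_mse)

# Typical pattern: degree 1 has large error on BOTH sets (underfitting,
# large bias); degree 9 has the smallest training error but generalizes
# worse than degree 3 (overfitting, large variance).
for degree, (tr, te) in results.items():
    print(degree, round(tr, 2), round(te, 2))
```

Note that the training error can only go down as the degree grows (each lower-degree model is nested in the higher-degree one); it is the test error that exposes the variance problem.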

Model Selection

There is usually a trade-off between bias and variance

Select a model that balances two kinds of error to minimize total error

Cross validation could be a way to strike this balance:

[Figure: cross-validation]

And an advanced method called N-fold Cross Validation can be used:

[Figure: N-fold cross-validation]
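The idea can be sketched as follows (a minimal illustration with hypothetical toy data; in practice a library such as scikit-learn does the fold bookkeeping): the training data is split into N folds, each fold serves once as the validation set, and the model with the lowest average validation error is selected.

```python
import numpy as np

# Minimal 3-fold cross-validation for choosing a polynomial degree.
# Toy data (an assumption for illustration): noisy samples of a cubic.
rng = np.random.default_rng(2)
x = rng.uniform(-2, 2, 30)
y = x**3 - x + rng.normal(0, 1.0, x.size)

n_folds = 3
idx = rng.permutation(x.size)
folds = np.array_split(idx, n_folds)  # disjoint validation folds

def cv_error(degree):
    errs = []
    for k in range(n_folds):
        val = folds[k]  # fold k is held out for validation
        train = np.concatenate([folds[j] for j in range(n_folds) if j != k])
        coeffs = np.polyfit(x[train], y[train], degree)
        errs.append(np.mean((np.polyval(coeffs, x[val]) - y[val]) ** 2))
    return float(np.mean(errs))  # average validation error over the N folds

scores = {d: cv_error(d) for d in (1, 3, 9)}
best = min(scores, key=scores.get)  # model with lowest average error wins
print(scores, best)
```

Because every point is used for validation exactly once, the averaged error is a less noisy estimate of generalization error than a single train/validation split.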
