Assume you have a classification model, training data and testing data
假设您有一个分类模型,训练数据和测试数据
x_train , y_train // This is the training data
x_test , y_test // This is the testing data
y_predicted // the values predicted by the model given an input
The error rate is the average error of value predicted by the model and the correct value.
误差率是模型预测的值和正确值的平均误差。
偏压 (Bias)
Let’s assume we have trained the model and are trying to predict values with input ‘x_train’. The predicted values are y_predicted. Bias is the error rate of y_predicted and y_train.
假设我们已经训练了模型,并尝试使用输入“ x_train”预测值。 预测值是y_predicted。 偏差是y_predicted和y_train的错误率。
In simple terms,think of bias as the error rate of the training data.
简而言之,可以将偏差视为训练数据的错误率。
When the error rate is high, we call it High Bias and when the error rate is low, we call it Low Bias
当错误率高时,我们将其称为高偏差;当错误率低时,我们将其称为低偏差
方差(Variance)
Let’s assume we have trained the model and this time we are trying to predict values with input ‘x_test’. Again, the predicted values are y_predicted. Variance is the error rate of the y_predicted and y_test
假设我们已经训练了模型,这次我们尝试使用输入“ x_test”预测值。 同样,预测值是y_predicted。 方差是y_predicted和y_test的错误率
In simple terms, think of variance as the error rate of the testing data.
简单来说,将方差视为测试数据的错误率。
When the error rate is high, we call it High Variance and when the error rate is low, we call it Low Variance
当错误率高时,我们称之为高方差;当错误率低时,我们称之为低方差。
不合身 (Underfitting)
When the model has a high error rate in the training data, we can say the model is underfitting. This usually occurs when the number of training samples is too low. Since our model p