Understanding Mean/Median Imputation and Its Implementation Using feature-engine
Mean or Median Imputation:
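The mean or median should be calculated on the train set only, and that same value should then be used to replace NA in both the train and test sets. Keeping the test set out of the calculation helps avoid over-fitting, because no information from the test data leaks into the preprocessing step.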
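The train-only rule can be sketched with plain pandas. This is a minimal illustration, not the article's own implementation: the column names `age` and `fare` and all values are made up, and `fillna` stands in for whichever imputer you actually use.

```python
import numpy as np
import pandas as pd

# Hypothetical train and test sets with missing values (NaN).
train = pd.DataFrame({
    "age": [20.0, 30.0, np.nan, 40.0],
    "fare": [10.0, np.nan, 30.0, 50.0],
})
test = pd.DataFrame({
    "age": [np.nan, 100.0],
    "fare": [np.nan, 5.0],
})

# Learn the replacement values from the train set only.
# DataFrame.median() skips NaN by default.
fill_values = train.median()

# Apply the same train-derived values to both sets.
train_imputed = train.fillna(fill_values)
test_imputed = test.fillna(fill_values)

print(fill_values)
print(test_imputed)
```

Note that the test set's own median for `age` would be 100.0, but the missing test value is filled with the train median instead.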
Mean/Median Imputation: Definition:
Mean/median imputation consists of replacing all occurrences of missing values (NA) within a variable with the mean or median of that variable's observed values.
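As a small worked example of the definition, the sketch below uses only the Python standard library. The `ages` list is hypothetical, and missing values are represented as NaN by assumption.

```python
import math
import statistics

# Hypothetical variable with two missing entries, encoded as NaN.
ages = [22.0, 35.0, math.nan, 41.0, 29.0, math.nan, 78.0]

# Compute the statistics on the observed (non-missing) values only.
observed = [a for a in ages if not math.isnan(a)]
mean_value = statistics.mean(observed)
median_value = statistics.median(observed)

# Replace every missing entry with the chosen statistic.
mean_imputed = [mean_value if math.isnan(a) else a for a in ages]
median_imputed = [median_value if math.isnan(a) else a for a in ages]

print(mean_value, median_value)
print(mean_imputed)
print(median_imputed)
```

Here the single large value (78.0) pulls the mean above the median, so the two methods fill the gaps with different numbers.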
Which variables can I impute with Mean/Median Imputation?
· The mean and median can only be calculated for numerical variables, so these methods are suitable only for continuous and discrete numerical variables.
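One way to respect this restriction in practice is to select the numerical columns before imputing. The sketch below assumes a pandas DataFrame; the columns `age` (continuous), `rooms` (discrete), and `city` (categorical) are invented for illustration.

```python
import numpy as np
import pandas as pd

# Hypothetical data: two numerical variables and one categorical variable.
df = pd.DataFrame({
    "age": [25.0, np.nan, 40.0, 31.0],
    "rooms": [3.0, np.nan, 2.0, 3.0],
    "city": ["Paris", None, "Rome", "Oslo"],
})

# Keep only the numerical variables; "city" is excluded.
numeric_cols = df.select_dtypes(include="number").columns.tolist()

# Impute the numerical variables with their medians.
medians = df[numeric_cols].median()
df_imputed = df.copy()
df_imputed[numeric_cols] = df[numeric_cols].fillna(medians)

print(numeric_cols)
print(df_imputed)
```

The categorical column is left untouched and still contains its missing value, which would need a different imputation method.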