数据预处理常用方法
1.filling null data
fill with average value.
using logistic regression or naive bayes algorithm to predict the null data according to other data nearby.
2.data smoothness
3.clean outlier data
4.rectify inconsistent data
5。standardization z-score
好数据的要求:
1. 标准化
2. 均方差