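Using the Boston housing dataset as an example, this section walks through data normalization and data standardization.
Data normalization: rescales each feature linearly to the range [0, 1]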
sklearn.preprocessing.MinMaxScaler()
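from sklearn import preprocessing
# Note: load_boston was deprecated in scikit-learn 1.0 and removed in 1.2,
# so this example needs an older scikit-learn release.
from sklearn.datasets import load_boston
import pandas as pd
# Load the feature matrix and the target values
data = load_boston()
x = data.data
y = data.target
# Original data
df = pd.DataFrame(x, columns=data.feature_names)
print(df)
# Fit the scaler on the features and rescale each column to [0, 1]
scaler = preprocessing.MinMaxScaler()
x = scaler.fit_transform(x)
# Normalized data
df = pd.DataFrame(x, columns=data.feature_names)
print(df)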
Original data
Normalized data
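To make the rescaling concrete, here is a minimal pure-Python sketch of the per-column min-max formula, x' = (x - min) / (max - min), mapped into a target range. The helper name `min_max_scale` and the sample values are hypothetical and are not part of scikit-learn; the sketch only illustrates the arithmetic under the default range [0, 1].

```python
def min_max_scale(column, feature_range=(0.0, 1.0)):
    """Linearly rescale a list of numbers into feature_range (illustrative helper)."""
    lo, hi = feature_range
    col_min, col_max = min(column), max(column)
    span = col_max - col_min
    if span == 0:
        # Constant column: there is no spread to rescale, so this sketch
        # maps every value to the lower bound of the range.
        return [lo for _ in column]
    return [lo + (v - col_min) / span * (hi - lo) for v in column]


values = [10.0, 20.0, 25.0, 50.0]  # hypothetical feature values
scaled = min_max_scale(values)
print(scaled)  # [0.0, 0.25, 0.375, 1.0]
```

The smallest value maps to 0 and the largest to 1, so a single extreme outlier compresses all other values toward one end of the range.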
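Data standardization: rescales each feature to zero mean and unit variance. This only shifts and scales the values; it does not change the shape of the distribution, so the data does not become normally distributed unless it already was.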
sklearn.preprocessing.StandardScaler()
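from sklearn import preprocessing
# Note: load_boston was deprecated in scikit-learn 1.0 and removed in 1.2,
# so this example needs an older scikit-learn release.
from sklearn.datasets import load_boston
import pandas as pd
# Load the feature matrix and the target values
data = load_boston()
x = data.data
y = data.target
# Original data
df = pd.DataFrame(x, columns=data.feature_names)
print(df)
# Fit the scaler on the features and transform each column to mean 0, variance 1
scale = preprocessing.StandardScaler()
x = scale.fit_transform(x)
# Standardized data
df = pd.DataFrame(x, columns=data.feature_names)
print(df)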
Standardized data
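As a rough illustration of the arithmetic behind standardization, the following pure-Python sketch applies z = (x - mean) / std to one column, using the population standard deviation (dividing by n), which is the convention StandardScaler follows. The helper name `standardize` and the sample values are hypothetical.

```python
import statistics


def standardize(column):
    """Shift a list of numbers to mean 0 and scale it to unit variance (illustrative helper)."""
    mean = statistics.fmean(column)
    std = statistics.pstdev(column)  # population standard deviation (divides by n)
    if std == 0:
        # Constant column: no spread to scale, so this sketch returns zeros.
        return [0.0 for _ in column]
    return [(v - mean) / std for v in column]


values = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]  # hypothetical feature values
z = standardize(values)
print(z)
print(statistics.fmean(z), statistics.pstdev(z))  # approximately 0 and 1
```

Because the transform is a shift followed by a uniform scale, the relative spacing of the values is preserved: a skewed column stays skewed after standardization.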