1.数据标准化
import numpy as np
from sklearn.preprocessing import StandardScaler
‘’’
scale_: 缩放比例,同时也是标准差
mean_: 每个特征的平均值
var_:每个特征的方差
n_sample_seen_:样本数量,可以通过patial_fit 增加
‘’’
x = np.array(range(1, 10)).reshape(-1, 1)
ss = StandardScaler()
ss.fit(x)#一定要加fit
print(x)
print(ss.n_samples_seen_)
print(ss.mean_)
print(ss.var_)
print(ss.scale_)
print(‘标准化后的数据:’)
print(ss.fit_transform(x))
输出结果:
[[1]
[2]
[3]
[4]
[5]
[6]
[7]
[8]
[9]]
9
[ 5.]
[ 6.66666667]
[ 2.5819889]
标准化后的数据:
[[-1.54919334]
[-1.161895 ]
[-0.77459667]
[-0.38729833]
[ 0. ]
[ 0.38729833]
[ 0.77459667]
[ 1.161895 ]
[ 1.54919334]]