python labelencoder参数_sklearn-标准化标签LabelEncoder

最新推荐文章于 2024-06-25 12:36:22 发布

weixin_39826080

最新推荐文章于 2024-06-25 12:36:22 发布

阅读量472

点赞数

文章标签： python labelencoder参数

版权声明：本文为博主原创文章，遵循 CC 4.0 BY-SA 版权协议，转载请附上原文出处链接和本声明。

本文链接：https://blog.csdn.net/weixin_39826080/article/details/111914196

版权

本文介绍了Python机器学习中用于标签标准化的LabelEncoder，以及数据的标准化处理，包括StandardScaler（均值为0，方差为1）、MinMaxScaler（0到1范围）和归一化方法（L1、L2范数）。通过实例展示了这些方法的使用和效果。

摘要由CSDN通过智能技术生成

python机器学习-乳腺癌细胞挖掘(博主亲自录制视频)

sklearn.preprocessing.LabelEncoder()：标准化标签

standardScaler==features with a mean=0 and variance=1

minMaxScaler==features in a 0 to 1 range

normalizer==feature vector to a euclidean length=1

normalization

bring the values of each feature vector on a common scale

L1-least absolute deviations-sum of absolute values(on each row)=1;it is insensitive to outliers

L2-Least squares-sum of squares(on each row)=1;takes outliers in consideration during traing

# -*- coding: utf-8 -*-

"""

Created on Sat Apr 14 09:09:41 2018

@author:Toby

standardScaler==features with a mean=0 and variance=1

minMaxScaler==features in a 0 to 1 range

normalizer==feature vector to a euclidean length=1

normalization

bring the values of each feature vector on a common scale

L1-least absolute deviations-sum of absolute values(on each row)=1;it is insensitive to outliers

L2-Least squares-sum of squares(on each row)=1;takes outliers in consideration during traing

"""

from sklearn import preprocessing

import numpy as np

data=np.array([[2.2,5.9,-1.8],[5.4,-3.2,-5.1],[-1.9,4.2,3.2]])

bindata=preprocessing.Binarizer(threshold=1.5).transform(data)

print('Binarized data:',bindata)

#mean removal

print('Mean(before)=',data.mean(axis=0))

print('standard deviation(before)=',data.std(axis=0))

#features with a mean=0 and variance=1

scaled_data=preprocessing.scale(data)

print('Mean(before)=',scaled_data.mean(axis=0))

print('standard deviation(before)=',scaled_data.std(axis=0))

print('scaled_data:',scaled_data)

'''

scaled_data: [[ 0.10040991 0.91127074 -0.16607709]

[ 1.171449 -1.39221918 -1.1332319 ]

[-1.27185891 0.48094844 1.29930899]]

'''

#features in a 0 to 1 range

minmax_scaler=preprocessing.MinMaxScaler(feature_range=(0,1))

data_minmax=minmax_scaler.fit_transform(data)

print('MinMaxScaler applied on the data:',data_minmax)

'''

MinMaxScaler applied on the data: [[ 0.56164384 1. 0.39759036]

[ 1. 0. 0. ]

[ 0. 0.81318681 1. ]]

'''

data_l1=preprocessing.normalize(data,norm='l1')

data_l2=preprocessing.normalize(data,norm='l2')

print('l1-normalized data:',data_l1)

'''

[[ 0.22222222 0.5959596 -0.18181818]

[ 0.39416058 -0.23357664 -0.37226277]

[-0.20430108 0.4516129 0.34408602]]

'''

print('l2-normalized data:',data_l2)

'''

[[ 0.3359268 0.90089461 -0.2748492 ]

[ 0.6676851 -0.39566524 -0.63059148]

[-0.33858465 0.74845029 0.57024784]]

'''

weixin_39826080

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
python labelencoder参数_sklearn-标准化标签LabelEncoder

python机器学习-乳腺癌细胞挖掘(博主亲自录制视频)sklearn.preprocessing.LabelEncoder()：标准化标签standardScaler==features with a mean=0 and variance=1minMaxScaler==features in a 0 to 1 rangenormalizer==feature vector to a eucli...
复制链接

扫一扫

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。