python sklearn 编码(one-hot,标签编码)

最新推荐文章于 2024-10-25 16:55:48 发布

廷益--飞鸟

最新推荐文章于 2024-10-25 16:55:48 发布

阅读量3.7k

点赞数 1

分类专栏： python

本文链接：https://blog.csdn.net/weixin_45875105/article/details/107818766

版权

python 专栏收录该内容

121 篇文章 17 订阅

订阅专栏

python sklearn one-hot

"""
    数据预处理 独热编码
"""
import numpy as np
import sklearn.preprocessing as sp

samples = np.array([
    [1, 3, 2],
    [7, 5, 4],
    [1, 8, 6],
    [7, 3, 9]
])

# 独热编码 sparse 是否采用稀疏矩阵
ohe = sp.OneHotEncoder(sparse=False, dtype="int32")
result = ohe.fit_transform(samples)
# 00列2位，01列 3位，02列4位
print(result)

在这里插入图片描述

python sklearn 标签编码

"""
    标签编码器
"""
import numpy as np
import sklearn.preprocessing as sp

# 准备数据
raw_samples = np.array(["audi", "ford", "audi", "toyota",
                        "ford", "bmw", "ford", "redflag", "audi"])
print(raw_samples)

# 训练之前 需要标签编码
lbe = sp.LabelEncoder()
result = lbe.fit_transform(raw_samples)
print("-----编码后\n", result)

# 编码 反向推导
test = [0, 0, 1, 1, 4]
print("-----编码反推\n", lbe.inverse_transform(test))