TensorFlow框架中多分类标签转换成One-hot

最新推荐文章于 2021-03-01 16:42:15 发布

zaf赵

最新推荐文章于 2021-03-01 16:42:15 发布

阅读量1.9k

点赞数

分类专栏： TensorFlow 深度学习

本文链接：https://blog.csdn.net/zaf0516/article/details/92838083

版权

TensorFlow 同时被 2 个专栏收录

56 篇文章 5 订阅

订阅专栏

深度学习

53 篇文章 1 订阅

订阅专栏

TensorFlow的one_hot函数

import tensorflow as tf
tf.one_hot(indices, depth, on_value, off_value, axis)

one_hot(
indices, #指数的张量，即label的矩阵形式（1行 n列）
depth, #一个标量，用于定义一个 one hot 维度的深度。即分类数
on_value=None, #定义在 indices[j] = i 时填充输出的值的标量。（默认：1）
off_value=None, #定义在 indices[j] != i 时填充输出的值的标量。（默认：0
axis=None, #要填充的轴（默认：-1，一个新的最内层轴）
dtype=None, #输出张量的数据类型。
name=None
)

TensorFlow 多分类标签转换成One-hot

在处理多分类问题时，将多分类标签转成One-hot编码是一种很常见的手段，以下即为Tensorflow将标签转成One-hot的tensor。以Mnist为例，如果标签为“3”，则One-hot编码为[0,0,0,1,0,0,0,0,0,0].

import tensorflow as tf  # version : '1.12.0'

NUM_CLASSES = 10 # 10分类
labels = [0,1,2,3] # sample label
batch_size = tf.size(labels) # get size of labels : 4
labels = tf.expand_dims(labels, 1) # 增加一个维度
indices = tf.expand_dims(tf.range(0, batch_size,1), 1) #生成索引
concated = tf.concat([indices, labels] , 1) #作为拼接
onehot_labels = tf.sparse_to_dense(concated, tf.stack([batch_size, NUM_CLASSES]), 1.0, 0.0) # 生成one-hot编码的标签

sparse_to_dense 函数说明：https://www.tensorflow.org/api_docs/python/tf/sparse_to_dense
将稀疏矩阵转换成密集矩阵，其中索引在concated中，值为1.其他位置的值为默认值0.

方法1：
a = ['A','B','A','C']
from sklearn.preprocessing import OneHotEncoder,LabelEncoder
label_value = label_encoder.fit_transform(a)
>>label_encoder.classes_
array(['A', 'B', 'C'], dtype='<U1')
>>label_value
array([0, 1, 0, 2], dtype=int64)
encoder = OneHotEncoder()
>>one_hot.toarray()
[[ 1. 0. 0.]
[ 0. 1. 0.]
[ 1. 0. 0.]
[ 0. 0. 1.]]

方法2：
from sklearn.preprocessing import LabelBinarizer
encoder = LabelBinarizer()
one_hot = encoder.fit_transform(a)
>>one_hot
array([[1, 0, 0],
       [0, 1, 0],
       [1, 0, 0],
       [0, 0, 1]])

方法3：
def dense_to_one_hot(labels_dense, num_classes):
   """Convert class labels from scalars to one-hot vectors."""
   num_labels = labels_dense.shape[0]
   index_offset = np.arange(num_labels) * num_classes
   labels_one_hot = np.zeros((num_labels, num_classes))
   labels_one_hot.flat[index_offset + labels_dense.ravel()] = 1
   return labels_one_hot

keras中

import numpy as np from keras.utils import to_categorical data = [1, 2, 3, 4, 5, 6, 7, 8, 9, 7] data = array(data) print(data) # [1 2 3 4 5 6 7 8 9 7] #有普通np数组转换为one-hot one_hots = to_categorical(data) print(one_hots) # [[ 0. 1. 0. 0. 0. 0. 0. 0. 0. 0.] # [ 0. 0. 1. 0. 0. 0. 0. 0. 0. 0.] # [ 0. 0. 0. 1. 0. 0. 0. 0. 0. 0.] # [ 0. 0. 0. 0. 1. 0. 0. 0. 0. 0.] # [ 0. 0. 0. 0. 0. 1. 0. 0. 0. 0.] # [ 0. 0. 0. 0. 0. 0. 1. 0. 0. 0.] # [ 0. 0. 0. 0. 0. 0. 0. 1. 0. 0.] # [ 0. 0. 0. 0. 0. 0. 0. 0. 1. 0.] # [ 0. 0. 0. 0. 0. 0. 0. 0. 0. 1.] # [ 0. 0. 0. 0. 0. 0. 0. 1. 0. 0.]] # 由one-hot转换为普通np数组 data = [argmax(one_hot)for one_hot in one_hots] print(data) # [1, 2, 3, 4, 5, 6, 7, 8, 9, 7]

https://blog.csdn.net/ictcxq/article/details/78545282

zaf赵

关注

0
点赞
踩
5

收藏

觉得还不错? 一键收藏
0
评论
TensorFlow框架中多分类标签转换成One-hot

TensorFlow的one_hot函数import tensorflow as tftf.one_hot(indices, depth, on_value, off_value, axis)one_hot(indices, #指数的张量，即label的矩阵形式（1行 n列）depth, #一个标量，用于定义一个 one hot 维度的深度。即分类数on_value=None, ...
复制链接

扫一扫

专栏目录