序列预处理：序列填充之pad_sequences()和one-hot转化之keras.utils.to_categorical（）

最新推荐文章于 2022-04-29 11:07:10 发布

Zero_to_zero1234

最新推荐文章于 2022-04-29 11:07:10 发布

阅读量1.5k

点赞数 1

分类专栏：自然语言处理 tensorflow 深度学习文章标签： pad_sequences utils.to_categorical padding one-hot

本文链接：https://blog.csdn.net/suiyueruge1314/article/details/90642887

版权

深度学习同时被 3 个专栏收录

112 篇文章 5 订阅

订阅专栏

自然语言处理

42 篇文章 2 订阅

订阅专栏

tensorflow

32 篇文章 2 订阅

订阅专栏

tensorflow文本处理中，经常会将 padding 和 one-hot 操作共同出现，所以以下两种方法为有效且常用的方法：

一、keras.preprocessing.sequence.pad_sequences（）
在这里插入图片描述实例：

>>>list_1 = [[2,3,4]]
>>>keras.preprocessing.sequence.pad_sequences(list_1, maxlen=10)
array([[0, 0, 0, 0, 0, 0, 0, 2, 3, 4]], dtype=int32)

>>>list_2 = [[1,2,3,4,5]]
>>>keras.preprocessing.sequence.pad_sequences(list_2, maxlen=10)
array([[0, 0, 0, 0, 0, 1, 2, 3, 4, 5]], dtype=int32)

二、keras.utils.to_categorical（）

to_categorical(y, num_classes=None, dtype='float32')

将整型标签转为onehot。y为int数组，num_classes为标签类别总数，大于max(y)（标签从0开始的）。
返回：如果num_classes=None，返回len(y) * [max(y)+1]（维度，m*n表示m行n列矩阵，下同），否则为len(y) * num_classes。说出来显得复杂，请看下面实例。

import keras

ohl=keras.utils.to_categorical([1,3])
# ohl=keras.utils.to_categorical([[1],[3]])
print(ohl)
"""
[[0. 1. 0. 0.]
 [0. 0. 0. 1.]]
"""
ohl=keras.utils.to_categorical([1,3],num_classes=5)
print(ohl)
"""
[[0. 1. 0. 0. 0.]
 [0. 0. 0. 1. 0.]]
"""

Zero_to_zero1234

关注

1
点赞
踩
2

收藏

觉得还不错? 一键收藏
0
评论
序列预处理：序列填充之pad_sequences()和one-hot转化之keras.utils.to_categorical（）

tensorflow文本处理中，经常会将 padding 和 one-hot 操作共同出现，所以以下两种方法为有效且常用的方法：一、keras.preprocessing.sequence.pad_sequences（）实例：>>>list_1 = [[2,3,4]]>>>keras.preprocessing.sequence.pad_sequence...
复制链接

扫一扫