python显示数据长度_python-3.x – 在Tensorflow中获取数据集的长度

最新推荐文章于 2022-12-19 19:13:27 发布

weixin_39756696

最新推荐文章于 2022-12-19 19:13:27 发布

阅读量294

点赞数

文章标签： python显示数据长度

source_dataset = tf.data.TextLineDataset('primary.csv')

target_dataset = tf.data.TextLineDataset('secondary.csv')

dataset = tf.data.Dataset.zip((source_dataset, target_dataset))

dataset = dataset.shard(10000, 0)

dataset = dataset.map(lambda source, target: (tf.string_to_number(tf.string_split([source], delimiter=',').values, tf.int32),

tf.string_to_number(tf.string_split([target], delimiter=',').values, tf.int32)))

dataset = dataset.map(lambda source, target: (source, tf.concat(([start_token], target), axis=0), tf.concat((target, [end_token]), axis=0)))

dataset = dataset.map(lambda source, target_in, target_out: (source, tf.size(source), target_in, target_out, tf.size(target_in)))

dataset = dataset.shuffle(NUM_SAMPLES) #This is the important line of code

我想完全整理我的整个数据集,但是shuffle()需要拉取一些样本,而tf.Size()不能与tf.data.Dataset一起使用.

我怎么能正常洗牌？

weixin_39756696

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
python显示数据长度_python-3.x – 在Tensorflow中获取数据集的长度

source_dataset = tf.data.TextLineDataset('primary.csv')target_dataset = tf.data.TextLineDataset('secondary.csv')dataset = tf.data.Dataset.zip((source_dataset, target_dataset))dataset = dataset.shard(1...
复制链接

扫一扫

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。