1. reader = tf.TextLineReader(), which reads one line at a time.
The reader's read method outputs a key identifying the input file and the record within it (useful for debugging), along with a scalar string value. This string can then be decoded by one or more decoder or conversion ops into tensors that make up an example.
Contents of file1.csv:
100,10,11,12,0
101,10,11,12,0
102,10,11,12,0
103,10,11,12,0
104,10,11,12,0
105,10,11,12,0
106,10,11,12,0
Contents of file0.csv:
1,10,11,12,0
2,10,11,12,0
3,10,11,12,0
4,10,11,12,0
5,10,11,12,0
6,10,11,12,0
7,10,11,12,0
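The reader example below assumes file0.csv and file1.csv already exist. One way to create them (this helper is not part of the original tutorial) is:

```python
# Helper that writes the two CSV files used by the reader example below.
rows_file0 = [[i, 10, 11, 12, 0] for i in range(1, 8)]      # rows 1..7
rows_file1 = [[i, 10, 11, 12, 0] for i in range(100, 107)]  # rows 100..106

for name, rows in [("file0.csv", rows_file0), ("file1.csv", rows_file1)]:
    with open(name, "w") as f:
        for row in rows:
            f.write(",".join(str(v) for v in row) + "\n")
```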
import tensorflow as tf

filename_queue = tf.train.string_input_producer(["file0.csv", "file1.csv"])
reader = tf.TextLineReader()
key, value = reader.read(filename_queue)

# Default values, in case of empty columns. Also specifies the type of the
# decoded result.
record_defaults = [[1], [1], [1], [1], [1]]
col1, col2, col3, col4, col5 = tf.decode_csv(
    value, record_defaults=record_defaults)
#features = tf.concat(0, [col1, col2, col3, col4])
features = [col1, col2, col3, col4]

with tf.Session() as sess:
    # Start populating the filename queue.
    coord = tf.train.Coordinator()
    threads = tf.train.start_queue_runners(coord=coord)

    for i in range(1200):
        # Retrieve a single instance:
        example, label = sess.run([features, col5])
        print(example, label)

    coord.request_stop()
    coord.join(threads)
Output:
[1, 10, 11, 12] 0
[2, 10, 11, 12] 0
[3, 10, 11, 12] 0
[4, 10, 11, 12] 0
[5, 10, 11, 12] 0
[6, 10, 11, 12] 0
[7, 10, 11, 12] 0
[100, 10, 11, 12] 0
[101, 10, 11, 12] 0
[102, 10, 11, 12] 0
[103, 10, 11, 12] 0
[104, 10, 11, 12] 0
[105, 10, 11, 12] 0
[106, 10, 11, 12] 0
[100, 10, 11, 12] 0
[101, 10, 11, 12] 0
[102, 10, 11, 12] 0
[103, 10, 11, 12] 0
[104, 10, 11, 12] 0
[105, 10, 11, 12] 0
[106, 10, 11, 12] 0
[1, 10, 11, 12] 0
[2, 10, 11, 12] 0
As the output shows, the contents of each file are read sequentially; the records are not shuffled.
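To make the role of record_defaults concrete, here is a small pure-Python analogue of what tf.decode_csv does for a single line (an illustration, not the TF implementation): an empty field falls back to its default, and the type of the default determines how the field is parsed.

```python
def decode_csv_line(line, record_defaults):
    """Parse one CSV line; empty fields take the corresponding default.
    The type of each default decides how a non-empty field is converted."""
    fields = line.rstrip("\n").split(",")
    out = []
    for raw, (default,) in zip(fields, record_defaults):
        if raw == "":
            out.append(default)            # empty column -> default value
        else:
            out.append(type(default)(raw)) # convert to the default's type
    return out

record_defaults = [[1], [1], [1], [1], [1]]
print(decode_csv_line("1,10,11,12,0", record_defaults))  # [1, 10, 11, 12, 0]
print(decode_csv_line("1,,11,,0", record_defaults))      # empty fields -> 1
```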
2. Fixed-length records
To read binary files in which every record has a fixed number of bytes, use tf.FixedLengthRecordReader together with the tf.decode_raw op. decode_raw converts a string into a uint8 tensor.
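As a rough pure-Python sketch of the same idea (not the TF API): fixed-length records are just consecutive byte chunks of a known size, and decoding "raw" bytes amounts to viewing each chunk as a vector of unsigned byte values. The 5-byte layout below is made up for illustration.

```python
RECORD_BYTES = 5  # e.g. 1 label byte + 4 feature bytes (hypothetical layout)

# Write a tiny binary file of three fixed-length records.
with open("records.bin", "wb") as f:
    f.write(bytes([0, 10, 11, 12, 13]))
    f.write(bytes([1, 20, 21, 22, 23]))
    f.write(bytes([0, 30, 31, 32, 33]))

# Read it back record by record, mimicking FixedLengthRecordReader +
# decode_raw: each record becomes a vector of uint8 values.
records = []
with open("records.bin", "rb") as f:
    while True:
        chunk = f.read(RECORD_BYTES)
        if len(chunk) < RECORD_BYTES:
            break
        records.append(list(chunk))  # bytes -> list of uint8 values

print(records)  # [[0, 10, 11, 12, 13], [1, 20, 21, 22, 23], [0, 30, 31, 32, 33]]
```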