Error: iterator should return strings, not bytes (did you open the file in text mode?)

Original Code:
# Load CSV Using Python Standard Library
import csv
import numpy as np
filename = 'pima-indians-diabetes.data.csv'
raw_data = open(filename,'rb')
reader = csv.reader(raw_data, delimiter=',',quoting=csv.QUOTE_NONE)
x = list(reader)
data = np.array(x).astype('float')
print(data.shape)
Modified Code:
# Load CSV Using Python Standard Library
import csv
import codecs
import numpy as np
filename = 'pima-indians-diabetes.data.csv'
raw_data = open(filename,'rb')
#reader = csv.reader(raw_data, delimiter=',',quoting=csv.QUOTE_NONE)
reader = csv.reader(codecs.iterdecode(raw_data, 'utf-8'),delimiter=',',quoting=csv.QUOTE_NONE)
x = list(reader)
data = np.array(x).astype('float')
print(data.shape)

本文介绍了一种在Python中使用标准库加载CSV文件时遇到的常见错误:迭代器应返回字符串,而非字节。通过使用codecs模块的iterdecode方法进行修改,解决了因文件打开模式不当导致的数据类型不匹配问题。

被折叠的 条评论
为什么被折叠?



