import numpy as np
filename = ‘./data/51job.csv’
arr = np.loadtxt(filename,
delimiter=’,’,
skiprows=1,
usecols=[0,1,2,3],
dtype=str,
)
print(arr)
用numpy加载包含中文的csv文件会报错:
UnicodeDecodeError: ‘gbk’ codec can’t decode byte 0x80 in position 36: illegal multibyte sequence
解决办法一:
arr = np.loadtxt(filename,
delimiter=’,’,
skiprows=1,
usecols=[0,1,2,3],
dtype=str,
encoding=‘utf8’
)
print(arr)
解决办法二:
arr=np.loadtxt(open(filename, encoding=‘utf8’),
dtype=str,
delimiter=’,’,
skiprows=1)
print(arr)