Importing a CSV file with pandas raises the following error.
/usr/local/lib/python3.5/dist-packages/pandas/io/parsers.py in __init__(self, src, **kwds)
1706 kwds['usecols'] = self.usecols
1707
-> 1708 self._reader = parsers.TextReader(src, **kwds)
1709
1710 passed_names = self.names is None
pandas/_libs/parsers.pyx in pandas._libs.parsers.TextReader.__cinit__()
pandas/_libs/parsers.pyx in pandas._libs.parsers.TextReader._get_header()
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xb5 in position 0: invalid start byte
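The message says that the very first byte of the file, 0xb5, cannot begin a UTF-8 sequence, which usually means the file was saved in some other encoding. The sketch below reproduces the same message with the standard library only. It assumes, purely for illustration, that the file was saved as GBK and starts with the character "的"; the real file's encoding is not known from the traceback.

```python
# Assumption for illustration: the CSV was saved as GBK and starts with "的".
raw = "的".encode("gbk")  # two bytes: 0xb5 0xc4

try:
    raw.decode("utf-8")
    message = None
except UnicodeDecodeError as exc:
    # 0xb5 is a UTF-8 continuation byte, so it cannot start a character.
    message = str(exc)

# ISO-8859-1 maps every byte to a character, so it never fails,
# but here it produces the wrong text (mojibake).
as_latin1 = raw.decode("ISO-8859-1")

# Decoding with the encoding the bytes were written in restores the text.
as_gbk = raw.decode("gbk")

print(message)
```

If the data comes out as unexpected accented characters after switching encodings, the chosen encoding is probably not the one the file was written in.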
Solutions
Method 1: use the unicode_escape encoding
import pandas as pd
dataset = pd.read_csv('sample_data.csv', encoding='unicode_escape')
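One caveat with unicode_escape: it treats ordinary bytes like Latin-1, but it also interprets backslash sequences in the data as escapes. The short sketch below shows both effects on hand-made byte strings (the sample values are made up for illustration), so it may be worth checking whether the CSV contains backslashes before relying on this codec.

```python
# A high byte is decoded the same way Latin-1 would decode it.
high_byte = b"\xb5".decode("unicode_escape")

# Backslash sequences in the data are interpreted as escapes:
# "\t" becomes a tab and "\n" becomes a newline.
sample = rb"C:\temp\new.csv"
escaped = sample.decode("unicode_escape")

# ISO-8859-1 leaves the backslashes untouched.
plain = sample.decode("ISO-8859-1")

print(repr(escaped))
print(repr(plain))
```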
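Method 2: use the ISO-8859-1 encoding
# reload(sys) and sys.setdefaultencoding() exist only in Python 2; on Python 3, pass the encoding to read_csv.
dataset = pd.read_csv('sample_data.csv', encoding='ISO-8859-1')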
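ISO-8859-1 accepts every byte value, so it never raises, but it silently produces wrong characters when the file is really in another encoding. A minimal sketch of an alternative is to try stricter candidates first and keep ISO-8859-1 as the last resort. The helper name `first_working_encoding` and the candidate list are assumptions made for this example, not part of pandas, and the result is only a heuristic: a successful decode does not prove the encoding is the right one.

```python
def first_working_encoding(raw, candidates=("utf-8", "gbk", "ISO-8859-1")):
    """Return the first candidate encoding that decodes `raw` without error.

    Hypothetical helper; the candidate list is an assumption and should be
    adapted to the encodings the data is likely to use.
    """
    for name in candidates:
        try:
            raw.decode(name)
        except UnicodeDecodeError:
            continue  # this candidate cannot decode the bytes, try the next
        return name
    raise ValueError("none of the candidate encodings could decode the data")


# Example with in-memory bytes standing in for the file contents.
utf8_bytes = "name,value\n的,1\n".encode("utf-8")
gbk_bytes = "name,value\n的,1\n".encode("gbk")
print(first_working_encoding(utf8_bytes))
print(first_working_encoding(gbk_bytes))
```

For a real file, the bytes can be read with `open('sample_data.csv', 'rb').read()` and the returned name passed to `pd.read_csv` through its `encoding` argument.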
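Method 3: skip the errors (drop bytes that are not valid UTF-8)
with open('sample_data.csv', encoding='utf-8', errors='ignore') as f:
    dataset = pd.read_csv(f)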
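Skipping errors loses data, so it helps to see exactly what is dropped. The sketch below uses only the standard library and a made-up two-row CSV (an assumption for illustration) to compare the `ignore` and `replace` error handlers: `ignore` silently removes the undecodable byte, while `replace` leaves a visible U+FFFD marker in its place.

```python
import csv
import io

# Made-up CSV bytes: the second row contains 0xb5, which is not valid UTF-8.
raw = b"name,unit\nmicro,\xb5m\n"

# errors="ignore" drops the bad byte without a trace.
ignored = raw.decode("utf-8", errors="ignore")
rows_ignored = list(csv.reader(io.StringIO(ignored)))

# errors="replace" substitutes U+FFFD so the damage stays visible.
replaced = raw.decode("utf-8", errors="replace")
rows_replaced = list(csv.reader(io.StringIO(replaced)))

print(rows_ignored)
print([[ascii(cell) for cell in row] for row in rows_replaced])
```

When the affected column matters, `replace` is usually the safer choice because the damaged cells can still be found afterwards.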