在处理csv文件时,发现这个出错,怎么办呢?
Traceback (most recent call last):
File "pandas\_libs\parsers.pyx", line 1162, in pandas._libs.parsers.TextReader._convert_tokens (pandas\_libs\parsers.c:14858)
File "pandas\_libs\parsers.pyx", line 1273, in pandas._libs.parsers.TextReader._convert_with_dtype (pandas\_libs\parsers.c:17119)
File "pandas\_libs\parsers.pyx", line 1289, in pandas._libs.parsers.TextReader._string_convert (pandas\_libs\parsers.c:17347)
File "pandas\_libs\parsers.pyx", line 1524, in pandas._libs.parsers._string_box_utf8 (pandas\_libs\parsers.c:23041)
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xd6 in position 0: invalid continuation byte
During handling of the above exception, another exception occurred:
要解决这个问题,应该是CSV保存时有问题,需要把保存的代码进行修改:
原来: df.to_csv(outpath)
修改为: df.to_csv(outpath, encoding= u'utf-8')
这样就可以按utf-8方式读取了。