在python中读取含有中文的csv文件的时候遇到了编码错误,“UnicodeDecodeError: 'utf-8' codec can't decode byte 0xbd in position 0: invalid start byte”,因此就想把csv文件编码改为utf-8编码方式,错误如下:
import pandas as pd
Data = pd.read_csv('grade1.csv')
F:\Python37\python.exe "F:/********/dataprocess.py"
Traceback (most recent call last):
File "pandas\_libs\parsers.pyx", line 1169, in pandas._libs.parsers.TextReader._convert_tokens
File "pandas\_libs\parsers.pyx", line 1299, in pandas._libs.parsers.TextReader._convert_with_dtype
File "pandas\_libs\parsers.pyx", line 1315, in pandas._libs.parsers.TextReader._string_convert
File "pandas\_libs\parsers.pyx", line 1553, in pandas._libs.parsers._string_box_utf8
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xbd in position 0: invalid start byte
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "F:/********/dataprocess.py", line 3, in <module>
Data = pd.read_csv('grade1.csv')
File "F:\Python37\lib\site-packages\pandas\io\parsers.py", line 702, in parser_f
return _read(filepath_or_buffer, kwds)
File "F:\Python37\lib\site-packages\pandas\io\parsers.py", line 435, in _read
data = parser.read(nrows)
File "F:\Python37\lib\site-packages\pandas\io\parsers.py", line 1139, in read
ret = self._engine.read(nrows)
File "F:\Python37\lib\site-packages\pandas\io\parsers.py", line 1995, in read
data = self._reader.read(nrows)
File "pandas\_libs\parsers.pyx", line 899, in pandas._libs.parsers.TextReader.read
File "pandas\_libs\parsers.pyx", line 914, in pandas._libs.parsers.TextReader._read_low_memory
File "pandas\_libs\parsers.pyx", line 991, in pandas._libs.parsers.TextReader._read_rows
File "pandas\_libs\parsers.pyx", line 1123, in pandas._libs.parsers.TextReader._convert_column_data
File "pandas\_libs\parsers.pyx", line 1176, in pandas._libs.parsers.TextReader._convert_tokens
File "pandas\_libs\parsers.pyx", line 1299, in pandas._libs.parsers.TextReader._convert_with_dtype
File "pandas\_libs\parsers.pyx", line 1315, in pandas._libs.parsers.TextReader._string_convert
File "pandas\_libs\parsers.pyx", line 1553, in pandas._libs.parsers._string_box_utf8
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xbd in position 0: invalid start byte
从百度搜来的方法如下:
本文方法来自百度经验:首先,将.csv文件保存一下,然后鼠标右击打开方式记事本。然后,以记事本的方式打开了。文件-另存为 这时弹出一个窗口,右下方,编码,这时候你就可以选择自己想要的编码格式,然后保存,就可以了。