A roundup of Python encoding problems - 虫师 - cnblogs https://www.cnblogs.com/fnng/p/5008884.html
Chinese character set encodings: Unicode, GB2312, CP936, GBK, GB18030 - finallyly - cnblogs https://www.cnblogs.com/finallyliuyu/archive/2013/05/10/3071023.html
An in-depth analysis of encoding problems - CSDN blog https://blog.csdn.net/sundaysunshine/article/details/53954813
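Python statements:
import pandas as pd
# No encoding is given, so open() falls back to the locale default (cp936 on this machine)
filea = open("养老补缴明细表.csv", 'r')
print("Checkpoint A", filea, "Checkpoint B")
df3 = pd.read_csv(filea)
print(df3.dtypes)
Error output:
Checkpoint A <_io.TextIOWrapper name='养老补缴明细表.csv' mode='r' encoding='cp936'> Checkpoint B
Exception "unhandled UnicodeDecodeError": 'gbk' codec can't decode byte 0x9f in position 9: illegal multibyte sequence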
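Solution:
Fault 1: Opening "养老补缴明细表.csv" showed that the Chinese characters in the file were garbled. Changing df1.to_csv("养老补缴明细表.csv") to df1.to_csv("养老补缴明细表.csv", encoding='utf_8') fixed the garbled content.
Fault 2: The "'gbk' codec can't decode byte 0x9f in position 9: illegal multibyte sequence" error. The file is written as UTF-8, but open() without an encoding argument uses the locale default (cp936, i.e. GBK, as the output above shows), so the UTF-8 bytes fail to decode. Change the statement to:
filea = open("养老补缴明细表.csv", 'r', encoding='utf-8')
Problem solved.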
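The fix comes down to writing and reading the file with the same explicit encoding instead of relying on the platform default. Below is a minimal sketch of that idea using only the standard library. The helper names write_rows and read_rows and the sample data are made up for illustration and are not part of the original script.

```python
import csv
import os
import tempfile


def write_rows(path, rows, encoding="utf-8"):
    """Write rows to a CSV file using an explicit encoding."""
    # newline="" is what the csv module documentation recommends for file objects.
    with open(path, "w", encoding=encoding, newline="") as f:
        csv.writer(f).writerows(rows)


def read_rows(path, encoding="utf-8"):
    """Read a CSV file back, decoding it with an explicit encoding."""
    with open(path, "r", encoding=encoding, newline="") as f:
        return list(csv.reader(f))


if __name__ == "__main__":
    # Hypothetical sample data, only for this demonstration.
    sample = [["姓名", "金额"], ["张三", "100"]]
    path = os.path.join(tempfile.mkdtemp(), "sample.csv")

    write_rows(path, sample, encoding="utf-8")
    # Same encoding on both sides: the rows come back unchanged.
    print(read_rows(path, encoding="utf-8") == sample)

    # Mismatched encoding: depending on the bytes involved, this either
    # raises UnicodeDecodeError or silently returns garbled text.
    try:
        print(read_rows(path, encoding="gbk"))
    except UnicodeDecodeError as exc:
        print("gbk decode failed:", exc)
```

With pandas, the same effect should be achievable without calling open() at all, because pd.read_csv accepts an encoding argument, for example pd.read_csv("养老补缴明细表.csv", encoding='utf-8').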