Problem:
UnicodeDecodeError: 'gbk' codec can't decode byte 0xab in position 11126: illegal multibyte sequence
Solution:
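1. Pass encoding='utf-8' when reading the file. On a Chinese-locale Windows system, open() typically falls back to the system code page (GBK) when no encoding is given, and that codec cannot decode a UTF-8 file: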
open(r'C:\Z2programe\当当文学图书语料库\data\ID汇总.csv',encoding='utf-8')
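2. Pass encoding='utf_8_sig' when saving. This writes a UTF-8 byte order mark at the start of the file, which helps programs such as Excel recognize it as UTF-8: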
save.to_csv(r'C:\Z2programe\当当文学图书语料库\data\ID汇总.csv', encoding='utf_8_sig')
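
The two steps can be checked together with a small round trip. This is only a sketch using the standard library: the helper names write_rows and read_rows, the sample rows, and the temporary file demo.csv are made up for illustration and are not part of the original project.

```python
import csv
import os
import tempfile


def write_rows(path, rows):
    # utf_8_sig writes a UTF-8 BOM (EF BB BF) before the data.
    with open(path, "w", encoding="utf_8_sig", newline="") as f:
        csv.writer(f).writerows(rows)


def read_rows(path):
    # Reading with utf_8_sig strips the BOM if one is present;
    # a UTF-8 file without a BOM is read the same way.
    with open(path, encoding="utf_8_sig", newline="") as f:
        return list(csv.reader(f))


# Hypothetical sample data and a throwaway file location.
rows = [["id", "title"], ["1", "当当文学图书"]]
path = os.path.join(tempfile.mkdtemp(), "demo.csv")

write_rows(path, rows)
result = read_rows(path)

# Inspect the first three raw bytes to confirm the BOM was written.
with open(path, "rb") as f:
    first_bytes = f.read(3)
```

If a file saved with utf_8_sig is later read with plain encoding='utf-8', the BOM stays in the text as '\ufeff' at the start of the first field, so reading with utf_8_sig is the safer choice for files written this way.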