现象:
1.无法print文本
2.编码错误,当dataframe中有表情符号时,写入本地csv或者xlsl文件会报错
报错:
UnicodeEncodeError: 'utf-8' codec can't encode character '\ud835' in position 219: surrogates not allowed
原因:
一些表情类特殊字符无法被uf-8解码,可以ignore再解码
解决方法:
x.encode('UTF-8', 'ignore').decode('UTF-8')
eg:
data.content = data.content.apply(lambda x:x.encode('UTF-8', 'ignore').decode('UTF-8'))