报错信息如下
Traceback (most recent call last):
File "D:\Scripts\MyWorkProject\xxx.py", line 84, in <module>
email_message = data[0][1].decode('utf-8' )
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xd3 in position 476: invalid continuation byte
错误翻译:
Unicode编码错误:“utf-8”编解码器无法解码位置476中的字节0xd3:无效的延续字节
报错原因:
由于设置了decode()方法中第二个参数errors为严格(strict)形式造成的,默认就是这个utf-8编码参数,因为解析的内容存在其他编码,将其更改为ignore等即可。
def decode(self, encoding, errors="strict"):
"""
Decode character string in the Series/Index using indicated encoding.
Equivalent to :meth:`str.decode` in python2 and :meth:`bytes.decode` in
python3.
Parameters
----------
encoding : str
errors : str, optional
Returns
-------
Series or Index
"""
# TODO: Add a similar _bytes interface.
if encoding in _cpython_optimized_decoders:
# CPython optimized implementation
f = lambda x: x.decode(encoding, errors)
else:
decoder = codecs.getdecoder(encoding)
f = lambda x: decoder(x, errors)[0]
arr = self._data.array
# assert isinstance(arr, (StringArray,))
result = arr._str_map(f)
return self._wrap_result(result)