参考连接:http://blog.csdn.net/pipisorry/article/details/44136297
上面这篇博客解释的很清楚,里面的一张图也很好解释了python2和python3的编码区别,自己重新绘制了博客中的草图:
python支持的编码类型在下面官网有详细介绍:
https://docs.python.org/3/library/codecs.html#standard-encodings
下面是在ipython下的运行记录:
In [58]: k="你好"
In [59]: k
Out[59]: '你好'
In [60]: type(k)
Out[60]: str
In [61]: k.encode()
Out[61]: b'\xe4\xbd\xa0\xe5\xa5\xbd'
In [62]: k.encode('gbk')
Out[62]: b'\xc4\xe3\xba\xc3'
In [63]: k.encode('big5')
Out[63]: b'\xa7A\xa6n'
In [64]: k.encode('ascii')
---------------------------------------------------------------------------
UnicodeEncodeError Traceback (most recent call last)
<ipython-input-64-7b31b95ac7e2> in <module>()
----> 1 k.encode('ascii')
UnicodeEncodeError: 'ascii' codec can't encode characters in position 0-1: ordinal not in range(128)
In [65]: k.encode('utf-16')
Out[65]: b'\xff\xfe`O}Y'
# 先进行编码
a,b,c=k.encode(),k.encode('gbk'),k.encode('utf-16')
# 分布对三种编码后的数据进行解码
a.decode('utf-8')
Out[35]: '你好'
b.decode('gbk')
Out[36]: '你好'
c.decode('utf-16')
Out[37]: '你好'