python utf 8编码_字符串编码IDNA>UTF8（Python）

最新推荐文章于 2023-03-02 08:47:35 发布

生活的手下败将

最新推荐文章于 2023-03-02 08:47:35 发布

阅读量471

点赞数

文章标签： python utf 8编码

版权声明：本文为博主原创文章，遵循 CC 4.0 BY-SA 版权协议，转载请附上原文出处链接和本声明。

本文链接：https://blog.csdn.net/weixin_33186486/article/details/113494170

版权

要从一种编码转换为另一种编码，必须首先将字符串解码为Unicode，然后在目标编码中再次编码。在

例如：idna_encoded_bytes = b'xn o3cw4h'

unicode_string = idna_encoded_bytes.decode('idna')

utf8_encoded_bytes = unicode_string.encode('utf-8')

print (repr(idna_encoded_bytes))

print (repr(utf8_encoded_bytes))

print (repr(unicode_string))

Python2结果：

^{pr2}$

如您所见，第一行是ไทย的IDNA编码，第二行是utf8编码，最后一行是Unicode代码点U-0E44、U-0E17和U-0E22的未编码序列。在

要一步完成转换，只需将操作链起来：utf8_encoded_bytes = idna_encoded_bytes.decode('idna').encode('utf8')

回复评论：I'm starting with isn't b'xn o3cw4h' but just the string 'xn o3cw4h'. [in Python3].

你那儿有只怪鸭子。显然，您已经对存储在unicode字符串中的数据进行了编码。我们需要以某种方式将其转换为bytes对象。一种简单的方法是使用(令人困惑的)ASCII编码：improperly_encoded_idna = 'xn o3cw4h'

idna_encoded_bytes = improperly_encoded_idna.encode('ascii')

unicode_string = idna_encoded_bytes.decode('idna')

utf8_encoded_bytes = unicode_string.encode('utf-8')

print (repr(idna_encoded_bytes))

print (repr(utf8_encoded_bytes))

print (repr(unicode_string))

生活的手下败将

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
python utf 8编码_字符串编码IDNA>UTF8（Python）

要从一种编码转换为另一种编码，必须首先将字符串解码为Unicode，然后在目标编码中再次编码。在例如：idna_encoded_bytes = b'xn o3cw4h'unicode_string = idna_encoded_bytes.decode('idna')utf8_encoded_bytes = unicode_string.encode('utf-8')print (repr(idn...
复制链接

扫一扫

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。