‘utf-8‘ codec can‘t decode byte 0x8b in position 1 错误解决方案之一

最新推荐文章于 2024-06-03 19:53:03 发布

鑫哥呀

最新推荐文章于 2024-06-03 19:53:03 发布

阅读量3.4k

点赞数

文章标签： html python

本文链接：https://blog.csdn.net/weixin_44495539/article/details/121678362

版权

使用urllib.request.urlopen来请求网页的数据，并用read()去读取数据时报出以下错误。

response = urllib.request.Request(url=url, headers=headers)
response = urllib.request.urlopen(response)
html = response.read().decode('utf-8')
print(html)

经查资料了解到该错误是出现了无法用utf-8格式解析的内容，此时应该从浏览器中获取到该网页的响应头。

在响应头中找到Content-Type参数，在这个参数的值中找到网页的编码格式，如上图是“GBK”，将代码的解码格式改成对应的格式，就可以解决这个问题。

response = urllib.request.Request(url=url, headers=headers)
response = urllib.request.urlopen(response)
html = response.read().decode('GBK')
print(html)

当然这只是其中的一个解决方法。

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

鑫哥呀

关注关注

0
点赞
踩
1

收藏

觉得还不错? 一键收藏
0
评论
‘utf-8‘ codec can‘t decode byte 0x8b in position 1 错误解决方案之一

使用urllib.request.urlopen来请求网页的数据，并用read()去读取数据时报出以下错误。response = urllib.request.Request(url=url, headers=headers)response = urllib.request.urlopen(response)html = response.read().decode('utf-8')print(html)经查资料了解到该错误是出现了无法用utf-8格式解析的内容，此时应该从浏览器中获取到该网
复制链接

扫一扫