python爬虫错误之 “UnicodeDecodeError: ‘utf-8‘ codec can‘t decode byte 0xd3 in position 252”

最新推荐文章于 2024-06-03 19:53:03 发布

叶落无痕123

最新推荐文章于 2024-06-03 19:53:03 发布

阅读量1.5k

点赞数

分类专栏： python

原文链接：https://blog.csdn.net/s_alted/article/details/116564853

版权

23 篇文章 3 订阅

订阅专栏

今天是学习爬虫第一天，俗话说万事开头难。刚写的第一个程序就报错了源代码如下：
import urllib.request

url = "https://fishc.com.cn/"
response = urllib.request.urlopen(url)
html = response.read().decode("utf-8")
print(html)

1234567
错误如下：

UnicodeDecodeError: 'utf-8' codec can't decode byte 0xd3 in position 252: invalid continuation byte

翻译过来就是： "utf-8”编解码器无法解码位置252中的字节0xd3:无效的连续字节
这个是解码出现了问题我们去要爬的网站看一下，看看他的编码方式是什么输入网站域名 --> 点击F12键

我们可以看到是gbk编码方式，至此问题原因就找到了修改代码，成功解决耶耶耶！

import urllib.request

 

url = "https://fishc.com.cn/"
response = urllib.request.urlopen(url)
html = response.read().decode("gbk")
print(html)

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

关注关注