python爬虫壁纸网站（有源码）

最新推荐文章于 2024-03-23 15:13:22 发布

VIP文章大白菜加油

最新推荐文章于 2024-03-23 15:13:22 发布

阅读量560

点赞数

分类专栏：爬虫案例文章标签： python 爬虫

本文链接：https://blog.csdn.net/m0_61848611/article/details/124643784

版权

前言

这次我尝试了从壁纸网站上面，爬取图片下来

目标网址：http://www.netbian.com/

获取网页

这次我使用的是urllib库

设置网址

设置请求头

获取网页源代码，这里没有解码成中文字符，是因为要爬取的图片我们要以二进制形式保存

import urllib.request

url='http://www.netbian.com/'
headers={
    "User-Agent":"Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/80.0.3987.87 Safari/537.36 SE 2.X MetaSr 1.0"
    }
request=urllib.request.Request(url,headers=headers)
r= (urllib.request.urlopen(request)).read()