Python Crawler: Save All Bing HD 1080p Wallpapers in One Click

Latest recommended article published on 2024-01-07 21:07:44

BugMiaowu2021


Category: Python crawlers. Tags: python, xpath

Article link: https://blog.csdn.net/m0_46278037/article/details/114240397


Bing wallpaper gallery: https://bing.ioliu.cn/

Source code:

import requests
from lxml import etree

# Minimal browser-like headers, built once instead of on every page.
# The original also sent a stale Cookie, an If-None-Match ETag (which can
# produce an empty 304 response) and 'Accept-Encoding: br' (which requests
# cannot decode unless a brotli package is installed), so those are dropped.
headers = {
    'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/80.0.3987.163 Safari/537.36',
    'Accept': 'text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8',
    'Accept-Language': 'zh-CN,zh;q=0.9',
}

for i in range(1, 152):  # pages 1 to 151, hard-coded
    print('page:\t', i)
    url = 'https://bing.ioliu.cn/?p={}'.format(i)

    # verify=False is kept from the original: it skips TLS certificate checks
    # and makes requests emit an InsecureRequestWarning.
    res = requests.get(url, headers=headers, verify=False, timeout=10)
    parseHtml = etree.HTML(res.text)
    # Collect the src attribute of every <img> on the listing page.
    picList = parseHtml.xpath('//img/@src')
    for pic in picList:
        try:
            # e.g. http://h1.ioliu.cn/bing/SantoriniAerial_ZH-CN9367767863_640x480.jpg?imageslim
            if '_640' not in pic:
                continue  # not a wallpaper thumbnail
            # Swap the 640x480 thumbnail suffix for the 1920x1080 one.
            picUrl = pic.split('_640')[0] + '_1920x1080.jpg'
            # Use the part before the first underscore as the file name.
            picName = pic.split('bing/')[-1].split('_')[0] + '.jpg'
            picRes = requests.get(picUrl, timeout=10)
            picRes.raise_for_status()  # do not save an error page as a .jpg
            with open(picName, 'wb') as f:
                f.write(picRes.content)

        except Exception as e:
            print(i, pic, e)
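
The core of the script is the string manipulation that turns a thumbnail URL into a 1080p URL and a file name. The sketch below isolates that step in a helper, so it can be checked without touching the network. The function name `to_1080p` is hypothetical and not part of the original script. It assumes that thumbnails follow the pattern in the example URL quoted in the script's comment, and that the site serves a `_1920x1080.jpg` variant for each of them, as the script itself assumes.

```python
import posixpath
from urllib.parse import urlsplit


def to_1080p(thumb_url):
    """Return (full_url, file_name) for a thumbnail URL, or None if it does not match.

    Assumes thumbnails look like .../bing/<Name>_<Locale>_640x480.jpg?imageslim,
    as in the example URL quoted in the script's comment.
    """
    path = urlsplit(thumb_url).path    # URL path without the ?imageslim query
    base = posixpath.basename(path)    # last path segment, i.e. the image file name
    if '/bing/' not in path or '_640' not in base:
        return None                    # e.g. a logo or icon, not a wallpaper
    full_url = thumb_url.split('_640')[0] + '_1920x1080.jpg'
    file_name = base.split('_')[0] + '.jpg'
    return full_url, file_name


print(to_1080p('http://h1.ioliu.cn/bing/SantoriniAerial_ZH-CN9367767863_640x480.jpg?imageslim'))
```

Returning `None` for URLs that do not match lets the download loop skip unrelated images instead of requesting a malformed address.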

Crawl results:
[Two screenshots in the original post; images not preserved]
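
Because the script writes whatever the server returns to a `.jpg` file, one way to sanity-check the results is to confirm that each saved file really starts with the JPEG signature. The following is a minimal sketch, not part of the original post. The helper names `is_jpeg` and `find_bad_downloads` are hypothetical, and it assumes the images were saved in the current directory, as the script does.

```python
from pathlib import Path

JPEG_MAGIC = b'\xff\xd8\xff'  # a JPEG starts with the SOI marker FF D8, then FF


def is_jpeg(data):
    """Return True if the given bytes begin with the JPEG signature."""
    return data[:3] == JPEG_MAGIC


def find_bad_downloads(folder='.'):
    """Return the names of .jpg files in folder that do not look like JPEGs."""
    bad = []
    for path in sorted(Path(folder).glob('*.jpg')):
        with open(path, 'rb') as f:
            if not is_jpeg(f.read(3)):
                bad.append(path.name)
    return bad
```

Calling `find_bad_downloads()` after a crawl lists files that are probably saved error pages, so they can be deleted or fetched again.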
