使用 Python 爬取每日 Bing 壁纸

最新推荐文章于 2024-08-27 11:16:30 发布

袁公白

最新推荐文章于 2024-08-27 11:16:30 发布

阅读量1k

点赞数 27

文章标签： python mysql 开发语言

本文链接：https://blog.csdn.net/m0_59328104/article/details/139292795

版权

Bing 每天都会更新一张精美的壁纸，如果你想自动下载这些壁纸作为你的桌面背景，或者仅仅是收藏，那么使用 Python 爬虫实现这个功能再适合不过了。本文将详细介绍如何编写一个简单的 Python 爬虫程序来实现每日 Bing 壁纸的自动下载。

一、前期准备

在开始编写爬虫之前，我们需要安装一些必备的 Python 库：

requests：用于发送 HTTP 请求
beautifulsoup4：用于解析 HTML
os：用于文件操作

首先，我们需要安装这些库。可以在终端或命令行中运行以下命令：

pip install requests beautifulsoup4

二、编写爬虫代码

创建一个新的 Python 文件，例如 bing_wallpaper.py，并在其中编写以下代码。

import requests
from bs4 import BeautifulSoup
import os
from datetime import datetime

# Bing 壁纸的 API 地址
BING_URL = "https://www.bing.com"
API_URL = BING_URL + "/HPImageArchive.aspx?format=js&idx=0&n=1&mkt=en-US"

# 下载壁纸
def download_wallpaper(url, save_path):
    response = requests.get(url)
    if response.status_code == 200:
        with open(save_path, 'wb') as f:
            f.write(response.content)
        print(f"壁纸已保存至 {save_path}")
    else:
        print("下载壁纸失败")

# 获取壁纸信息
def get_wallpaper_info():
    response = requests.get(API_URL)
    if response.status_code == 200:
        data = response.json()
        image_info = data['images'][0]
        image_url = BING_URL + image_info['url']
        image_title = image_info['title']
        return image_url, image_title
    else:
        print("获取壁纸信息失败")
        return None, None

# 主函数
def main():
    image_url, image_title = get_wallpaper_info()
    if image_url and image_title:
        # 使用日期作为文件名
        date_str = datetime.now().strftime("%Y-%m-%d")
        save_path = os.path.join(os.getcwd(), f"{date_str} - {image_title}.jpg")
        download_wallpaper(image_url, save_path)

if __name__ == "__main__":
    main()

三、代码解析

导入库：
```
import requests
from bs4 import BeautifulSoup
import os
from datetime import datetime
```
我们导入了所需的库，分别用于发送 HTTP 请求、解析 HTML、文件操作和处理日期时间。

定义常量：

BING_URL = "https://www.bing.com"
API_URL = BING_URL + "/HPImageArchive.aspx?format=js&idx=0&n=1&mkt=en-US"

这里定义了 Bing 的基本 URL 和获取壁纸信息的 API 地址。

下载壁纸的函数：

def download_wallpaper(url, save_path):
    response = requests.get(url)
    if response.status_code == 200:
        with open(save_path, 'wb') as f:
            f.write(response.content)
        print(f"壁纸已保存至 {save_path}")
    else:
        print("下载壁纸失败")

该函数接收壁纸 URL 和保存路径，下载壁纸并保存到指定路径。

获取壁纸信息的函数：

def get_wallpaper_info():
    response = requests.get(API_URL)
    if response.status_code == 200:
        data = response.json()
        image_info = data['images'][0]
        image_url = BING_URL + image_info['url']
        image_title = image_info['title']
        return image_url, image_title
    else:
        print("获取壁纸信息失败")
        return None, None

该函数从 Bing API 获取当天的壁纸信息，包括壁纸的 URL 和标题。

主函数：

def main():
    image_url, image_title = get_wallpaper_info()
    if image_url and image_title:
        date_str = datetime.now().strftime("%Y-%m-%d")
        save_path = os.path.join(os.getcwd(), f"{date_str} - {image_title}.jpg")
        download_wallpaper(image_url, save_path)

if __name__ == "__main__":
    main()