python爬虫实战小案例：抓取天气信息

最新推荐文章于 2024-09-27 10:11:28 发布

洋697

最新推荐文章于 2024-09-27 10:11:28 发布

阅读量331

点赞数 6

文章标签： python 爬虫开发语言

本文链接：https://blog.csdn.net/weixin_63883286/article/details/142493366

版权

简介

在数据驱动的时代，爬虫技术成为了获取数据的重要手段。本篇博客将通过一个简单的爬虫案例，教大家如何使用Python编写一个爬虫程序，用于抓取某个城市的天气信息。

环境准备

在开始之前，请确保你的环境中已安装Python和以下库：

requests：用于发起网络请求。
BeautifulSoup：用于解析HTML文档。

可以通过以下命令安装所需库：

bash

pip install requests beautifulsoup4

目标网站

本案例的目标网站是“中国天气网”的某个城市天气页面。我们将抓取该页面的天气数据。

爬虫代码

1. 导入库

python

import requests
from bs4 import BeautifulSoup

2. 发起请求

python

url = '目标城市天气页面的URL'
response = requests.get(url)
response.encoding = 'utf-8'  # 确保中文字符正确显示

3. 解析HTML

python

soup = BeautifulSoup(response.text, 'html.parser')

4. 提取数据

假设我们要抓取的是天气、温度和风速信息，这些信息可能包含在特定的HTML标签中。

python

weather_info = soup.find('div', class_='weather-info')  # 根据实际页面结构调整
weather = weather_info.find('span', class_='weather').text
temperature = weather_info.find('span', class_='temperature').text
wind = weather_info.find('span', class_='wind').text

print(f'天气：{weather}')
print(f'温度：{temperature}')
print(f'风速：{wind}')