Python Web Scraping: Scraping a Weather Site with BeautifulSoup

Ais永恒

Category: Python. Tags: python, BeautifulSoup

Article link: https://blog.csdn.net/xinxin172170185/article/details/84646381

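This article shows how to use Python's BeautifulSoup library to scrape data from a weather website, walks through the scraper's implementation with a code example, and provides a link to the project's GitHub repository for reference.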

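Before the full scraper, here is a minimal offline sketch of the table-walking pattern it relies on: find the container `div`, skip the leading header rows of each table, and shift the city column by one in the first data row. The sample HTML, the `extract_cities` helper, and the assumption that the last cell holds the minimum temperature are all made up for illustration and are not taken from the real site. It uses the built-in `html.parser` so that it runs without extra parser packages.

```python
from bs4 import BeautifulSoup

# Hypothetical sample markup, loosely shaped like a forecast table.
SAMPLE_HTML = """
<div class="conMidtab">
  <table>
    <tr><td>header row 1</td></tr>
    <tr><td>header row 2</td></tr>
    <tr><td>Province A</td><td>City 1</td><td>-3</td></tr>
    <tr><td>City 2</td><td>-5</td></tr>
  </table>
</div>
"""


def extract_cities(html):
    """Return a list of {"city", "min_temp"} dicts from the sample markup."""
    soup = BeautifulSoup(html, "html.parser")
    container = soup.find("div", class_="conMidtab")
    results = []
    for table in container.find_all("table"):
        # Skip the two header rows at the top of each table.
        rows = table.find_all("tr")[2:]
        for index, row in enumerate(rows):
            cells = row.find_all("td")
            # The first data row carries an extra leading cell (the province),
            # so the city sits in the second cell there.
            city_cell = cells[1] if index == 0 else cells[0]
            city = next(city_cell.stripped_strings)
            # Assumption for this sample only: the last cell is the minimum temperature.
            min_temp = int(next(cells[-1].stripped_strings))
            results.append({"city": city, "min_temp": min_temp})
    return results


print(extract_cities(SAMPLE_HTML))
```

Using `stripped_strings` rather than `.string` keeps the sketch working when a cell wraps its text in nested tags such as a link.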

The code is as follows:

import requests
from lxml import etree
from bs4 import BeautifulSoup
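# Note: this import path is the pyecharts 0.5.x style; pyecharts 1.x and later
# expose the class as `from pyecharts.charts import Bar`.
from pyecharts import Bar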

ALL_DATA = []


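def parse_page(url):
    # Fetch one forecast page and walk its tables row by row.
    # A browser-like User-Agent is sent, presumably so the site serves the normal page.
    headers = {
        "User-Agent": "Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/55.0.2883.87 Safari/537.36"
    }
    # The timeout keeps the request from hanging indefinitely.
    response = requests.get(url, headers=headers, timeout=10)
    text = response.content.decode("UTF-8")
    # html5lib is a lenient parser that tolerates malformed HTML;
    # it must be installed separately (pip install html5lib).
    soup = BeautifulSoup(text, 'html5lib')
    conMidtab = soup.find('div', class_='conMidtab')
    tables = conMidtab.find_all('table')
    for table in tables:
        # Skip the first two rows of each table (assumed to be header rows).
        trs = table.find_all('tr')[2:]
        for index, tr in enumerate(trs):
            tds = tr.find_all("td")
            city_td = tds[0]
            # In the first data row the city is read from the second cell;
            # presumably the first cell holds the province or region name.
            if index == 0:
                city_td = tds[1]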
            city =