python爬虫练习--爬取某城市历史气象数据（待优化）

最新推荐文章于 2024-08-14 11:10:33 发布

An_Ji_Yang

最新推荐文章于 2024-08-14 11:10:33 发布

阅读量3.1k

点赞数

分类专栏： python学习文章标签： python 爬虫气象数据

本文链接：https://blog.csdn.net/yaj13346943285/article/details/71642646

版权

python学习专栏收录该内容

8 篇文章 0 订阅

订阅专栏

# -*- coding=utf-8 -*-
from __future__ import print_function  
import urllib.request  
from bs4 import BeautifulSoup  
  
strYear = '2013'              
strFile = 'zhengzhou' + strYear + '.csv'  
f = open(strFile, 'w')  
  
for month in range(1, 13):  
    if(month < 10):  
        strMonth = '0' + str(month)  
    else:  
        strMonth = str(month)  
    strYearMonth = strYear + strMonth  
    print("\nGetting data for month" + strYearMonth + "...", end='')  
      
    url  = "http://lishi.tianqi.com/beijing/"+strYearMonth+".html"  
    page = urllib.request.urlopen(url)
    #创建BeautifulSoup对象	
    soup = BeautifulSoup(page, "html.parser")  
    weatherSet = soup.find(attrs={"class":"tqtongji2"})  
    if(weatherSet == None):  
        print("fail to get the page", end='')   
        continue  
      
    for line in weatherSet.contents:  
        if(line.__class__.__name__ == 'NavigableString'): continue  
        if(len(line.attrs) > 0): continue  
        lis = line.findAll('li')  
        strDate = lis[0].text  
        highWeather = lis[1].text  
        lowWeather  = lis[2].text
        weather = lis[3].text
        windDirection = lis[4].text
        windPower = lis[5].text		
        f.write(strDate +',' + lowWeather +',' + highWeather + ','+weather + ',' +
		windDirection + ',' + windPower +'\n')  
    print("done", end='')  
      
f.close()

参考资料：http://cuiqingcai.com/1319.html

An_Ji_Yang

关注

0
点赞
踩
5

收藏

觉得还不错? 一键收藏
0
评论
复制链接

分享到 QQ

分享到新浪微博

扫一扫

专栏目录