爬虫实例3:爬取微博热搜

最新推荐文章于 2024-06-17 10:42:46 发布

南巷的花猫

最新推荐文章于 2024-06-17 10:42:46 发布

阅读量1.2k

点赞数

分类专栏： python 爬虫

本文链接：https://blog.csdn.net/qq_42662411/article/details/103462643

版权

1-获取微博热搜url

weibo_url = 'https://s.weibo.com/top/summary?cate=realtimehot'

2-创建存放微博热搜目录是否存在不存在就创建

if not os.path.exists(r'd:/新浪新闻'):
    os.mkdir(r'd:/新浪新闻')

3-获取所需要的字段值

eles=selector.cssselect('tbody>tr')
ls=[]
for index, ele in enumerate(eles):
    title = ele.xpath('./td[@class="td-02"]/a/text()')[0]
    #print(title)
    url = ele.xpath('./td[@class="td-02"]/a/@href')[0]
    hot = ele.xpath('./td[@class="td-02"]/span/text()')
    #print(title,url,hot)
    cwawl_time

最低0.47元/天解锁文章

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

南巷的花猫

关注关注

0
点赞
踩
3

收藏

觉得还不错? 一键收藏
1
评论
爬虫实例3:爬取微博热搜

1-获取微博热搜urlweibo_url = 'https://s.weibo.com/top/summary?cate=realtimehot'2-创建存放微博热搜目录是否存在不存在就创建if not os.path.exists(r'd:/新浪新闻'): os.mkdir(r'd:/新浪新闻')3-获取所需要的字段值eles=selector.cssselect('tb...
复制链接

扫一扫