Python爬虫爬取新浪微博热搜

最新推荐文章于 2024-06-17 10:42:46 发布

和小灰

最新推荐文章于 2024-06-17 10:42:46 发布

阅读量2.2k

点赞数 3

文章标签： python 爬虫

本文链接：https://blog.csdn.net/qq_47880276/article/details/113572305

版权

Python爬虫爬取新浪微博热搜

文章目录

- Python爬虫爬取新浪微博热搜
网页分析
数据爬取
数据存储
全部代码

网页分析

在这里插入图片描述
找到热搜的排名，标题和热度，发现它们在同一路径

数据爬取

import requests
from lxml import etree
url= 'https://s.weibo.com/top/summary?Refer=top_hot&topnav=1&wvr=6'
#print(response.text)
headers={
    'User-Agent': 'Mozilla/5.0 (Wind ows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/88.0.4324.104 Safari/537.36'
}
response=requests.get(url,headers=headers)
html=etree.HTML(response.text)
datas=html.xpath('//*[@id="pl_top_realtimehot"]/table/tbody/tr')
for data in datas:
    data_title=data.xpath('td[2]/a/text()'

最低0.47元/天解锁文章

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

和小灰

关注关注

3
点赞
踩
22

收藏

觉得还不错? 一键收藏
1
评论
Python爬虫爬取新浪微博热搜

Python爬虫爬取新浪微博热搜文章目录Python爬虫爬取新浪微博热搜网页分析数据爬取数据存储全部代码网页分析找到热搜的排名，标题和热度，发现它们在同一路径数据爬取import requestsfrom lxml import etreeurl= 'https://s.weibo.com/top/summary?Refer=top_hot&topnav=1&wvr=6'#print(response.text)headers={ 'User-Agent':
复制链接

扫一扫