获取哔哩哔哩网站的排行榜

最新推荐文章于 2024-09-05 10:28:02 发布

薄荷味的鱼

最新推荐文章于 2024-09-05 10:28:02 发布

阅读量417

点赞数

分类专栏： python爬虫文章标签： xpath python

本文链接：https://blog.csdn.net/weixin_45540019/article/details/107811538

版权

python爬虫专栏收录该内容

3 篇文章 0 订阅

订阅专栏

爬取下边五个榜单

# 用requests模块做请求，页面通过etree类将html字符串转化为Element对象，以便我们使用xpath解析页面（requests,etree,xpath）

import requests
from lxml import etree

def bilibili(str):
html=requests.get(str).text
doc=etree.HTML(html)
result=doc.xpath('//div[@class="info"]/a/text()')
x=0
for results in result:
# 计数
x=x+1
print(results)
print(x)

# 全站榜
bilibili('https://www.bilibili.com/ranking/all/0/0/3')
#原创榜
bilibili('https://www.bilibili.com/ranking/origin/0/0/3')
#新番榜
bilibili('https://www.bilibili.com/ranking/bangumi/13/0/3')
#影视榜
bilibili('https://www.bilibili.com/ranking/cinema/177/0/3')
#新人榜
bilibili('https://www.bilibili.com/ranking/rookie/0/0/3')

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

薄荷味的鱼

关注关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
获取哔哩哔哩网站的排行榜

爬取下边五个榜单用requests模块做请求，页面通过etree类将html字符串转化为Element对象，以便我们使用xpath解析页面（requests,etree,xpath）import requestsfrom lxml import etreedef bilibili(str): html=requests.get(str).text doc=etree.HTML(html) result=doc.xpath('//div[@clas...
复制链接

扫一扫