python bs4爬取腾讯新闻简单练习版

最新推荐文章于 2024-05-22 21:02:56 发布

wwxy261

最新推荐文章于 2024-05-22 21:02:56 发布

阅读量598

点赞数

分类专栏：爬虫

本文链接：https://blog.csdn.net/wwxy1995/article/details/80918324

版权

爬虫专栏收录该内容

5 篇文章 0 订阅

订阅专栏

import requests
from bs4 import BeautifulSoup
import pandas

res = requests.get("http://news.qq.com/")
soup = BeautifulSoup(res.text, 'html.parser')
newsary = []
for news in soup.select('.Q-tpWrap .text'):
    newsary.append({"title":news.select('a')[0].text,"url":news.select('a')[0]['href']})

newsdf = pandas.DataFrame(newsary)
newsdf.to_excel("news.xlsx")

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

wwxy261

关注关注

0
点赞
踩
2

收藏

觉得还不错? 一键收藏
0
评论
python bs4爬取腾讯新闻简单练习版

import requestsfrom bs4 import BeautifulSoupimport pandasres = requests.get("http://news.qq.com/")soup = BeautifulSoup(res.text, 'html.parser')newsary = []for news in soup.select('.Q-tpWrap .te...
复制链接

扫一扫