关于旅游新闻网站的简单爬虫

最新推荐文章于 2024-04-10 17:39:57 发布

庸_才

最新推荐文章于 2024-04-10 17:39:57 发布

阅读量847

点赞数

分类专栏：闲来无事文章标签： python 爬虫

本文链接：https://blog.csdn.net/qq_42486070/article/details/82984844

版权

闲来无事专栏收录该内容

6 篇文章 0 订阅

订阅专栏

import requests
from bs4 import BeautifulSoup


url = "http://www.cntour.cn/"
response = requests.get(url)
content = response.text
soup = BeautifulSoup(content)
data = soup.select('#main > div > div.mtop.firstMod.clearfix > div.centerBox > ul.newsList > li > a')
for item in data:
    print("hot topic:"+item.get_text('title'))
    newurl = item.get('href')
    newresponse = requests.get(newurl)
    newsoup = BeautifulSoup(newresponse.text)
    newdata = newsoup.select('#main > div > div.newListBox.clearfix > div.leftBox > div.newShow > div.content.reset')
    all = newdata[0].find_all('p')
    for each in all:
        if(each.string == None):
            continue
        print(each.string)
    print(5*"\n")

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

庸_才

关注关注

0
点赞
踩
1

收藏

觉得还不错? 一键收藏
0
评论
关于旅游新闻网站的简单爬虫

import requestsfrom bs4 import BeautifulSoupurl = "http://www.cntour.cn/"response = requests.get(url)content = response.textsoup = BeautifulSoup(content)data = soup.select('#main &gt; div &gt...
复制链接

扫一扫