Practice: Scraping Jandan (jandan.net) News with Python

Latest recommended article published on 2020-06-12 19:30:03

小猪乱撞

Article link: https://blog.csdn.net/qq_33898918/article/details/89396649


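import requests
from lxml import etree

# Browser-like User-Agent so the site serves the regular HTML page.
HEADERS = {
    'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/73.0.3683.103 Safari/537.36'
}

# Walk through list pages 1 to 300 of jandan.net.
for page in range(1, 301):
    response = requests.get(
        url='http://jandan.net/page/%d' % page,
        headers=HEADERS,
        timeout=10,  # avoid hanging forever on a stalled connection
    )
    # Parse the HTML and select every post block on the page.
    eroot = etree.HTML(response.text)
    div_list = eroot.xpath('//div[@class="indexs"]')
    for div in div_list:
        item = {}
        # xpath() returns a list of text nodes; the key "新闻" means "news".
        item["新闻"] = div.xpath('./h2/a/text()')
        print(item)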
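
The extraction step can also be tried offline, without sending any requests. The following is only a minimal sketch: `SAMPLE_HTML` is made-up markup that is assumed to mirror the `div.indexs` / `h2` / `a` structure the XPath above targets, and `extract_titles` is a hypothetical helper name, not part of the original script. The live site's markup may differ.

```python
from lxml import etree

# Hypothetical sample markup (an assumption, not copied from jandan.net).
SAMPLE_HTML = """
<html><body>
  <div class="indexs"><h2><a href="/p/1">First headline</a></h2></div>
  <div class="indexs"><h2><a href="/p/2">Second headline</a></h2></div>
  <div class="other"><h2><a href="/p/3">Not a post</a></h2></div>
</body></html>
"""


def extract_titles(html):
    """Return the headline text of every div with class "indexs" as a flat list."""
    root = etree.HTML(html)
    titles = []
    for div in root.xpath('//div[@class="indexs"]'):
        # xpath() returns a list of text nodes; strip whitespace and skip empties.
        for text in div.xpath('./h2/a/text()'):
            text = text.strip()
            if text:
                titles.append(text)
    return titles


print(extract_titles(SAMPLE_HTML))
```

Keeping the parsing in its own function means the XPath expressions can be checked against a saved page before running the full 300-page loop.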
