循环中注意递增公式位置的(爬取**文章为例)

循环中注意递增公式位置(爬取**文章为例)

一、取出后循环(取出了两页内容)

import requests
from bs4 import BeautifulSoup

url1 = 'https://www.……articles'#网址

headers={'user-agent':'Mozilla/5.0 (Macintosh; Intel Mac OS X 10_13_6) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/71.0.3578.98 Safari/537.36'}#请求头

offset = 10#初始值
while True:#循环,取多页
    params = {
        'include': 'data[*].comment_count,suggest_edit,is_normal,thumbnail_extra_info,thumbnail,can_comment,comment_permission,admin_closed_comment,content,voteup_count,created,updated,upvoted_followees,voting,review_info,is_labeled,label_info;data[*].author.badge[?(type=best_answerer)].topics',
        'offset': str(offset),
        'limit': '10',
        'sort_by': 'created'}
    
    res = requests.get(url1,headers = headers,params = params)
    
    articles1 = res.json()
    articles = articles1['data']    
   
    for i in articles:
        title = [i['title']]
        print(title)  
        
    **offset = offset + 20    #注:取一次,递增一次
    if offset > 30:
        break**

二、循环后取出(取出一页内容)

import requests
from bs4 import BeautifulSoup

url1 = 'https://www.……articles'#网址

headers={'user-agent':'Mozilla/5.0 (Macintosh; Intel Mac OS X 10_13_6) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/71.0.3578.98 Safari/537.36'}#请求头

offset = 10#初始值
while True:#循环,取多页
    params = {
        'include': 'data[*].comment_count,suggest_edit,is_normal,thumbnail_extra_info,thumbnail,can_comment,comment_permission,admin_closed_comment,content,voteup_count,created,updated,upvoted_followees,voting,review_info,is_labeled,label_info;data[*].author.badge[?(type=best_answerer)].topics',
        'offset': str(offset),
        'limit': '10',
        'sort_by': 'created'}
    
    res = requests.get(url1,headers = headers,params = params)
    
    articles1 = res.json()
    articles = articles1['data']    
    
    **offset = offset + 20    #注:取第二页之前,已经停止
    if offset > 30:
        break**
           
    for i in articles:
        title = [i['title']]
        print(title)  
        

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值