Install scrapyd
pip install scrapyd
pip install scrapyd-client
After installation, run the scrapyd command to start the service; by default it listens on port 6800.
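If you want to confirm the service is up without opening a browser, you can query scrapyd's daemonstatus.json endpoint. A minimal sketch with requests, assuming scrapyd is listening on the default localhost:6800:

import requests

# Ask scrapyd for its status; when the service is reachable the response
# includes pending/running/finished job counts and the node name.
r = requests.get("http://localhost:6800/daemonstatus.json")
print(r.json())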
To deploy a Scrapy spider, first edit the scrapy.cfg file of the project you want to deploy.
Before:
# Automatically created by: scrapy startproject
#
# For more information about the [deploy] section see:
# https://scrapyd.readthedocs.io/en/latest/deploy.html
[settings]
default = biquge.settings
[deploy]
#url = http://localhost:6800/
project = biquge
After editing:
# Automatically created by: scrapy startproject
#
# For more information about the [deploy] section see:
# https://scrapyd.readthedocs.io/en/latest/deploy.html
[settings]
default = biquge.settings
[deploy:bq]  # append ":<deploy target name>" to the [deploy] section header
url = http://localhost:6800/  # uncomment this line (remove the leading #)
project = biquge
Open another cmd window, change into the directory of the Scrapy project to be deployed (the one containing scrapy.cfg), and run:
scrapyd-deploy bq -p biquge
scrapyd-deploy packages the project and uploads it to the scrapyd server; the output shows the server's response.
You can also check the result in the scrapyd web interface at http://localhost:6800.
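Besides the web page, the deployment can be checked programmatically through listprojects.json and listspiders.json. A small sketch, assuming the project was deployed under the name biquge:

import requests

BASE = "http://localhost:6800"

# "biquge" should appear in the project list after a successful deploy.
print(requests.get(BASE + "/listprojects.json").json())

# The spiders contained in the deployed project (should include biquge_spider).
print(requests.get(BASE + "/listspiders.json", params={"project": "biquge"}).json())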
Run
The general format is:
curl http://localhost:6800/schedule.json -d project=default -d spider=somespider
Run the deployed spider:
curl http://localhost:6800/schedule.json -d project=biquge -d spider=biquge_spider
project=biquge        # project name
spider=biquge_spider  # spider name
After the command runs, scrapyd returns a JSON response containing a jobid; keep it, because stopping the job requires it.
Stop
curl http://localhost:6800/cancel.json -d project=biquge -d job=87384812df5311ec8f257470fd3a1483
# job is the jobid returned when the spider was scheduled
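If you did not note the jobid when the spider was scheduled, listjobs.json returns the pending, running and finished jobs of a project; the "id" field of a running job is the value to pass to cancel.json. A small sketch, assuming the project name biquge:

import requests

# Look up the project's jobs; each entry's "id" is a jobid usable with cancel.json.
r = requests.get("http://localhost:6800/listjobs.json", params={"project": "biquge"})
print(r.json())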
Running, stopping and deleting the deployed spider from a Python script:
import requests


def open():
    # Schedule (run) the deployed spider through schedule.json.
    url = "http://localhost:6800/schedule.json"
    data = {
        'project': 'biquge',
        'spider': 'biquge_spider'
    }
    r = requests.post(url, data=data)
    text = r.json()
    print(text)  # the response contains the jobid of the new run


def close():
    # Cancel a running job through cancel.json (not schedule.json).
    url = "http://localhost:6800/cancel.json"
    data = {
        'project': 'biquge',
        'job': '87384812df5311ec8f257470fd3a1483'  # jobid returned by schedule.json
    }
    r = requests.post(url, data=data)
    text = r.json()
    print(text)


def delete():
    # Remove the project from scrapyd, equivalent to:
    # curl http://localhost:6800/delproject.json -d project=biquge
    url = "http://localhost:6800/delproject.json"
    data = {
        "project": "biquge"
    }
    r = requests.post(url, data=data)
    print(r.json())


if __name__ == '__main__':
    delete()
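As written, the script only calls delete(), which removes the whole biquge project from scrapyd. To schedule the spider, call open() instead and note the jobid printed in its response; that jobid is what close() needs (here it is hard-coded), and delete() should only be run when you really want to unregister the project.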