https://doc.scrapy.org/en/latest/topics/api.html#crawler-api
Method | Description |
---|---|
crawl(crawler_or_spidercls, *args, **kwargs) | Runs a crawler with the provided arguments. |
crawlers | The set of crawlers started by crawl() and managed by this class. |
create_crawler(crawler_or_spidercls) | Returns a Crawler object. |
join() | Returns a deferred that is fired when all managed crawlers have completed their executions. |
start(stop_after_crawl=True) | Starts a Twisted reactor. If stop_after_crawl is True, the reactor is stopped once all crawlers have finished. |
stop() | Stops all crawlers in progress. |
```python
from scrapy.crawler import CrawlerProcess
from scrapy.utils.project import get_project_settings

process = CrawlerProcess(get_project_settings())

# 'followall' is the name of one of the spiders of the project.
process.crawl('followall', domain='scrapinghub.com')
process.start()  # the script will block here until the crawling is finished
```