2020-12-29 01:45:47 [scrapy.statscollectors] INFO: Dumping Scrapy stats:
{'downloader/request_bytes': 309977,
'downloader/request_count': 609,
'downloader/request_method_count/GET': 609,
'downloader/response_bytes': 1549878,
'downloader/response_count': 609,
'downloader/response_status_count/200': 333,
'downloader/response_status_count/302': 276,
'elapsed_time_seconds': 303.197538,
'finish_reason': 'finished',
'finish_time': datetime.datetime(2020, 12, 28, 17, 45, 47, 574530),
'item_scraped_count': 275,
'log_count/INFO': 13,
'request_depth_max': 58,
'response_received_count': 609,
'scheduler/dequeued': 609,
'scheduler/dequeued/memory': 609,
'scheduler/enqueued': 609,
'scheduler/enqueued/memory': 609,
36%|███▌ | 36/100 [00:00<00:00, 358.73it/s]
2020-12-29 01:45:47 [scrapy.core.engine] INFO: Spider closed (finished)
My guess is that Scrapy's robots.txt compliance was not turned off?
The relevant setting is in settings.py.
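The settings snippet itself did not survive in this copy of the post. As a minimal sketch, assuming the intended change is Scrapy's standard `ROBOTSTXT_OBEY` setting (an assumption, not the author's confirmed code), the line in settings.py would probably look like this:

```python
# settings.py (sketch; the original snippet is missing, so this is an assumption)
# Scrapy's project template generates ROBOTSTXT_OBEY = True.
# Setting it to False stops Scrapy from filtering requests against
# the target site's robots.txt.
ROBOTSTXT_OBEY = False
```

The same setting can also be overridden for a single run with `scrapy crawl <spider> -s ROBOTSTXT_OBEY=False`, or per spider through the `custom_settings` class attribute.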





