我在Scrapy中苦苦挣扎,而且我不明白回调之间传递项目的确切方式.也许有人可以帮助我.
def parse_page1(self, response):
item = MyItem()
item['main_url'] = response.url
request = scrapy.Request("http://www.example.com/some_page.html",
callback=self.parse_page2)
request.meta['item'] = item
return request
def parse_page2(self, response):
item = response.meta['item']
item['other_url'] = response.url
return item
我正在尝试逐步了解那里的行动流程:
[parse_page1]
> item = MyItem()> item [‘main_url’] = response.url> request = scrapy.Request(“ http://www.example.com/some_page.html”,callback = self.parse_page2)[parse_page2]
> item = response.meta [‘item’]> item [‘other_url’] = response.url>返回项目[parse_page1]
> request.meta [‘item’] =项>返回请求我浏览了所有有关scrapy和request / response / meta的文档,但是我仍然不明白第4点和第7点的情况.