Debugging Scrapy

Latest recommended article published 2023-11-23 17:38:05

YGR1123打SD

Article link: https://blog.csdn.net/qq_38620956/article/details/98608811

import scrapy
from scrapy.selector import Selector

# CnblogspiderItem is the project's own Item class; import it from the project's items module.

def parse(self, response):
    papers = response.xpath(".//*[@class='day']")
    # Temporary debugging hook: pause the crawl here and open a shell on this response.
    from scrapy.shell import inspect_response
    inspect_response(response, self)
    for paper in papers:
        url = paper.xpath(".//*[@class='postTitle']/a/@href").extract()[0]
        title = paper.xpath(".//*[@class='postTitle']/a/text()").extract()[0]
        time = paper.xpath(".//*[@class='dayTitle']/a/text()").extract()[0]
        content = paper.xpath(".//*[@class='postTitle']/a/text()").extract()[0]
        item = CnblogspiderItem(url=url, title=title, time=time, content=content)
        # Pass the partially filled item on to the detail-page callback.
        request = scrapy.Request(url=url, callback=self.parse_body)
        request.meta['item'] = item
        yield request
    # "下一页" is the literal "next page" link text on the site.
    next_page = Selector(response).re(r'<a href="(\S*)">下一页</a>')
    if next_page:
        yield scrapy.Request(url=next_page[0], callback=self.parse)

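Run the spider with scrapy crawl spidername. Execution pauses at the inspect_response() line and opens an interactive shell on the current response, where you can debug. Press Ctrl-D (Ctrl-Z on Windows) to leave the shell and resume the crawl.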
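Once the shell is open, one thing worth checking is whether the next-page pattern from parse() actually matches the page. The sketch below reproduces that check with only the standard library, so it can be tried outside a crawl. The HTML fragment, the example.com URL, and the helper name find_next_page are assumptions made for illustration, and re.findall only approximates what Selector(response).re(...) returns.

```python
import re

# Hypothetical HTML fragment standing in for response.text; the real markup may differ.
sample_html = '<div id="pager"><a href="https://example.com/page/2">下一页</a></div>'

# Same pattern as in parse(); "下一页" is the literal "next page" link text.
NEXT_PAGE_PATTERN = r'<a href="(\S*)">下一页</a>'


def find_next_page(html):
    """Return the first next-page URL found in html, or None if there is none."""
    matches = re.findall(NEXT_PAGE_PATTERN, html)
    return matches[0] if matches else None


next_url = find_next_page(sample_html)
last_url = find_next_page('<div>no pager here</div>')
print(next_url)
print(last_url)
```

If the pattern returns nothing on a page that clearly has a next-page link, the markup probably differs from what the regex expects (extra attributes on the tag, for example), which is the kind of mismatch the paused shell makes easy to spot.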

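Open question: should REDIRECT_ENABLED = False be set in the project settings? (This setting turns off Scrapy's redirect middleware, so 3xx responses are no longer followed automatically.)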
