scrapy 下载并保存图片

最新推荐文章于 2024-03-22 08:12:01 发布

life1024

最新推荐文章于 2024-03-22 08:12:01 发布

阅读量3.4k

点赞数

分类专栏：爬虫

本文链接：https://blog.csdn.net/u013378306/article/details/70161044

版权

爬虫专栏收录该内容

25 篇文章 0 订阅 ¥19.90 ¥99.00

订阅专栏

超级会员免费看

自定义一个pipeline

# 图片下载类
class ImageDownloadPipeline(object):

    def process_item(self, item, spider):
        global img_index
        #if 'image_urls' in item:  # 如何‘图片地址’在项目中
        imgPath="/home/abc/image"  # 下载图片的保存路径
        if not os.path.isdir(imgPath):
            os.mkdir(imgPath)

        for url in item["image_urls"]:
            print("下载:", url)
            # 未能正确获得网页 就进行异常处理
            try:
                res = urllib2.urlopen(url)
                if str(res.status) != '200':
                    print('未下载成功：', url)
                    continue
            except Exception as e:
                print('未下载成功：', url)
            filename = os.path.join(imgPath, str(img_index) + '.jpg')
            with open(filename, 'wb') as f:
                f.write(res.read())

了解本专栏

超级会员免费看

life1024

关注

0
点赞
踩
1

收藏

觉得还不错? 一键收藏
打赏
0
评论
scrapy 下载并保存图片

自定义一个pipeline# 图片下载类class ImageDownloadPipeline(object): def process_item(self, item, spider): global img_index #if 'image_urls' in item: # 如何‘图片地址’在项目中 imgPat
复制链接

扫一扫