python 批量下载图片

最新推荐文章于 2024-05-01 21:59:49 发布

enjoyhanziyimei

最新推荐文章于 2024-05-01 21:59:49 发布

阅读量238

点赞数

本文链接：https://blog.csdn.net/enjoyhanziyimei/article/details/119077708

版权

使用requests模块，批量下载图片；

注：分析url的组成，可以实现全站下载

# -*- coding: utf-8 -*-
import requests
import re
import os
import csv

# 设置保存路径
path = r'D:\Project_python\picture_1'
# 目标url
url = "http://pic.netbian.com/4kmeinv/index.html"
# 伪装请求头  防止被反爬
headers = {
    "User-Agent": "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.1 (KHTML, like Gecko) Chrome/21.0.1180.89 Safari/537.1",
    "Referer": "http://pic.netbian.com/4kmeinv/index.html"
}

# 发送请求  获取响应
response = requests.get(url, headers=headers)
# 打印网页源代码来看  乱码   重新设置编码解决编码问题
# 内容正常显示  便于之后提取数据
response.encoding = 'GBK'

# 正则匹配提取想要的数据  得到图片链接和名称
img_info = re.findall('img src="(.*?)" alt="(.*?)" /', response.text)

#urc/name写入log_file
if os.path.isfile('log_file.csv') == False:
    fp = open('log_file.csv','a+')
    fp.write('img_url'+','+'img_name'+','+'img_path' + '\n')

for src, name in img_info:
    img_url = 'http://pic.netbian.com' + src   # 加上 'http://pic.netbian.com'才是真正的图片url
    img_content = requests.get(img_url, headers=headers).content
    img_name = name + '.jpg'
    img_path = os.path.join(path,img_name)
    fp.write(img_url + ',' +img_name + ','+ img_path + '\n')
    with open(img_path, 'wb') as f:     # 图片保存到本地
        print(f"正在为您下载图片：{img_name}")
        f.write(img_content)

fp.close()

enjoyhanziyimei

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
python 批量下载图片

使用requests模块，批量下载图片；注：分析url的组成，可以实现全站下载# -*- coding: utf-8 -*-import requestsimport reimport osimport csv# 设置保存路径path = r'D:\Project_python\picture_1'# 目标urlurl = "http://pic.netbian.com/4kmeinv/index.html"# 伪装请求头防止被反爬headers = { "U
复制链接

扫一扫