Python爬取优美图库的图片并下载到img文件夹中

最新推荐文章于 2022-11-15 19:34:11 发布

zycsdhr

最新推荐文章于 2022-11-15 19:34:11 发布

阅读量679

点赞数 1

分类专栏：爬虫文章标签： python 爬虫

本文链接：https://blog.csdn.net/qq_40071924/article/details/115525117

版权

爬虫专栏收录该内容

4 篇文章 0 订阅

订阅专栏

该博客演示了如何使用Python爬取优美图库的图片并下载到本地img文件夹。首先获取主页面的HTML，然后提取子页面链接，接着解析子页面找到图片URL，最后下载图片并保存。整个过程涉及requests和BeautifulSoup库的使用。

摘要由CSDN通过智能技术生成

Python爬取优美图库的图片并下载到img文件夹中

# coding:utf-8
# 1.拿到主页面的源代码,然后提取到子页面的链接地址 href
# 2.通过href拿到子页面内容，从子页面中找到图片的下载地址 img->src
# 3.下载图片
import requests
from bs4 import BeautifulSoup
import time

url = "https://www.umei.cc/bizhitupian/weimeibizhi/"
resp = requests.get(url)
resp.encoding='utf-8'#处理乱码
# print(resp.text)
#把源代码交给BeautifulSoup
main_page=BeautifulSoup(resp.text,"html.parser")
alist=main_page.find("div",class_="TypeList").find_all("a")#拿范围，第一次缩小
# print(alist)
for a in alist:
    href=a.get("href")
    #拿到子页面的源代码
    child_page_resp=requests.get(href)
    child_page_resp.encoding="utf-8"
    child_page_resp_text=child_page_resp.text
    #从子页面中拿到图片的下载链接
    child_page=BeautifulSoup(child_page_resp_text,"html.parser")
    p=child_page.find("p",align="center")
    img=p.find("img")
    src=img.get("src")
    #下载图片
    img_resp=requests.get(src)
    # img_resp.content#这里拿到的字节
    img_name=src.split("/")[-1] #拿到url中的最后一个/以后的内容
    with open("img/" + img_name,mode="wb") as f:
        f.write(img_resp.content) #图片内容写入到文件
    print("over！",img_name)
    time.sleep(1)
print("all_over!")

zycsdhr

关注

1
点赞
踩
5

收藏

觉得还不错? 一键收藏
0
评论
Python爬取优美图库的图片并下载到img文件夹中

Python爬取优美图库的图片并下载到img文件夹中# coding:utf-8# 1.拿到主页面的源代码,然后提取到子页面的链接地址 href# 2.通过href拿到子页面内容，从子页面中找到图片的下载地址 img->src# 3.下载图片import requestsfrom bs4 import BeautifulSoupimport timeurl = "https://www.umei.cc/bizhitupian/weimeibizhi/"resp = requests
复制链接

扫一扫