Downloading Images from a Web Page with Python 3.5

Latest recommended article published 2024-05-01 21:59:49

vodepan

Category: python

Article link: https://blog.csdn.net/vodepan/article/details/79969723

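Steps:

1. First, get the src attribute of each image to download

Use BeautifulSoup to find the img tags inside the article body, then read each tag's src address from its attrs attribute. Note that some addresses are incomplete (they are relative paths), so the site address has to be prepended before they are used later.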

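import os
import requests
from bs4 import BeautifulSoup

def getPicUrls(url):
    """Return the img tags found in the article body, or [] on failure."""
    try:
        r = requests.get(url, timeout=15)
        r.raise_for_status()
        soup = BeautifulSoup(r.text, 'html.parser')
        # On this site the article body is the div with class "wenzhangcontent"
        content = soup.find('div', {'class': 'wenzhangcontent'})
        if content is None:
            return []
        return content.find_all('img')
    except Exception as e:
        print(e)
        return []  # keep the return type consistent so callers can iterate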
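
The note above about incomplete addresses can also be handled with the standard library. The sketch below is one possible approach, not the author's method: it resolves each src against the page URL with `urllib.parse.urljoin`, so relative and absolute addresses are treated the same way. The helper name `to_absolute` and the example image paths are made up for illustration.

```python
from urllib.parse import urljoin

# Hypothetical helper, not part of the original script
def to_absolute(page_url, src):
    """Resolve an img src against the URL of the page it was found on."""
    return urljoin(page_url, src)

page = 'http://www.lyjyfw.net/Html/News/201844/tR0454108.html'

# The image paths below are invented examples
print(to_absolute(page, '/UploadFiles/a.jpg'))        # root-relative path
print(to_absolute(page, 'pics/b.jpg'))                # path relative to the page
print(to_absolute(page, 'http://example.com/c.jpg'))  # already absolute, unchanged
```

Compared with prepending a fixed domain, this avoids producing a broken address when a src is already a full URL.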

2. Download the images with open and write

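if __name__ == '__main__':
    localPath = 'd:/py_pics/'
    os.makedirs(localPath, exist_ok=True)  # create the target folder if needed
    domain = 'http://www.lyjyfw.net/'
    # Fall back to an empty list if no img tags were returned
    picUrls = getPicUrls('http://www.lyjyfw.net/Html/News/201844/tR0454108.html') or []
    for i, item in enumerate(picUrls):
        # The src values on this page are incomplete, so prepend the site address
        picUrl = domain + item.attrs.get('src', '')
        try:
            pic = requests.get(picUrl, timeout=15)
            pic.raise_for_status()  # do not save an error page as an image
            with open(localPath + '{}.jpg'.format(i), 'wb') as f:
                f.write(pic.content)  # content holds the raw bytes of the response
            print('Downloaded image {:d}: {:s}'.format(i + 1, picUrl))
        except Exception as e:
            print('Failed to download image {:d}: {:s}'.format(i + 1, picUrl))
            print(e)
            continue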
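
The loop above saves every file with a .jpg extension, whatever the source format is. As a possible refinement, and only a sketch under that assumption, the extension could be taken from the image URL. The helper name `pic_filename` and the example URLs are hypothetical.

```python
import os
from urllib.parse import urlparse

# Hypothetical helper, not part of the original script
def pic_filename(index, pic_url, default_ext='.jpg'):
    """Build a local file name that keeps the extension found in the URL."""
    path = urlparse(pic_url).path            # ignores any ?query or #fragment
    ext = os.path.splitext(path)[1].lower()  # '' when the path has no extension
    if not ext:
        ext = default_ext
    return '{}{}'.format(index, ext)

# The URLs below are invented examples
print(pic_filename(0, 'http://www.lyjyfw.net/pics/a.PNG?v=2'))  # 0.png
print(pic_filename(1, 'http://www.lyjyfw.net/pics/b'))          # 1.jpg
```

The result could replace `'{}.jpg'.format(i)` when building the path passed to open.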
