Python 3 Web Scraping: Images

Latest recommended article published on 2021-02-12 11:03:43

LL_Lerrety

Category: Python web scraping. Tags: python, scraper scripts

Article link: https://blog.csdn.net/LL_Lerrety/article/details/71123043

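I've recently gotten hooked on Python web scraping, so I'll keep posting here what I've been building along the way.
This first post uses Python 3 to download the images in the main body of a web page.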

Reference page: http://www.jianshu.com/p/696922f268df
Page the images come from: http://pic.yesky.com/c/6_243.shtml
[Image: screenshot of the page with the target images circled]

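The goal is to fetch the part circled with the highlighter above. Before running the script, create a folder named picture in the same directory as the script; the images are saved there.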
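If you would rather not create the folder by hand, the standard library can do it from the script. The snippet below is only a sketch: `ensure_dir` is a hypothetical helper name that is not part of the original script, and it runs inside a temporary directory so that it leaves nothing behind.

```python
import os
import tempfile

def ensure_dir(path):
    """Create the directory if it is missing; do nothing if it already exists."""
    os.makedirs(path, exist_ok=True)
    return path

# Demonstrated inside a temporary directory so nothing is left on disk
with tempfile.TemporaryDirectory() as base:
    target = ensure_dir(os.path.join(base, 'picture'))
    ensure_dir(target)                  # calling it again is harmless
    created = os.path.isdir(target)
```

In the scraper itself, a single `os.makedirs('picture', exist_ok=True)` before the download loop would presumably be enough.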
Here is the full script:

# coding=utf-8
import re
import urllib.request

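def download_page(url):
    """Fetch a URL and return the raw response body as bytes."""
    request = urllib.request.Request(url)                # build the request
    with urllib.request.urlopen(request) as response:    # send it and get the server's response
        return response.read()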

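def get_image(html):
    # Thumbnail URLs on the page; dots are escaped so they match literally,
    # and the non-greedy \S*? stops at the first ".jpg"
    regx = r'http://dynamic-image\.yesky\.com/185x247/uploadImages/2017/1\S*?\.jpg'
    pattern = re.compile(regx)
    # The image URLs are plain ASCII, so bytes that fail to decode can be ignored
    get_img = pattern.findall(html.decode('utf-8', errors='ignore'))
    for num, img in enumerate(get_img, start=1):
        image = download_page(img)          # fetch the image
        with open('picture/%s.jpg' % num, 'wb') as fp:
            fp.write(image)
        print('Downloaded image %s' % num)
    print('All images downloaded')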

url = 'http://pic.yesky.com/c/6_243.shtml'
html = download_page(url)                   # fetch the page
get_image(html)
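The extraction step can be tried without touching the network. The sketch below runs a regular expression of the same shape as the one in the script against a made-up HTML string. The sample markup, the file names in it, and the `extract_image_urls` helper are all assumptions for illustration, and the real page may be laid out differently. Removing duplicates is an optional extra that the script above does not do.

```python
import re

# Same idea as the script's pattern: escaped dots and a non-greedy match
IMG_PATTERN = re.compile(
    r'http://dynamic-image\.yesky\.com/185x247/uploadImages/2017/1\S*?\.jpg'
)

def extract_image_urls(html_text):
    """Return the matching image URLs in page order, without duplicates."""
    seen = set()
    urls = []
    for url in IMG_PATTERN.findall(html_text):
        if url not in seen:
            seen.add(url)
            urls.append(url)
    return urls

# Made-up markup for illustration only; the real page layout may differ
sample_html = (
    '<img src="http://dynamic-image.yesky.com/185x247/uploadImages/2017/100/a.jpg">'
    '<img src="http://dynamic-image.yesky.com/185x247/uploadImages/2017/101/b.jpg">'
    '<img src="http://dynamic-image.yesky.com/185x247/uploadImages/2017/100/a.jpg">'
    '<img src="http://dynamic-image.yesky.com/640x480/uploadImages/2017/100/c.jpg">'
)
urls = extract_image_urls(sample_html)
```

Here the repeated first URL is kept only once, and the last one is skipped because its size segment is not 185x247. Checking the pattern on a small string like this is a quick way to see what it picks up before downloading anything.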