python3 从网页上爬取图片

最新推荐文章于 2024-05-01 14:20:05 发布

z小白

最新推荐文章于 2024-05-01 14:20:05 发布

阅读量1.1k

点赞数

分类专栏： python 文章标签： python3 爬虫

本文链接：https://blog.csdn.net/zzc15806/article/details/81666756

版权

python 专栏收录该内容

22 篇文章 9 订阅

订阅专栏

#-*- coding: UTF-8 -*-
#!/usr/python3
import urllib.request
import re
def getImage(url):
    html = urllib.request.urlopen(url).read()   # 爬取网页
    imgre = re.compile(r'src="(.+?\.jpg)"')   #匹配图片
    html = html.decode('utf-8') 
    imglist = imgre.findall(html)
    x=0
    for image in imglist:
        urllib.request.urlretrieve(image,'./image/%s.jpg' % x)
        x+=1
print(getImage("https://www.csdn.net/"))

可能遇到的问题：

1. AttributeError: module 'urllib' has no attribute 'urlopen'
解决办法：将urllib改成urllib.request
2. TypeError: cannot use a string pattern on a bytes-like object
解决办法：python3中需要使用html = html.decode('utf-8') 进行转化

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

z小白

关注关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
python3 从网页上爬取图片

#-*- coding: UTF-8 -*-#!/usr/python3import urllib.requestimport redef getImage(url): html = urllib.request.urlopen(url).read() # 爬取网页 imgre = re.compile(r'src="(.+?\.jpg)"') #匹配图片 ...
复制链接

扫一扫