python 爬虫小程序

最新推荐文章于 2024-09-09 00:00:00 发布

dihong2558

最新推荐文章于 2024-09-09 00:00:00 发布

阅读量73

点赞数

文章标签： python 爬虫

原文链接：http://www.cnblogs.com/aeronfay/articles/4892867.html

版权

 1 import urllib
 2 import re
 3 
 4 #读取网页内容
 5 def getHtml(url):
 6 
 7     return urllib.urlopen(url).read()
 8 #获取图片
 9 def getImg(html):
10     reg = r'src="(.+?\.jpg)" pic_ext'
11     imgre = re.compile(reg)
12     imagelist = re.findall(imgre,html)
13     x= 0
14     for imgurl in imagelist:
15         urllib.urlretrieve(imgurl,'%s.jpg' % x)
16         x+=1
17         #图片地址
18         print(imgurl)
19 
20 html = getHtml("http://www.baidu.com")
21 
22 getImg(html)

转载于:https://www.cnblogs.com/aeronfay/articles/4892867.html

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

dihong2558

关注关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
python 爬虫小程序

1 import urllib 2 import re 3 4 #读取网页内容 5 def getHtml(url): 6 7 return urllib.urlopen(url).read() 8 #获取图片 9 def getImg(html):10 reg = r'src="(.+?\.jpg)" pic_ext'1...
复制链接

扫一扫