Scraping Images with a Python Web Crawler

Latest recommended article published on 2023-02-06 11:12:55

wickedvalley


Source: http://blog.csdn.net/longshengguoji/article/details/9946675

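Scraping images from a web page with Python takes three steps:

1. Fetch the HTML source of the page at the given URL.

2. Use a regular expression to extract the image URLs from that source.

3. Download each image from the extracted URLs.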
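As a small illustration of step 2, the sketch below runs a regular expression over a hard-coded HTML fragment instead of a live page, so it needs no network access. The fragment, the example.com URLs, and the helper name `extract_jpg_urls` are made up for illustration. The pattern is close to the one in this post, but uses `[^"]` in place of `.` so that a match cannot run past the closing quote of a `src` attribute; that tightening is an assumption of this sketch.

```python
import re

# Hypothetical HTML fragment standing in for a downloaded page.
SAMPLE_HTML = (
    '<img src="http://example.com/a.jpg" pic_ext="jpeg">'
    '<img src="http://example.com/logo.png" pic_ext="png">'
    '<img src="http://example.com/b.jpg" pic_ext="jpeg">'
)

def extract_jpg_urls(html):
    """Return every .jpg address found in a src attribute followed by pic_ext."""
    # [^"]+? keeps the capture inside one quoted attribute value.
    pattern = re.compile(r'src="([^"]+?\.jpg)" pic_ext')
    return pattern.findall(html)

urls = extract_jpg_urls(SAMPLE_HTML)
print(urls)
```

Because the capture group covers only the part inside the quotes, `findall` returns the addresses themselves rather than the whole `src="..."` attribute, and the `.png` entry is skipped.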

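 # Written for Python 3: urlopen and urlretrieve live in urllib.request
 import re
 import urllib.request

 def getHtml(url):
     # Fetch the page and decode the bytes so the regex can run on text
     with urllib.request.urlopen(url) as page:
         return page.read().decode('utf-8', errors='replace')

 def getImg(html):
     # Capture .jpg addresses in src attributes that are followed by pic_ext;
     # [^"] keeps a match from running past the closing quote
     reg = r'src="([^"]+?\.jpg)" pic_ext'
     imgre = re.compile(reg)
     imglist = imgre.findall(html)
     # Save the images as 0.jpg, 1.jpg, ... in the current directory
     for x, imgurl in enumerate(imglist):
         urllib.request.urlretrieve(imgurl, '%s.jpg' % x)

 html = getHtml("http://tieba.baidu.com/p/2460150866")
 getImg(html)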
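The script above hands each matched address straight to the download call, which works only if every match is an absolute URL. As a hedged sketch of one way to prepare the list first, the helper below resolves relative addresses against the page URL with `urllib.parse.urljoin` and skips duplicates before assigning numbered file names. The function name `plan_downloads` and the example.com addresses are hypothetical and not part of the original script.

```python
from urllib.parse import urljoin

def plan_downloads(page_url, img_srcs):
    """Return (absolute_url, filename) pairs, skipping duplicate URLs."""
    seen = set()
    plan = []
    for src in img_srcs:
        # Resolve relative addresses such as "/img/a.jpg" against the page URL.
        absolute = urljoin(page_url, src)
        if absolute in seen:
            continue
        seen.add(absolute)
        # Number files in the order they will be downloaded: 0.jpg, 1.jpg, ...
        plan.append((absolute, '%d.jpg' % len(plan)))
    return plan

plan = plan_downloads(
    "http://example.com/p/1",
    ["/img/a.jpg", "http://example.com/img/a.jpg", "b.jpg"],
)
for url, filename in plan:
    print(url, "->", filename)
```

Each pair could then be passed to `urllib.request.urlretrieve(url, filename)`; keeping the planning step separate from the download makes it easy to check without touching the network.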
