gif爬虫

最新推荐文章于 2022-07-19 19:10:19 发布

zl_1628563296@qq.com

最新推荐文章于 2022-07-19 19:10:19 发布

阅读量141

点赞数

分类专栏： python编程

本文链接：https://blog.csdn.net/qq_29684215/article/details/105118040

版权

python编程专栏收录该内容

12 篇文章 0 订阅

订阅专栏

本人使用python版本为3.6.5

python3中内置urllib模块

python2中内置urllib2模块

#coding:utf-8
import urllib.request
import re


# 将正则表达式编译成Pattern对象
rex=r'src="(https://.*?\.gif)"';
pages = ('1','2');
x=1;
#输入您要爬的网址
pageurl=input()
for page in pages:
    #pageurl = "http://***********.com/default_%s.html" % page;
    Response=urllib.request.urlopen(pageurl);
    print(Response)
    Html=Response.read();
    print(Html.decode('utf-8','ignore'))
    lists = re.findall(rex, Html.decode('utf-8','ignore'));
    #print(lists)
    lensofpage=len(lists);
    print (lensofpage)
    
    picname = 'page' + page; 
    print (picname)
    
    for picurl in lists:
        #设置存储路径
        urllib.request.urlretrieve(picurl,r'C:\Users\hipeson\Desktop\pic\%s.gif' %x);
        print (page+picurl)
        x=x+1;

   
   
print ('DownLoadPicOver')

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

zl_1628563296@qq.com

关注关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
gif爬虫

本人使用python版本为3.6.5python3中内置urllib模块python2中内置urllib2模块#coding:utf-8import urllib.requestimport re# 将正则表达式编译成Pattern对象rex=r'src="(https://.*?\.gif)"';pages = ('1','2');x=1;#输入您要爬的网址pa...
复制链接

扫一扫