Scraping Bing homepage background images with Python

Latest recommended article published on 2020-04-25 11:04:41

weixin_30606461

Tags: python

Original link: http://www.cnblogs.com/camilla/p/7144768.html

Original Python 2 version:

#!/usr/bin/env python
# -*- coding:utf-8 -*-
# -*- author:nancy -*-
# Python 2: fetch all Bing homepage background images
import urllib, re, sys, os

def get_bing_backphoto():
    # Create the output directory on first run
    if not os.path.exists('photos'):
        os.mkdir('photos')

    for i in range(0, 1000):
        # Request one image (n=1) at index i; the parentheses let the
        # expression continue on the next line
        url = ('http://cn.bing.com/HPImageArchive.aspx?format=js&idx=' + str(i)
               + '&n=1&nc=1361089515117&FORM=HYLH1')
        html = urllib.urlopen(url).read()
        if html == 'null':
            print 'open & read bing error!'
            sys.exit(-1)
        reg = re.compile('"url":"(.*?)","urlbase"', re.S)
        text = re.findall(reg, html)
        # Example match: http://s.cn.bing.net/az/hprichbg/rb/LongJi_ZH-CN8658435963_1366x768.jpg
        for imgurl in text:
            # Everything after the last '/' is the file name
            right = imgurl.rindex('/')
            name = imgurl.replace(imgurl[:right+1], '')
            savepath = 'photos/' + name
            urllib.urlretrieve(imgurl, savepath)
            print name + ' save success!'

get_bing_backphoto()

Errors hit when moving from Python 2 to Python 3, and how to fix them:

TypeError: can't use a string pattern on a bytes-like object

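Cause: in Python 3, urlopen(...).read() returns bytes, while the regular expression pattern is a str, and re.findall cannot apply a str pattern to bytes. Add html = html.decode('utf-8') before the regular expression is used.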
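
The error and the fix can be reproduced without touching the network. This is only a sketch: the byte string below is a made-up stand-in for the response body, not real Bing output.

```python
import re

# Hypothetical response body; in Python 3, urlopen(...).read() returns bytes like this.
raw = b'{"url":"http://example.com/a.jpg","urlbase":"/a"}'
pattern = re.compile('"url":"(.*?)","urlbase"', re.S)

error_message = None
try:
    pattern.findall(raw)  # str pattern applied to bytes raises TypeError
except TypeError as exc:
    error_message = str(exc)

# Decoding first gives the regex a str to search.
urls = pattern.findall(raw.decode('utf-8'))

print(error_message)
print(urls)
```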

AttributeError: 'module' object has no attribute 'urlopen'

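Cause: the urllib module was reorganized in Python 3. urlopen and urlretrieve now live in urllib.request, so every urllib.urlopen and urllib.urlretrieve call here should become urllib.request.urlopen and urllib.request.urlretrieve.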
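
If the same script has to run under both versions, one common pattern is a guarded import. This is a sketch of that pattern, not something the original post does:

```python
try:
    # Python 3: urlopen and urlretrieve live in urllib.request
    from urllib.request import urlopen, urlretrieve
except ImportError:
    # Python 2: the same functions sit directly on the urllib module
    from urllib import urlopen, urlretrieve

# From here on, call urlopen(...) and urlretrieve(...) without a module prefix.
print(urlopen.__name__, urlretrieve.__name__)
```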

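The JSON returned by Bing's public image endpoint has also changed format (judging by the replacement in the code below, the url field is now a relative path starting with /az/), and the standard-library import path is different, so the code is adjusted as follows: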

#!/usr/bin/env python
# -*- coding:utf-8 -*-
# -*- author:nancy-*-
# Python 3: fetch all Bing homepage background images
import urllib.request, re, sys, os

def get_bing_backphoto():
    # Create the output directory on first run
    if not os.path.exists('photos'):
        os.mkdir('photos')

    for i in range(0, 10):
        url = 'http://cn.bing.com/HPImageArchive.aspx?format=js&idx=' + str(i) + '&n=1&nc=1361089515117&FORM=HYLH1'
        # read() returns bytes in Python 3, so decode before comparing with a str
        # or running the regex
        html = urllib.request.urlopen(url).read().decode('utf-8')
        if html == 'null':
            print('open & read bing error!')
            sys.exit(-1)
        # Turn the relative /az/ paths in the JSON into absolute URLs
        html = html.replace('/az/', 'http://s.cn.bing.net/az/')
        reg = re.compile('"url":"(.*?)","urlbase"', re.S)
        text = re.findall(reg, html)
        for imgurl in text:
            # Everything after the last '/' is the file name
            right = imgurl.rindex('/')
            print(imgurl)
            name = imgurl.replace(imgurl[:right+1], '')
            savepath = 'photos/' + name
            urllib.request.urlretrieve(imgurl, savepath)
            print(name + ' save success!')

get_bing_backphoto()
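
Since the endpoint is called with format=js and returns JSON, the regex and the string replacement could be swapped for the json module and urljoin. The sketch below is an alternative, not the original author's method. It assumes the response is an object with an "images" list whose entries carry a "url" field; the helper name extract_image_urls and the sample payload are hypothetical, and the host is taken from the replacement string used above.

```python
import json
from urllib.parse import urljoin

# Host taken from the replace() call in the script above.
BASE = 'http://s.cn.bing.net'

def extract_image_urls(payload, base=BASE):
    """Return absolute image URLs from an HPImageArchive-style JSON string."""
    # A literal 'null' body parses to None, so fall back to an empty dict.
    data = json.loads(payload) or {}
    # Assumed shape: {"images": [{"url": "..."}, ...]}
    return [urljoin(base, item['url']) for item in data.get('images', [])]

# Made-up sample payload for illustration, not real Bing output.
sample = '{"images":[{"url":"/az/hprichbg/rb/Demo_1366x768.jpg","urlbase":"/az/hprichbg/rb/Demo"}]}'
print(extract_image_urls(sample))
```

urljoin leaves already-absolute URLs untouched, so the same helper would cope with both the older absolute url values and the newer relative ones.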

Reposted from: https://www.cnblogs.com/camilla/p/7144768.html
