>>> importos>>>os.getcwd()'C:\\Python33'
>>> os.chdir('E:\\python\\mmy')>>>os.getcwd()'E:\\python\\mmy'
>>> importurllib.request>>> urllib.request.urlopen('http://image.edai.com/avatar/000/88/14/23_avatar_middle.jpg')
>>> response = urllib.request.urlopen('http://image.edai.com/avatar/000/88/14/23_avatar_middle.jpg')>>>response.getcode()200
>>>response.geturl()'http://image.edai.com/avatar/000/88/14/23_avatar_middle.jpg'
>>>response.info()
>>> print(response.info())
Expires: Thu,31 Dec 2037 23:55:55GMT
Date: Sat,30 Jan 2016 13:18:38GMT
Server: nginx/0.8.42Content-Type: image/jpeg
Content-Length: 8053Last-Modified: Thu, 08 Jan 2015 06:46:11GMT
Pragma: public
Accept-Ranges: bytes
Age:1X-Via: 1.1 scxx84:1(Cdn Cache Server V2.0)
Connection: close
Cache-Control: public, must-revalidate, proxy-revalidate>>> pic =response.read()>>> with open('liuhui.jpg', 'wb') as f:
f.write(pic)8053
>>>
在本地磁盘已经有了liuhui.jpg了!!!
以上代码是在idle下操作的,其实真实的代码只有如下:
importurllib.request
response= urllib.request.urlopen('http://image.edai.com/avatar/000/88/14/23_avatar_middle.jpg')
pic=response.read()
with open('liuhui.jpg', 'wb') as f:
f.write(pic)
注:
1.urlopen()的参数可以是一个字符串或一个request对象,当为一个字符串时,其实是执行了两个步骤:
(1)req = urllib.request.Request('http://image.edai.com/avatar/000/88/14/23_avatar_middle.jpg');
(2)response = urllib.request.urlopen(req).
2.urllib.request.urlopen()的返回值是一个http.client.HTTPResponse对象,即客户端http响应结果对象。我们知道http协议,对客户端请求是需要响应的,而响应的数据包还包含了http头部信息。因此,urllib.request.urlopen()返回的对象还有以下3个重要的方法:
(1)getcode(): Htpp状态码
(2)geturl():客户端请求的url地址
(3)info():就是http的头部信息(header)