//交互模式下
import re
import urllib.request //python3
req = urllib.request.urlopen('www.baidu.con')
buf = req.read()
buf = bur.decode('utf-8') //python3
list = re.findall(r"http:.+\.jpg", buf)
i = 0
for url in list:
f = open(str(i)+'.jpg', 'w')
req = rullib.request.urlopen(url)
buf = req.read()
f.write(buf)
i+=1
ls
代码中有注释部分均为Python3 学爬虫时需要注意的
1.TypeError: cannot use a string pattern on a bytes-like object 这种情况解决方法就是加上
buf = bur.decode('utf-8')
2.AttributeError: module ‘urllib’ has no attribute ‘urlopen’这种情况的解决办法就是将urllib改成urllib.request就行了