自学Python之小爬虫实例

最新推荐文章于 2024-10-08 16:37:40 发布

炫彩灵感

最新推荐文章于 2024-10-08 16:37:40 发布

阅读量1.5k

点赞数 1

分类专栏： Python

本文链接：https://blog.csdn.net/xuancailinggan/article/details/50448689

版权

Python 专栏收录该内容

14 篇文章 0 订阅

订阅专栏

学了两天Python，总要做点什么吧，那就来个小爬虫。

2.x版本和3.x的版本是不同的，我这里采用的是3.5版。

以下代码是爬取贴吧某个页面的全部jpg图片

代码：

import urllib.request
import re
response = urllib.request.urlopen("http://tieba.baidu.com/p/3646792267?fr=ala0&pstaala=2&tpl=5")
html = response.read()
a='src="(.*?\.jpg)"'
c=re.findall(a,html).decode("utf-8")
s=0
for i in c:
    urllib.request.urlretrieve(i,"%s.png" % s)
    s=s+1

效果：