简单爬取百度贴吧中的美图

最新推荐文章于 2024-10-28 11:00:05 发布

apodxx

最新推荐文章于 2024-10-28 11:00:05 发布

阅读量334

点赞数 1

分类专栏： python 文章标签： python 爬虫

本文链接：https://blog.csdn.net/qq_35190534/article/details/62237898

版权

python 专栏收录该内容

1 篇文章 0 订阅

订阅专栏

最近再看贴吧的壁纸吧的时候发现了好多自己喜欢的壁纸,
这里写图片描述
2.右键图片,选择检查,可以看到我们要爬取的图片的class都为:’BDE_Image’,如图:

3接下来便可以调用beautifulsoup类进行匹配了,爬取了
附上代码:

#coding:utf-8
import urllib
from bs4 import BeautifulSoup//导入beautiful包

url='https://tieba.baidu.com/p/4645322258'//我们要爬取的网页
webpage=urllib.urlopen(url);
webcontent=webpage.read();//读出html代码
soup = BeautifulSoup(webcontent,'lxml')
imgTags=soup.find_all('img',{'class':'BDE_Image'})//我们爬取的网页的类
x=0;
for imgtag in imgTags:
    pic_url=imgtag['src']
    urllib.urlretrieve(pic_url,'d:\\crawler\\%s.jpg' % x)//下载图片到D盘clawler文件夹中
    x+=1

print soup.title

4.结果:
这里写图片描述