用BeautifulSoup爬取豆瓣妹子的图片

最新推荐文章于 2023-10-31 11:39:14 发布

江南剑雨

最新推荐文章于 2023-10-31 11:39:14 发布

阅读量1.8k

点赞数

分类专栏： python 文章标签： python Beautifuls

本文链接：https://blog.csdn.net/u011533425/article/details/44537379

版权

python 专栏收录该内容

24 篇文章 1 订阅

订阅专栏

用BeautifulSoup处理html文件

#!/usr/bin/env python
# coding=utf-8
import urllib2
import urllib
from bs4 import BeautifulSoup 
import re
def getContent(url):
    content = urllib2.urlopen(url).read()
    soup=BeautifulSoup(content)
    global siteUrls
    siteUrls = soup.findAll('li',attrs={'class':'span3'})
    for i in siteUrls:
        file=i.findAll('img')   
        for t in file:
            id=t.get('data-id')
            name=t.get('data-src')
            imgpath='H:\python_learn\photo/%s.jpg' % id
            urllib.urlretrieve(name,imgpath)      
for i in xrange(1,7):
    url='http://www.dbmeizi.com/?p=%s' % i
    getContent(url)