怎样用Python 写一个爬图片的程序?

最新推荐文章于 2023-10-22 12:00:00 发布

易枭寒

最新推荐文章于 2023-10-22 12:00:00 发布

阅读量7.7k

点赞数 1

分类专栏： Python 文章标签： Python spider

本文链接：https://blog.csdn.net/xiaowanggedege/article/details/8650034

版权

Python 专栏收录该内容

30 篇文章 0 订阅

订阅专栏

在V2EX上看到一个问题"怎样用Python 写一个爬图片的程序?"http://www.v2ex.com/t/61686#reply18点击打开链接

需求:我很喜欢杉本有美大妹子,想从贴吧抓取她的全部写真照,这样就不用一张张右键,我该怎么一步步做呢?

http://tieba.baidu.com/p/2166231880

Ubuntu12.04系统:

1.安装:sudo easy_install-2.7 -U pip
2.安装:sudo pip-2.7 install requests
3.安装:sudo easy_install lxml
  第3步会报错:
    ERROR: /bin/sh: 1: xslt-config: not found
    ** make sure the development packages of libxml2 and libxslt are installed **
  解决方法:google search "ubuntu install python lxml"
     安装:sudo apt-get install python-lxml

代码:https://gist.github.com/anonymous/5066411点击打开链接

downloadimages.py

import requests
import lxml.html
 
page = requests.get('http://tieba.baidu.com/p/2166231880').text
doc = lxml.html.document_fromstring(page)
for idx, el in enumerate(doc.cssselect('img.BDE_Image')):
    with open('%03d.jpg' % idx, 'wb') as f:
        f.write(requests.get(el.attrib['src']).content)

执行:

python downloadimages.py

去目录里看看妹子图片吧....

------------------------------------------------------------------------------

Windows下运行该代码：

1、安装相关包（注意版本号）：cmd 下 easy_install lxml==2.3

2、Windows二进制安装（也可以采用此方法）：http://www.lfd.uci.edu/~gohlke/pythonlibs/ 点击打开

链接

3、运行 downloadimages.py，图片就下载下来了。-:)

----------------------------------------------------------------------------------

使用urllib和re库也可以解决问题：

代码如下：

#! /usr/bin/env python
#coding:utf-8

import urllib,re

def get_html(url):
    page = urllib.urlopen(url)
    html = page.read()
    return html

def get_img(html):
    reg = r'src="(.*?\.jpg)" bdwater='
    imgre = re.compile(reg)
    imglist = re.findall(imgre, html)
    i = 0
    for imgurl in imglist:
        urllib.urlretrieve(imgurl, '%s.jpg'%i)
        i+=1
html = get_html('http://tieba.baidu.com/p/2166231880')
print get_img(html)

易枭寒

关注

1
点赞
踩
0

收藏

觉得还不错? 一键收藏
1
评论
怎样用Python 写一个爬图片的程序?

在V2EX上看到一个问题"怎样用Python 写一个爬图片的程序?"http://www.v2ex.com/t/61686#reply18点击打开链接需求:我很喜欢杉本有美大妹子,想从贴吧抓取她的全部写真照,这样就不用一张张右键,我该怎么一步步做呢?http://tieba.baidu.com/p/2166231880Ubuntu12.04系统:1.安装:sudo ea
复制链接

扫一扫