python爬虫：获取网上图片

最新推荐文章于 2023-10-07 00:53:31 发布

python小白要逆袭

最新推荐文章于 2023-10-07 00:53:31 发布

阅读量219

点赞数

分类专栏： python 文章标签： python 爬虫正则表达式

本文链接：https://blog.csdn.net/weixin_55447224/article/details/116561355

版权

python 专栏收录该内容

68 篇文章 52 订阅

订阅专栏

import re
import requests

def getHTMLText(url): # H获得url网页对应的html文本
    try:
        r = requests.get(url)
        return r.text
    except:
        return ""

html = getHTMLText("https://image.baidu.com/search/index?tn=baiduimage&ipn=r&ct=201326592&cl=2&lm=-1&st=-1&fm=index&fr=&hs=0&xthttps=111111&sf=1&fmq=&pv=&ic=0&nc=1&z=&se=1&showtab=0&fb=0&width=&height=&face=0&istype=2&ie=utf8&word=猫")
 # 换掉“猫”这个字，就能获得其他搜索结果

pt = '\"objURL\":\"((http://|https://)[^\"]*)\"'
# 正则表达式，用于寻找"objURL":"http://img.25pp.com/uploadfile/soft/images/2015/0225/20150225010456738.jpg"这样的图片地址

i = 0
for x in re.findall(pt,html):
    print(x[0])
    try:
        r = requests.get(x[0], stream=True) #二进制读，适用于url是一个文件
        pos = x[0].rfind(".")  # 处理后缀名
        f = open('c:\\tmp2\\{0}{1}'.format(i, x[0][pos:]), "wb") # 二进制，打开文件
        for chunk in r.iter_content(): #图片内容写入文件
            f.write(chunk)
        f.close()
    except:
        pass
    i = i+1

python小白要逆袭

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
1
评论
python爬虫：获取网上图片

import reimport requestsdef getHTMLText(url): # H获得url网页对应的html文本 try: r = requests.get(url) return r.text except: return ""html = getHTMLText("https://image.baidu.com/search/index?tn=baiduimage&ipn=r&ct=2013265
复制链接

扫一扫