Image Scraping

神行影

Category: Crawlers. Tags: python

Article link: https://blog.csdn.net/qq_40779250/article/details/105867725

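Notes
Scraping every image on one listing page. What gets downloaded is not the 4K version; the 4K originals require a membership.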
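import os

import requests  # HTTP library
from lxml import etree  # third-party library for extracting data from HTML

url = 'http://pic.netbian.com/4kdongwu/'  # listing page to scrape
count = 1
headers = {
    'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/81.0.4044.129 Safari/537.36'
}  # request headers

os.makedirs('./img', exist_ok=True)  # make sure the output directory exists

response = requests.get(url, headers=headers).content.decode('gbk')  # send the request and decode the body as GBK

html = etree.HTML(response)
clearfix = html.xpath('//div/ul[@class="clearfix"]/li/a/img/@src')  # src of every thumbnail in the list

for i in clearfix:
    ID = i[16:-4]  # slice out the ID; assumes src looks like '/uploads/allimg/<ID>.jpg'
    urls = 'http://pic.netbian.com/uploads/allimg/' + ID + '.jpg'  # image URL
    img_response = requests.get(urls, headers=headers)
    # 'wb': write in binary mode, overwriting; the original 'ab' appended to existing files on a rerun
    with open('./img/{}.jpg'.format(count), 'wb') as f:
        f.write(img_response.content)
    count += 1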
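
The slice `i[16:-4]` only works if every `src` starts with a 16-character prefix and ends in `.jpg`. A minimal sketch of an alternative, assuming the `src` values are paths relative to the site root: resolve them with the standard library's `urljoin` and name each file after the last path segment. The helper names `build_image_url` and `local_filename` and the sample `src` value are made up for illustration; they are not part of the original script or taken from the real site.

```python
import posixpath
from urllib.parse import urljoin, urlparse

BASE_URL = 'http://pic.netbian.com/4kdongwu/'  # listing page from the script above


def build_image_url(src, base=BASE_URL):
    """Resolve an <img> src value against the page URL (hypothetical helper)."""
    return urljoin(base, src)


def local_filename(image_url):
    """Use the last path segment of the URL as the file name (hypothetical helper)."""
    return posixpath.basename(urlparse(image_url).path)


# Illustrative src value, not taken from the real site.
sample_src = '/uploads/allimg/example/sample-001.jpg'
full_url = build_image_url(sample_src)
print(full_url)
print(local_filename(full_url))
```

With this approach the loop would not depend on the prefix length or the file extension, and the saved files keep their original names instead of a running counter.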
