一个爬取lativ诚衣网站上模特穿搭图片的爬虫

  show the code:

[peter@localhost savvy]$ vi lativ.py
# -*- coding:utf-8 -*-
import requests,lxml,os
from bs4 import BeautifulSoup as sb

def get_html():
        url = 'https://www.lativ.com/Style'
        headers = {'User-Agent':'Mozilla/5.0 (Linux; Android 6.0; Nexus 5 Build/MRA58N) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/46.0.2490.76 Mobile Safari/537.36'}
        html = requests.post(url,headers).text
        return html

def soup_html(html):
        soup = sb(html, 'lxml')
        a = soup.find_all('a')[12:190]
        return a

def save_img(a):
        for i in a:
                l = i.get('href')
                print l
                j = l[-14:-9]
                with open(str(j)+'.jpg','wrb') as f:
                        img = requests.get(l)
                        f.write(img.content)
                        print str(j)+'saved'

if __name__=='__main__':
        html = get_html()
        a = soup_html(html)
        save_img(a)

 

转载于:https://www.cnblogs.com/peter1994/p/7301840.html

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值