Scrapy: JD.com (京东)

import scrapy
import json


class CatalogSpider(scrapy.Spider):
    name = 'catalog'
    allowed_domains = ['3.cn']
    start_urls = ['https://dc.3.cn/category/get']

    def parse(self, response):
        # The body is decoded as GBK before parsing.
        # json.loads() dropped its `encoding` argument in Python 3.9.
        jd_json = json.loads(response.body.decode('gbk'))
        result = []
        for data in jd_json['data']:
            for data2 in data['s']:
                # Each 'n' field is pipe-delimited: URL first, title second.
                url = data2['n'].split('|')[0]
                title = data2['n'].split('|')[1]
                res1 = {
                    "url": url,
                    "title": title,
                    "child": []
                }
                # res1 is added to result once, after its children are collected.

                for data3 in data2['s']:
                    url2 = data3['n'].split('|')[0]
                    title2 = data3['n'].split('|')[1]
                    res2 = {
                        "url": url2,
                        "title": title2,
                        "child": []
                    }
                    # res2 is added to res1 once, after its children are collected.
                    for data4 in data3['s']:
                        url3 = data4['n'].split('|')[0]
                        title3 = data4['n'].split('|')[1]
                        res2['child'].append({
                            "url": url3,
                            "title": title3
                        })
                    res1["child"].append(res2)
                result.append(res1)
        print(result)
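
The three nested loops above repeat the same step at every level: split `n` into a URL and a title, then descend into `s`. As a minimal sketch, the same tree could be built recursively. The helper names `parse_node` and `build_tree` and the sample payload are hypothetical and made up for illustration; they are not part of the spider or of JD's real response. Unlike the loops above, this version also gives leaf nodes an empty `child` list.

```python
def parse_node(node):
    """Convert one raw category node into {"url", "title", "child"}."""
    # Assumption: 'n' is "url|title|..." and 's' (possibly missing) holds child nodes.
    url, title = node['n'].split('|')[:2]
    return {
        "url": url,
        "title": title,
        "child": [parse_node(child) for child in (node.get('s') or [])],
    }


def build_tree(payload):
    """Build the category tree from the decoded JSON payload."""
    # Entries in 'data' only group the first category level under their 's' key.
    return [parse_node(top) for group in payload['data'] for top in group['s']]


# Hypothetical sample payload in the same shape, for illustration only.
sample = {
    "data": [
        {"s": [
            {"n": "a.example.com|Appliances", "s": [
                {"n": "b.example.com|TVs", "s": [
                    {"n": "c.example.com|OLED"},
                ]},
            ]},
        ]},
    ]
}

tree = build_tree(sample)
```

Inside the spider, `build_tree(jd_json)` would take the place of the nested loops. Each node is appended exactly once, so the result contains no duplicate entries.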
