Garbled Chinese Text When Exporting JSON from Scrapy
Problem description
The spider's results are exported with the following Scrapy command:
scrapy crawl hotel -o hotel.json
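A note on what the "garbled" text usually is: when no feed export encoding is configured, Scrapy's JSON exporter writes non-ASCII characters as \uXXXX escape sequences, so Chinese text in hotel.json appears as escapes instead of readable characters. The sketch below reproduces that effect with Python's standard json module only. It assumes the garbling seen here is this escaping; the sample item and the city name 北京 are made up for illustration and are not taken from the actual crawl.

```python
import json

# Hypothetical item, standing in for one record yielded by the spider.
item = {"name": "北京"}

# ensure_ascii=True (the json module default) escapes every non-ASCII character.
escaped = json.dumps(item)

# ensure_ascii=False keeps the characters as they are.
readable = json.dumps(item, ensure_ascii=False)

print(escaped)   # {"name": "\u5317\u4eac"}
print(readable)  # {"name": "北京"}

# Both strings decode to the same data; only the on-disk representation differs.
assert json.loads(escaped) == json.loads(readable)
```

The escaped form is still valid JSON and loads back to the original characters, so the data is not corrupted; it is only hard to read in a text editor.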
The source code is as follows (hotel.py):
# -*- coding: utf-8 -*-
import scrapy


class HotelSpider(scrapy.Spider):
    name = 'hotel'
    allowed_domains = ['bnb.qunar.com']
    start_urls = ['http://bnb.qunar.com/hotcity.jsp']

    def parse(self, response):
        # Select every <li> entry in the city list.
        cities = response.css('div.e_city_list dl.e_letter_order dd ul.e_city_name li')
        for city in cities:
            # Fall back to placeholder strings when the link or its text is missing.
            url = city.css('li a::attr(href)').get('no_url')
            name = city.css('li a::text').get('no_city')
            yield<