Scrapy指定顺序输出 -《狗嗨默示录》-

items.py

import scrapy

class CollectipsItem(scrapy.Item):

    IP = scrapy.Field()
    PORT = scrapy.Field()
    POSITION = scrapy.Field()
    TYPE = scrapy.Field()
    SPEED = scrapy.Field()
    CONNECT_TIME = scrapy.Field()
    SURVIVE_TIME = scrapy.Field()
    LAST_CHECK_TIME = scrapy.Field()

(1)在spiders中增加文件csv_item_exporter.py

from scrapy.conf import settings
from scrapy.contrib.exporter import CsvItemExporter

class MyProjectCsvItemExporter(CsvItemExporter):

    def __init__(self, *args, **kwargs):
        delimiter = settings.get('CSV_DELIMITER', ',')
        kwargs['delimiter'] = delimiter

        fields_to_export = settings.get('FIELDS_TO_EXPORT', [])
        if fields_to_export :
            kwargs['fields_to_export'] = fields_to_export

        super(MyProjectCsvItemExporter, self).__init__(*args, **kwargs)

(2)在settings.py中配置

FEED_EXPORTERS = {
    'csv': 'CollectIPs.spiders.csv_item_exporter.MyProjectCsvItemExporter',
} #CollectIPs为工程名

FIELDS_TO_EXPORT = [
    'IP',
    'PORT',
    'POSITION',
    'TYPE',
    'SPEED',
    'CONNECT_TIME',
    'SURVIVE_TIME',
    'LAST_CHECK_TIME'
]

在settings.py中也可以指定csv文件中的分隔符

CSV_DELIMITER = "\t"

 

转载于:https://www.cnblogs.com/LiGoHi/p/7398320.html

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值