Original: Government Work Report
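# -*- coding: utf-8 -*-
"""
Spyder Editor

This is a temporary script file.
"""
#
# 1. Collect the government work reports (annual reports for 2014-2021) by web scraping,
# then store each year's report data, clean it, run Chinese word segmentation, compute word associations, and visualize the results,
# and also report statistics on how the focus of the reports shifts over time.
# Import the required packages
import requests
from lxml import etree
import jieba
from wordcl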
2022-01-01 12:49:52 132
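The last step named in the script's header comment, tracking how the focus of the reports shifts over time, could be sketched roughly as follows. This is not the author's code, since the excerpt is cut off before that point. It assumes each year's report has already been segmented into a list of words (for example with `jieba.lcut`). The function name `top_terms_by_year` and the sample tokens are hypothetical.

```python
from collections import Counter


def top_terms_by_year(tokens_by_year, n=3, min_len=2):
    """Return the n most frequent words for each year.

    tokens_by_year: dict mapping a year to a list of already-segmented words
    (hypothetical input; the original script presumably gets these from jieba).
    Words shorter than min_len characters are skipped, a common way to drop
    single-character particles in Chinese text.
    """
    result = {}
    for year, tokens in sorted(tokens_by_year.items()):
        counts = Counter(t for t in tokens if len(t) >= min_len)
        result[year] = counts.most_common(n)
    return result


# Made-up sample tokens, for illustration only.
sample = {
    2014: ["改革", "发展", "改革", "的", "经济", "改革", "发展"],
    2021: ["创新", "发展", "创新", "就业", "的", "创新", "发展"],
}

for year, terms in top_terms_by_year(sample, n=2).items():
    print(year, terms)
```

Comparing the top terms year by year gives a simple view of which topics gain or lose weight. A real analysis would probably also need a stop-word list.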
Original: pipeline
# Define your item pipelines here
#
# Don't forget to add your pipeline to the ITEM_PIPELINES setting
# See: https://docs.scrapy.org/en/latest/topics/item-pipeline.html

# useful for handling different item types with a single interface
from itemadapter i
2022-01-01 12:49:01 462
Original: item code
import scrapy

class ZjzxItem(scrapy.Item):
    # define the fields for your item here like:
    shujuriqi = scrapy.Field()
    gnsczzjde = scrapy.Field()
    gnsczztbzj = scrapy.Field()
    dycyjde = scrapy.Field()
    dycytbzj = scrapy.Field()
    decyjde...
2022-01-01 12:47:02 1141
Original: settings
# Scrapy settings for zjzx project
#
# For simplicity, this file contains only settings considered important or
# commonly used. You can find more settings consulting the documentation:
#
# https://docs.scrapy.org/en/latest/topics/settings.html
# h...
2022-01-01 12:46:21 226
Original: spider zjzx.py
import scrapy
import re
import requests
from ..items import ZjzxItem

def getURL(No):
    for i in range(No):
        url = "http://finance.stockstar.com/finance/macrodata/gdplist.aspx?page={}&order=1&by=1".format(i)
        yield url

class Zjzx1Spide...
2022-01-01 12:45:04 273
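The `getURL` helper in the excerpt starts its page numbers at 0, because `range(No)` yields 0 through `No - 1`. The excerpt does not show whether the site's `page` parameter is 0-based or 1-based. The sketch below assumes 1-based pages and makes the starting page explicit. The names `page_urls` and `BASE_URL` are hypothetical.

```python
# URL template taken from the excerpt; only the page number varies.
BASE_URL = "http://finance.stockstar.com/finance/macrodata/gdplist.aspx?page={}&order=1&by=1"


def page_urls(page_count, first_page=1):
    """Yield one listing URL per page, starting at first_page (assumed to be 1)."""
    for page in range(first_page, first_page + page_count):
        yield BASE_URL.format(page)


urls = list(page_urls(3))
print(urls[0])
```

If the site does count pages from 0, passing `first_page=0` gives the same URLs as the original helper.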