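The original dataset only contains yield, temperature, and precipitation data for the cities of Shandong from 2000 to 2016. Data from the Shandong Provincial Bureau of Statistics (http://tjj.shandong.gov.cn/tjnj/nj2020/zk/indexce.htm) was used to extend it through 2019.
Code for scraping the data and inserting it into the original dataset: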
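As a small illustration of the update described above, the years to append can be derived from the years already present rather than typed by hand. This is only a sketch: `missing_years` is a hypothetical helper that does not appear in the original code, and it assumes the existing years form one continuous range.

```python
def missing_years(existing_years, last_year):
    """Return the years after the newest existing one, up to last_year inclusive.

    Hypothetical helper for illustration; not part of the original script.
    """
    start = max(existing_years) + 1
    return list(range(start, last_year + 1))


# The original dataset covers 2000-2016; the target is to reach 2019.
existing = range(2000, 2017)
years_to_add = missing_years(existing, 2019)
print(years_to_add)  # [2017, 2018, 2019]
```

The result matches the hard-coded year list used in the script, so for a one-off update the literal list is just as reasonable.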
import pandas
import re
import os
import pandas as pd
import numpy as np
from numpy import nan as NaN
import xlrd
import xlwt
import json
from xlutils.copy import copy
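# Directory holding the per-city workbooks (yield, temperature, precipitation).
# A raw string keeps the backslashes in the Windows path from being read as escapes.
path = r'D:\项目实训--农产品\农产品数据\农作物产量气温降水/'
files = os.listdir(path)
# nian ("year"): the years to append to the original 2000-2016 data.
nian = [2017, 2018, 2019]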
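for t in range(len(files)):
    # Build the workbook name from the first 9 characters of the file name plus 'xls',
    # and open it from the data directory rather than the current working directory.
    workbook = xlrd.open_workbook(os.path.join(path, files[t][0:9] + 'xls'))
    # xlrd workbooks are read-only; xlutils.copy returns a writable xlwt copy.
    newbook = copy(workbook)
    newsheet = newbook.get_sheet(0)   # first worksheet
    newsheet2 = newbook.get_sheet(1)  # second worksheet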