【Python】使用Pandas进行Excel多表外连接(outer join)

源代码

import pandas as pd
#读入.xlsx类型Excel文件
df1= pd.read_excel('C:/Users/Kinglake/Desktop/1.xlsx')
df2 = pd.read_excel('C:/Users/Kinglake/Desktop/2.xlsx')
#读入.csv类型Excel文件
df3 = pd.read_csv('C:/Users/Kinglake/Desktop/3.csv')

#多表连接并导出excel(ignore_index=True使索引不会混乱)(.fillna(0)用0替换所有空单元格)
pd.concat([df1,df2,df3,df4,df5,df6],join='outer',ignore_index=True).fillna(0).to_excel("C:/Users/Kinglake/Desktop/output.xlsx")

节省内存的尝试

#节省内存的尝试
import pandas as pd
df1= pd.read_csv('/home/ubuntu/Downloads/Thecountry-eachlevel/Thecountry-level-6/The-country01-level-6.csv')
df2 = pd.read_csv('/home/ubuntu/Downloads/Thecountry-eachlevel/Thecountry-level-6/The-country02-level-6.csv')
temp1=pd.concat([df1,df2],join='outer',ignore_index=True)
del df1,dl2
df3 = pd.read_csv('/home/ubuntu/Downloads/Thecountry-eachlevel/Thecountry-level-6/The-country03-level-6.csv')
df4 = pd.read_csv('/home/ubuntu/Downloads/Thecountry-eachlevel/Thecountry-level-6/Thecountry04-level-6.csv')
temp2=pd.concat([df3,df4],join='outer',ignore_index=True)
del df3,dl4
df5 = pd.read_csv('/home/ubuntu/Downloads/Thecountry-eachlevel/Thecountry-level-6/Thecountry05-level-6.csv')
df6 = pd.read_csv('/home/ubuntu/Downloads/Thecountry-eachlevel/Thecountry-level-6/Thecountry06-level-6.csv')
temp3=pd.concat([df5,df6],join='outer',ignore_index=True)
del df5,dl6
temp4=pd.concat([temp1,temp2],join='outer',ignore_index=True)
del temp1,temp2
pd.concat([temp3,temp4],join='outer',ignore_index=True).fillna(0).to_excel("/home/ubuntu/Downloads/Thecountry-eachlevel/Thecountry-level-6/output6.xlsx")
  • 0
    点赞
  • 4
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值