df0 = pd.DataFrame(np.ones((3,4))*0,columns=['a','b','c','d'])
df1 = pd.DataFrame(np.ones((3,4))*1,columns=['a','b','c','d'])
df2 = pd.DataFrame(np.ones((3,4))*2,columns=['a','b','c','d'])
print(df0)
print(df1)
print(df2)
上下合并DataFrame
res = pd.concat([df0,df1,df2],axis=0)
# res1 = pd.concat([df0,df1,df2],axis=0,ignore_index=True)
print(res)
此时我们发现合并后的DataFrame的index还是由各个DataFrame的index直接合并,而没有生成新的index
可通过参数ignore_index=True生成新的index
res = pd.concat([df0,df1,df2],axis=0,ignore_index=True)
两种jion模式['inner','outer']
df0 = pd.DataFrame(np.ones((3,4))*0,columns=['a','b','c','d'],index=[1,2,3])
df1 = pd.DataFrame(np.ones((3,4))*1,columns=['b','c','d','e'],index=[2,3,4])
print(df0)
print(df1)
res = pd.concat([df0,df1],join='outer')
print(res)
'inner'模式会对数据进行裁剪,寻找两个 DataFrame相同的部分然后合并
res = pd.concat([df0,df1],ignore_index=True,join='inner')
这里可以看到df0的columns = ‘a’和df的columns=‘e’被丢弃了
join_axes
res = pd.concat([df0,df1],axis=1,join_axes=[df0.index]) #axis=1 左右合并
join_axes=[df0.index]表示只考虑df0.index
res = pd.concat([df0,df1],axis=1)
没有join_axes=[df0.index],df0、df1的index都kaol
append
默认往DataFrame一行行向下的添加数据
往df0里添加df1:
res = df0.append(df1,ignore_index=True)
print(res)
在DataFrame里添加多个DataFrame
df0 = pd.DataFrame(np.ones((3,4))*0,columns=['a','b','c','d'])
df1 = pd.DataFrame(np.ones((3,4))*1,columns=['a','b','c','d'])
df2 = pd.DataFrame(np.ones((3,4))*2,columns=['a','b','c','d'])
res = df0.append([df1,df2],ignore_index=True) # 以list的形式[df1,df2]即可
print(res)
添加Series
df0 = pd.DataFrame(np.ones((3,4))*0,columns=['a','b','c','d'])
s1 = pd.Series([1,2,3,4],index=['a','b','c','d'])
res = df0.append(s1,ignore_index=True)
print(res)