In [1]: import pandas as pd
In [2]: import numpy as np
In [3]: df1 = pd.DataFrame(np.ones((2, 4)), columns=list("abcd"), index=list("AB"))
In [4]: df1
Out[4]:
     a    b    c    d
A  1.0  1.0  1.0  1.0
B  1.0  1.0  1.0  1.0
In [5]: df2 = pd.DataFrame(np.zeros((3,3)), columns=list("xyz"), index=list("ABC"))
In [6]: df2
Out[6]:
     x    y    z
A  0.0  0.0  0.0
B  0.0  0.0  0.0
C  0.0  0.0  0.0
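join merges DataFrames by aligning them on their row index. By default it keeps the rows of the calling DataFrame and fills cells with no match with NaN.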
In [7]: df1.join(df2)
Out[7]:
     a    b    c    d    x    y    z
A  1.0  1.0  1.0  1.0  0.0  0.0  0.0
B  1.0  1.0  1.0  1.0  0.0  0.0  0.0
In [8]: df2.join(df1)
Out[8]:
     x    y    z    a    b    c    d
A  0.0  0.0  0.0  1.0  1.0  1.0  1.0
B  0.0  0.0  0.0  1.0  1.0  1.0  1.0
C  0.0  0.0  0.0  NaN  NaN  NaN  NaN
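The two results above differ because join defaults to a left join: the result keeps the index of the DataFrame that join is called on. The sketch below rebuilds the same df1 and df2 and shows how the `how` parameter changes which row labels survive. The variable names `left`, `outer`, and `inner` are only illustrative.

```python
import numpy as np
import pandas as pd

df1 = pd.DataFrame(np.ones((2, 4)), columns=list("abcd"), index=list("AB"))
df2 = pd.DataFrame(np.zeros((3, 3)), columns=list("xyz"), index=list("ABC"))

# Default (how="left"): keep the caller's row labels only.
left = df1.join(df2)

# how="outer": keep the union of both indexes; missing cells become NaN.
outer = df1.join(df2, how="outer")

# how="inner": keep only the labels present in both DataFrames.
inner = df2.join(df1, how="inner")

print(list(left.index))   # row labels of the left join
print(list(outer.index))  # row labels of the outer join
print(list(inner.index))  # row labels of the inner join
```

With `how="outer"`, row C appears even though df1 has no such label, so its a, b, c, d cells are NaN.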
merge: combines data on the specified column(s), using a chosen join method such as inner, outer, left, or right.
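A minimal sketch of merge on a shared column, separate from the session's own data. The frames `left` and `right` and the columns `key`, `lval`, and `rval` are hypothetical and chosen only to make the matching rows easy to see.

```python
import pandas as pd

# Two small hypothetical frames that share a "key" column.
left = pd.DataFrame({"key": ["k1", "k2", "k3"], "lval": [1, 2, 3]})
right = pd.DataFrame({"key": ["k2", "k3", "k4"], "rval": [20, 30, 40]})

# Default how="inner": keep only keys found in both frames.
inner = pd.merge(left, right, on="key")

# how="outer": keep every key from either frame, NaN where there is no match.
outer = pd.merge(left, right, on="key", how="outer")

# how="left": keep every row of the left frame.
left_join = left.merge(right, on="key", how="left")

print(inner)
print(outer)
print(left_join)
```

Unlike join, which aligns on the row index, merge here matches rows by the values in the `key` column, so the row index of the inputs plays no part in the result.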
In [41]: df3 = pd.DataFrame(np.arange(9).reshape(3,3), columns=list("fax"