添加列
1)在最后添加
df = pd.DataFrame({'num_legs': [2, 4, 8, 0],
'num_wings': [2, 0, 0, 0],
'num_specimen_seen': [10, 2, 1, 8]},
index=['falcon', 'dog', 'spider', 'fish'])
print(df)
df['rank']=[1,2,3,4]#添加列,添加列的个数要与存在的样本个数一致,样本4个,rank也需要4个
print(df)
输出
num_legs num_wings num_specimen_seen
falcon 2 2 10
dog 4 0 2
spider 8 0 1
fish 0 0 8
num_legs num_wings num_specimen_seen rank
falcon 2 2 10 1
dog 4 0 2 2
spider 8 0 1 3
fish 0 0 8 4
2)在指定位置添加列
infor=df.columns.tolist() #将数据框的列名全部提取出来存放在列表里
print(infor)
infor.insert(1,'rank') # 在列索引为1的位置插入一列,列名为:rank,刚插入时不会有值,整列都是NaN
df=df.reindex(columns=infor) # DataFrame.reindex() 对原行/列索引重新构建索引值
print(df)
df['rank']=[1,2,3,4] #给rank赋值
print (df)
输出:
num_legs num_wings num_specimen_seen
falcon 2 2 10
dog 4 0 2
spider 8 0 1
fish 0 0 8
['num_legs', 'num_wings', 'num_specimen_seen']
num_legs rank num_wings num_specimen_seen
falcon 2 NaN 2 10
dog 4 NaN 0 2
spider 8 NaN 0 1
fish 0 NaN 0 8
num_legs rank num_wings num_specimen_seen
falcon 2 1 2 10
dog 4 2 0 2
spider 8 3 0 1
fish 0 4 0 8