pandas Basics for Beginners: Group Operations
Demo setup:
Creating a DataFrame
in: create a two-dimensional DataFrame
import pandas as pd
import numpy as np
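df = pd.DataFrame({
    'key1': ['a', 'a', 'b', 'b', 'a'],
    'key2': ['one', 'two', 'one', 'two', 'one'],
    # 5 random integers from 1 to 9 (upper bound 10 is exclusive); values change on every run
    'data1': np.random.randint(1, 10, 5),
    'data2': np.random.randint(1, 10, 5)})
df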
out:
1. Group by a column, then compute the mean
in:
df['data1'].groupby(df['key1']).mean()  # group data1 by key1, then compute the mean of each group
out:
key1
a 4.0
b 6.5
Name: data1, dtype: float64
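Because data1 is filled with random integers, the numbers above will differ from run to run. The following is a minimal reproducible sketch of the same grouping. The fixed values for data1 and data2 are assumptions chosen for illustration (they are not the original random draw), picked so that the result matches the output shown above. It also shows the equivalent and more commonly seen form, which groups the whole DataFrame first and then selects the column.

```python
import pandas as pd

# Fixed values instead of random ones (assumed for illustration only)
df = pd.DataFrame({
    'key1': ['a', 'a', 'b', 'b', 'a'],
    'key2': ['one', 'two', 'one', 'two', 'one'],
    'data1': [3, 5, 6, 7, 4],
    'data2': [1, 2, 3, 4, 5],
})

# Same call as in the text: group the data1 Series by the values of key1
means = df['data1'].groupby(df['key1']).mean()

# Equivalent form: group the DataFrame by the column name, then select data1
means_alt = df.groupby('key1')['data1'].mean()

print(means)
```

Group a collects rows 0, 1 and 4 (3, 5, 4), and group b collects rows 2 and 3 (6, 7), so the means are 4.0 and 6.5. Both spellings should return the same Series; the second is usually easier to read when the grouping key is a column of the same DataFrame.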
2. Group by a custom key, then compute the mean
in:
key=[1