import pandas as pd
import numpy as np
d = {
'company': ['A', 'B', 'A', 'C', 'C', 'B', 'C', 'A'],
'salary': [8, 15, 10, 15, 15, 28, 30, 15],
'age': [26, 29, 26, 30, 30, 30, 30, 35]
}
df = pd.DataFrame(data=d)
print(df, end='\n\n')
res1 = df.groupby(['salary']).count()
res2 = df.groupby(['salary']).sum()
grp = list(df.groupby(['salary']))
print(grp, end='\n\n')
print(res1, end='\n\n')
print(res2, end='\n\n')
groupby是将选定那一列的相同元素拿出来放一块
并且放到一个一个的元组中,就像这个例子中
对salary进行groupby
[((8,), company salary age
0 A 8 26),
((10,), company salary age
2 A 10 26),
((15,), company salary age
1 B 15 29
3 C 15 30
4 C 15 30
7 A 15 35),
((28,), company salary age
5 B 28 30),
((30,), company salary age
6 C 30 30)]
可以转成一个list格式,是一个元组列表
每个元组是一个元组+dataframe的格式