前言
pandas作为数据分析的利器,能够很好的解决我们对于数据的处理需求,同时,还能够满足我们对于excel表的处理
使用
安装pandas
pip install pandas
生成表列数据
方法一
>>> import pandas as pd
>>> df = pd.DataFrame({'Animal': ['Falcon', 'Falcon', 'Parrot', 'Parrot'], 'Max Speed': [380., 370., 24., 26.]})
>>> df
Animal Max Speed
0 Falcon 380.0
1 Falcon 370.0
2 Parrot 24.0
3 Parrot 26.0
>>> df.groupby(['Animal']).mean()
Max Speed
Animal
Falcon 375.0
Parrot 25.0
#####方法二
>>> import pandas as pd
>>> lst = = [['Falcon', 'Falcon', 'Parrot', 'Parrot'], [['Falcon', 'Falcon', 'Parrot', 'Parrot'],
KeyboardInterrupt
>>> lst = [['Falcon', 'Falcon', 'Parrot', 'Parrot'], ['Captive', 'Wild', 'Captive', 'Wild']]
>>> index = pd.MultiIndex.from_arrays(lst, names=('Animal', 'Type'))
>>> df = pd.DataFrame({'Max Speed': [390., 350., 30., 20.]}, index=index)
>>> df
Max Speed
Animal Type
Falcon Captive 390.0
Wild 350.0
Parrot Captive 30.0
Wild 20.0
>>> df.groupby(level=0).mean()
Max Speed
Animal
Falcon 370.0
Parrot 25.0
>>> df.groupby(level="Type").mean()
Max Speed
Type
Captive 210.0
Wild 185.0
结语
详见pandas文档