pandas plotting and statistics: groupby in pandas

慕容狐

Tags: pandas plotting and statistics

Article link: https://blog.csdn.net/weixin_31011587/article/details/112248230

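This post introduces the groupby feature of pandas. Using worked examples, it shows how to group data by race and compute counts, means, variances and other statistics, and it visualizes how fleeing methods and ages are distributed across races. It also covers applying different operations to different columns, such as the median age and the share of cases with signs of mental illness, as well as computing several statistics at once. Finally, it discusses more complex grouping operations for different scenarios.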

The MultiIndex produced by groupby

df.reset_index()

df.index.get_level_values('abc') or df.index.get_level_values(0) (select a level by name or by position)
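To make the two calls above concrete, here is a minimal sketch. The toy DataFrame is hypothetical (its column names simply mirror the dataset loaded later in this post), and 'abc' above is a placeholder level name, so the example uses 'race' instead.

```python
import pandas as pd

# Hypothetical toy data; the column names mirror the dataset used in this post.
df = pd.DataFrame({
    "race": ["B", "B", "W", "W", "H"],
    "flee": ["Not fleeing", "Car", "Not fleeing", "Not fleeing", "Foot"],
    "age": [37.0, 25.0, 61.0, 40.0, 29.0],
})

# Grouping by two columns gives a result whose index is a MultiIndex
# with the levels "race" and "flee".
grouped = df.groupby(["race", "flee"])["age"].mean()

# Option 1: turn the index levels back into ordinary columns.
flat = grouped.reset_index()

# Option 2: pull out one level, either by name or by position.
races_by_name = grouped.index.get_level_values("race")
races_by_pos = grouped.index.get_level_values(0)

print(flat)
print(list(races_by_name))
```

reset_index() is usually the more convenient choice when the result goes on to plotting or merging, while get_level_values() is handy when only one level is needed, for example for filtering.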

Preparation

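This post was written in a Jupyter Notebook; if you have never used one, that will not get in the way of reading. All you need is Python and pandas installed on your machine (the setup code below also imports matplotlib and NumPy). We start by loading a dataset.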

# Load a dataset; I use the US police killings dataset.
# The two lines starting with % are Jupyter magics; drop them outside a notebook.
%matplotlib inline
%config InlineBackend.figure_format = 'retina'
import matplotlib.pyplot as plt
import pandas as pd
import numpy as np
plt.style.use('ggplot')
path = 'https://raw.githubusercontent.com/HoijanLai/dataset/master/PoliceKillingsUS.csv'
data = pd.read_csv(path, encoding='latin1')
data.sample(3)  # show three random rows

      name                 date      race  age   signs_of_mental_illness  flee
683   Tyrone Holman        09/09/15  B     37.0  True                     Not fleeing
1941  Michael Alan Altice  25/12/16  W     61.0  True                     Not fleeing
652   Manuel Soriano       27/08/15  H     29.0  False                    Not fleeing
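If the download is not available, the three sampled rows above are enough to try a first groupby offline. The sketch below rebuilds just those rows by hand; the names sample, summary, median_age and mental_illness_share are my own, and it assumes pandas 0.25 or later for the named-aggregation syntax. The dates look like day/month/year (25/12/16 cannot be month 25), so they are parsed with that format.

```python
import pandas as pd

# The three rows from the sample output above, typed in by hand.
sample = pd.DataFrame(
    {
        "name": ["Tyrone Holman", "Michael Alan Altice", "Manuel Soriano"],
        "date": ["09/09/15", "25/12/16", "27/08/15"],
        "race": ["B", "W", "H"],
        "age": [37.0, 61.0, 29.0],
        "signs_of_mental_illness": [True, True, False],
        "flee": ["Not fleeing", "Not fleeing", "Not fleeing"],
    },
    index=[683, 1941, 652],
)

# Assumption: dates are day/month/year with a two-digit year.
sample["date"] = pd.to_datetime(sample["date"], format="%d/%m/%y")

# Group by race and apply a different operation to each column:
# the median of age, and the mean of a boolean column (i.e. its share of True).
summary = sample.groupby("race").agg(
    median_age=("age", "median"),
    mental_illness_share=("signs_of_mental_illness", "mean"),
)

print(summary)
```

With only one row per race the numbers are not meaningful; the point is the shape of the call, which works unchanged on the full data frame.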