Python - pandas - Renaming columns after groupby + agg aggregation

摸鱼同学

Category: Python. Tags: python, 1024程序员节 (1024 Programmers' Day)

Article link: https://blog.csdn.net/qq_24256877/article/details/108732042

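This article covers aggregation and column renaming in pandas: counting rows with `groupby()` plus `agg()`, and using the `as_index` parameter to control whether the group key is returned as the index. It then shows three ways to rename the aggregated column: chaining `rename()`, defining an alias inside `agg()`, and selecting the column before aggregating.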

1. Data preparation

import pandas as pd

# Load the customer data and preview the first rows
df = pd.read_csv('/data/Mall_Customers_nom.csv')
df.head()

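# as_index=True (the default): Gender becomes the index of the result
gender_df = df.groupby("Gender", as_index=True).agg({'CustomerID':'count'})
gender_df

# as_index=False: Gender is kept as a regular column
gender_df = df.groupby("Gender", as_index=False).agg({'CustomerID':'count'})
gender_df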
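
Because the CSV itself is not reproduced here, the effect of `as_index` can be sketched with a small made-up frame. The sample rows below are hypothetical; only the column names `Gender` and `CustomerID` come from the code above.

```python
import pandas as pd

# Hypothetical stand-in for Mall_Customers_nom.csv (made-up rows).
df = pd.DataFrame({
    "CustomerID": [1, 2, 3, 4, 5],
    "Gender": ["Male", "Female", "Female", "Male", "Female"],
})

# as_index=True (the default): the group key becomes the index.
by_index = df.groupby("Gender", as_index=True).agg({"CustomerID": "count"})
print(by_index.index.name)      # Gender
print(list(by_index.columns))   # ['CustomerID']

# as_index=False: the group key stays an ordinary column.
by_column = df.groupby("Gender", as_index=False).agg({"CustomerID": "count"})
print(list(by_column.columns))  # ['Gender', 'CustomerID']
```

In both cases the count column still carries the source name `CustomerID`, which is why the renaming approaches that follow are needed.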

2.1 rename: chain rename() after agg. Note that agg takes a dict here, written with curly braces {}.

gender_df2 = df.groupby("Gender", as_index=False)\
    .agg({'CustomerID':'count'})\
    .rename(columns={'CustomerID': 'user_count'})

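2.2 agg(new_column=('column', 'function')): named aggregation. Note the parentheses () rather than a dict. With the default as_index=True the group key is returned as the index; as_index=False also works here and keeps Gender as a regular column. This form requires pandas 0.25 or later.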

gender_df3 = df.groupby("Gender")\
        .agg(user_count=('CustomerID','count'))
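
As a minimal sketch of named aggregation with `as_index=False` and more than one output column: the sample rows and the `Age` column below are hypothetical and serve only to illustrate a second aggregation.

```python
import pandas as pd

# Hypothetical sample rows; Age is included only to show a second aggregation.
df = pd.DataFrame({
    "CustomerID": [1, 2, 3, 4, 5],
    "Gender": ["Male", "Female", "Female", "Male", "Female"],
    "Age": [19, 21, 20, 23, 31],
})

# Named aggregation: new_column=(source_column, function).
summary = df.groupby("Gender", as_index=False).agg(
    user_count=("CustomerID", "count"),
    max_age=("Age", "max"),
)
print(list(summary.columns))  # ['Gender', 'user_count', 'max_age']
```

Each keyword names an output column directly, so no separate `rename()` step is needed.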

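2.3 groupby(as_index=False)['column'] followed by agg with a {new_name: function} dict. Note that as_index must be False for this form: with the default as_index=True, pandas 1.0 and later raises SpecificationError (nested renamer is not supported). Recent pandas 2.x releases also warn that passing a dict here is deprecated, so 2.2 is the safer choice in new code.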

gender_df4 = df.groupby('Gender', as_index=False)['CustomerID']\
        .agg({"user_count": "count"})
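
Passing a dict of new names to `agg` on a single selected column is a form pandas has been phasing out, so its behavior depends on the installed version. One version-independent alternative, sketched here on hypothetical sample rows, is to count the selected column and name the result while resetting the index. It is shown next to the named-aggregation form for comparison.

```python
import pandas as pd

# Hypothetical sample rows standing in for the real CSV.
df = pd.DataFrame({
    "CustomerID": [1, 2, 3, 4, 5],
    "Gender": ["Male", "Female", "Female", "Male", "Female"],
})

# Select the column, count per group, then turn the Series into a
# DataFrame whose value column is called user_count.
counts = df.groupby("Gender")["CustomerID"].count().reset_index(name="user_count")

# The same table via named aggregation.
named = df.groupby("Gender", as_index=False).agg(user_count=("CustomerID", "count"))

print(counts)
print(named)
```

Both results have the columns `Gender` and `user_count`, with `Gender` as a regular column rather than the index.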