pandas: data computation and formatting

Bachelor??

Tags: pandas python

Article link: https://blog.csdn.net/hmdzjp/article/details/129603216

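This article shows how to use the pandas library for data computation, including sum, mean, maximum, median, mode, variance, standard deviation, and quantiles. It also covers data formatting techniques, such as rounding to a fixed number of decimal places and converting values to percentage strings.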

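import pandas as pd
data=[[11,22,22],[11,22,99],[77,11,99]]
index=['a','b','c']
columns=['aa','bb','cc']
df=pd.DataFrame(data=data,index=index,columns=columns)
print(df)
# Data computation
#print(df.sum(axis=1))    # sum; axis=1 computes across each row, axis=0 down each column
print(df.mean())     # mean; the default axis=0 computes per column
#print(df.max())     # maximum
#df=pd.concat([df,df.max().to_frame().T],ignore_index=True)       # append the maxima as a new row; DataFrame.append was removed in pandas 2.0, so use pd.concat; ignore_index=True discards the old index labels
#print(df)
#print(df.median())      # median
#print(df.mode())      # mode of each column; the default axis=0 computes per column
#print(df['cc'].mode())   # mode of a single column
#print(df.var())     # variance (sample variance, ddof=1 by default)
#print(df.std())     # standard deviation (sample, ddof=1 by default)
#df.quantile(0.35)       # 35% quantile; to include datetime and timedelta data, pass numeric_only=False (the default since pandas 2.0)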
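
The commented-out calls above can be tried together in one pass. The following is a minimal sketch on the same 3x3 data, assuming pandas 2.x, where `DataFrame.append` is no longer available and `pd.concat` is used instead. The row label `'max'` and the variable names are illustrative, not from the original post.

```python
import pandas as pd

data = [[11, 22, 22], [11, 22, 99], [77, 11, 99]]
df = pd.DataFrame(data, index=['a', 'b', 'c'], columns=['aa', 'bb', 'cc'])

row_sums = df.sum(axis=1)    # axis=1: one result per row
col_means = df.mean()        # default axis=0: one result per column
col_modes = df.mode()        # DataFrame holding the most frequent value(s) of each column
q35 = df.quantile(0.35)      # linear interpolation between the sorted values of each column

# Append the column maxima as an extra row.
# 'max' is a hypothetical label chosen for this example.
max_row = df.max().to_frame().T   # Series -> one-row DataFrame
max_row.index = ['max']
df_with_max = pd.concat([df, max_row])

print(row_sums)
print(q35)
print(df_with_max)
```

With three values per column, the 35% quantile sits at position 0.35 * (3 - 1) = 0.7 between the first and second sorted values, so column `bb` (sorted: 11, 22, 22) gives 11 + 0.7 * 11 = 18.7.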

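# Data formatting
import pandas as pd
import numpy as np
df=pd.DataFrame(np.random.random([5,5]),columns=['a1','a2','a3','a4','a5'])    # 5 rows, 5 columns
print(df)
#print(df.round(2))     # round to 2 decimal places
#print(df.round({'a1':1,'a2':2}))    # set the number of decimals per column
#s1=pd.Series([1,0,2],index=['a1','a2','a3'])
#print(s1)
#print(df.round(s1))      # set the number of decimals with a Series indexed by column name
#df['百分比']=df['a1'].apply(lambda x:format(x,'.0%'))       # whole column as a percentage with 0 decimals, using apply() and format(); the format spec is hard to remember... ('百分比' means "percentage")
#df['百分比']=df['a1'].map(lambda x:'{:.0%}'.format(x))      # same as above, using map
#print(df)
# There is also a thousands-separator setting; I probably won't need it, so I'm skipping it for now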
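
For reference, the percentage format and the thousands separator mentioned above both come from Python's format-spec mini-language, so they can be applied the same way. This is a small sketch with fixed values instead of random ones so the output is predictable. The `sales` column and the new column names are made up for illustration.

```python
import pandas as pd

df = pd.DataFrame({
    'a1': [0.1234, 0.5678, 0.9],
    'sales': [1234567.891, 1000.0, 999.5],   # hypothetical column for the separator demo
})

# Numeric rounding: the values stay floats
rounded = df.round({'a1': 2})

# String formatting: the results are str, so do the arithmetic first
df['pct'] = df['a1'].map('{:.0%}'.format)            # '.0%' multiplies by 100 and appends %
df['sales_fmt'] = df['sales'].map('{:,.2f}'.format)  # ',' inserts thousands separators

print(rounded)
print(df)
```

Note that `round()` keeps the column numeric, whereas the formatted columns hold strings and can no longer be summed or averaged directly.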