import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
df = pd.DataFrame({'A': list(range(200)), 'B': np.random.randn(200)})
bins = [0, 10, 20, 50, 80, 100, 120, 140, 160, 180, 200]
dff = df.groupby(pd.cut(df['A'], bins=bins))['B'].sum()
dff.plot(kind='bar')
plt.show()
关键词:pd.cut()
bins = [0, 200, 500, 1000]
lables = ["A", "B", "C"]
df["value_range"] = pd.cut(x=df["column"],bins=bins)
df["lable_range"] = pd.cut(x=df["column"],bins=bins,labels=lables)
pd.cut() 参数解释:
- right: default True, 左开右闭, 即包左不包右。
- retbins:True表示返回划分的区间,False表示不返回划分的区间(默认为False)