Select the rows whose column 'a' contains 'b', group them, and sort the result in descending order
df[df['a'].str.contains('b')].groupby(['a','c'])['uid'].agg(uv='count').sort_values(by='uv',ascending=False)
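A minimal sketch of the same filter, group, and sort chain on made-up data, so the shape of the result is easy to see. The rows are invented for illustration. agg(uv='count') is the named-aggregation form (available since pandas 0.25); na=False is an optional extra that treats missing values in 'a' as non-matches. Note that 'count' counts non-null uid rows; if uv is meant to be distinct users, 'nunique' may be the better choice.

```python
import pandas as pd

# Toy data (invented); the column names a, c, uid mirror the one-liner above.
df = pd.DataFrame({
    'a': ['ab', 'ab', 'ab', 'xb', 'zz'],
    'c': ['p', 'p', 'q', 'p', 'p'],
    'uid': [1, 2, 3, 4, 5],
})

# Keep rows whose 'a' contains 'b'; na=False treats missing values as non-matches.
mask = df['a'].str.contains('b', na=False)

uv = (
    df[mask]
    .groupby(['a', 'c'])['uid']
    .agg(uv='count')                        # named aggregation: one column called 'uv'
    .sort_values(by='uv', ascending=False)  # largest groups first
)
print(uv)
```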
Filter, group, and sort, then take the top values within each group (a roundabout way to sort within groups; not sure whether there is a better one)
df[df['from'].str.contains('oppo r9')].groupby(['from','to'])['uid'].agg(uv='count').sort_values(by='uv',ascending=False)['uv'].groupby(level=0,group_keys=False).nlargest(5000).to_csv('/Users/cici/Documents/group_huanji.csv',encoding='utf-8')
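One possible alternative for the top-N-per-group step, sketched on invented data: flatten the counts with reset_index, sort by group and count, then take head(n) per group. This is only a sketch under the assumption that a flat table is acceptable as output; the sample rows and n=2 are made up, and it is not necessarily better than the nlargest version.

```python
import pandas as pd

# Toy data (invented): (from, to) pairs, one row per uid.
rows = (
    [('oppo r9', 'x')] * 3 + [('oppo r9', 'y')] * 2 + [('oppo r9', 'z')]
    + [('oppo r9s', 'y')] * 2 + [('oppo r9s', 'x')] + [('vivo x7', 'x')]
)
df = pd.DataFrame(rows, columns=['from', 'to'])
df['uid'] = range(len(df))

# Count uids per (from, to) pair, keeping only the 'oppo r9' family.
counts = (
    df[df['from'].str.contains('oppo r9', na=False)]
    .groupby(['from', 'to'])['uid']
    .agg(uv='count')
    .reset_index()
)

# Sort within each 'from' by uv descending, then keep the first n rows per group.
n = 2  # hypothetical cutoff; the original one-liner uses 5000
top = (
    counts.sort_values(['from', 'uv'], ascending=[True, False])
    .groupby('from')
    .head(n)
)
print(top)
```

Because head(n) keeps the row order of the sorted frame, the result stays ordered by group and then by descending count, and it can be written out with top.to_csv(..., index=False) as before.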
2. Export column C for the rows where columns A and B match given strings
df.loc[(df['from']=='苹果-iphone 6s') & (df['to']=='苹果-iphone 7'),'uid'].to_csv('/Users/cici/Documents/iphone6_ip7.csv',header=False,index=False)
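A small sketch of the two-condition filter on invented rows, writing to an in-memory buffer instead of a file so the output can be inspected directly. The data and the buffer are assumptions for illustration only. Each comparison needs its own parentheses because & binds more tightly than ==.

```python
import io
import pandas as pd

# Toy data (invented); only the first row satisfies both conditions.
df = pd.DataFrame({
    'from': ['苹果-iphone 6s', '苹果-iphone 6s', '苹果-iphone 6'],
    'to':   ['苹果-iphone 7', '苹果-iphone 6s', '苹果-iphone 7'],
    'uid':  [101, 102, 103],
})

# Combine the two exact-match conditions element-wise.
mask = (df['from'] == '苹果-iphone 6s') & (df['to'] == '苹果-iphone 7')

# Write only the uid column, without header or index, to an in-memory buffer.
buf = io.StringIO()
df.loc[mask, 'uid'].to_csv(buf, header=False, index=False)
csv_text = buf.getvalue()
print(csv_text)
```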