Method
Use pandas.DataFrame.duplicated(). For details, see: pandas.DataFrame.duplicated usage
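As a minimal sketch of what the method returns: `duplicated()` yields a boolean Series aligned with the frame's index, where `True` marks a row that repeats an earlier one (under the default `keep='first'`). The small frame `demo` below is made up purely for illustration.

```python
import pandas as pd

# Hypothetical two-column frame; rows 0 and 1 are identical.
demo = pd.DataFrame({'a': [1, 1, 2], 'b': ['x', 'x', 'y']})

# Boolean Series: True marks a row that repeats an earlier row.
mask = demo.duplicated()
print(mask.tolist())  # [False, True, False]
```

Negating this mask with `~` and indexing the frame with it keeps only the rows that are not flagged.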
Example
>>> import pandas as pd
>>> df = pd.DataFrame({
...     'brand': ['YumYum', 'YumYum', 'YumYum', 'Indomie', 'Indomie', 'Indomie'],
...     'style': ['cup', 'cup', 'cup', 'cup', 'pack', 'pack'],
...     'rating': [4, 4, 4, 3.5, 15, 5]})
>>> df
     brand style  rating
0   YumYum   cup     4.0
1   YumYum   cup     4.0
2   YumYum   cup     4.0
3  Indomie   cup     3.5
4  Indomie  pack    15.0
5  Indomie  pack     5.0
>>> df_none_duplicated = df[~df.duplicated()]
>>> df_none_duplicated
     brand style  rating
0   YumYum   cup     4.0
3  Indomie   cup     3.5
4  Indomie  pack    15.0
5  Indomie  pack     5.0
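As the output shows, the duplicate rows at index 1 and 2 have been removed, while the first occurrence (index 0) is kept.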
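The result above relies on the default behaviour, which keeps the first occurrence of each duplicated row. As a rough sketch, the `keep` and `subset` parameters of `duplicated()` change which rows get flagged, and `drop_duplicates()` should give the same result as the boolean-mask filter. The variable names below (`first_kept`, `last_kept`, `no_repeats`, `by_brand`) are illustrative only.

```python
import pandas as pd

df = pd.DataFrame({
    'brand': ['YumYum', 'YumYum', 'YumYum', 'Indomie', 'Indomie', 'Indomie'],
    'style': ['cup', 'cup', 'cup', 'cup', 'pack', 'pack'],
    'rating': [4, 4, 4, 3.5, 15, 5]})

# keep='first' (default): rows 1 and 2 are flagged, row 0 survives.
first_kept = df[~df.duplicated()]

# keep='last': rows 0 and 1 are flagged, row 2 survives.
last_kept = df[~df.duplicated(keep='last')]

# keep=False: every member of a duplicated group is flagged.
no_repeats = df[~df.duplicated(keep=False)]

# subset: compare only the listed columns when looking for duplicates.
by_brand = df[~df.duplicated(subset=['brand'])]

# drop_duplicates() matches the mask-based filter with default settings.
same_as_mask = df.drop_duplicates().equals(first_kept)

print(first_kept.index.tolist())  # [0, 3, 4, 5]
print(last_kept.index.tolist())   # [2, 3, 4, 5]
print(no_repeats.index.tolist())  # [3, 4, 5]
print(by_brand.index.tolist())    # [0, 3]
print(same_as_mask)               # True
```

The mask form is handy when the flagged rows themselves are needed (for example `df[df.duplicated()]` to inspect them), while `drop_duplicates()` is the shorter spelling when only the deduplicated frame matters.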