pandas使用drop_duplicates去除DataFrame重复项参数
DataFrame中存在重复的行或者几行中某几列的值重复,这时候需要去掉重复行,示例如下:
data.drop_duplicates(subset=[‘A’,‘B’],keep=‘first’,inplace=True)
实例:
#保存至csv中
s=({"YYYY":Year,"State":data["State"],"TDRState":TDRState})
submit=pd.DataFrame(data=s)
submit=submit.drop_duplicates(subset=['State','TDRState','YYYY'],keep='first',inplace=False)
submit.to_csv('/Users/liyixin/Desktop/result.csv',index=False)