#1.数据集按列去重
data.drop_duplicate(subset=['col1','col2'],keep='first',inplace=TRUE) #keep 默认first,inplace 是否在本数据集修改
#2.数据集按索引去重
data[~data.index.duplicated(keep='last')] #keep=last,则最后last为FALSE,前面重复的全部返回true
# 3.单列去重
data['col1'].drop_duplicate()
PYTHON_数据去重
最新推荐文章于 2023-07-24 16:09:18 发布