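import pandas as pd

1. Read the file
df = pd.read_excel(file)

2. Replace NaN values with empty strings
df.fillna("", inplace=True)

3. Deduplicate: subset lists the configured primary-key columns to deduplicate on
df = df.drop_duplicates(subset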
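
The three steps above can be sketched as a small helper. This is a minimal sketch under stated assumptions, not the original author's code: the names `clean_frame`, `load_and_clean`, and `key_columns` are hypothetical, and `keep="first"` (the pandas default) is assumed because the original `drop_duplicates` call is cut off before its arguments. Unlike the in-place `fillna` above, this version returns a new DataFrame and leaves its input untouched.

```python
import pandas as pd


def clean_frame(df: pd.DataFrame, key_columns: list[str]) -> pd.DataFrame:
    """Replace NaN with empty strings, then drop rows duplicated on key_columns.

    key_columns is a hypothetical parameter standing in for the configured
    primary-key columns mentioned in step 3.
    """
    # Step 2: fill missing values; returns a copy rather than mutating df.
    cleaned = df.fillna("")
    # Step 3: keep the first row for each distinct key combination (assumed).
    cleaned = cleaned.drop_duplicates(subset=key_columns, keep="first")
    # Renumber the index so it is contiguous after rows were dropped.
    return cleaned.reset_index(drop=True)


def load_and_clean(path: str, key_columns: list[str]) -> pd.DataFrame:
    """Step 1 plus the cleaning above: read an Excel file, then clean it."""
    return clean_frame(pd.read_excel(path), key_columns)


if __name__ == "__main__":
    sample = pd.DataFrame({"id": ["a", "a", "b"], "name": ["x", None, "y"]})
    print(clean_frame(sample, ["id"]))
```

Because the fill happens before deduplication, a key column that contained NaN will compare as the empty string, so rows with a missing key collapse into one. If that is not wanted, filtering out rows with missing keys first is one option.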