有DataFrame数据:
train, test;作如下操作(主要是方法一):
combine = train.append(test)
# 做一系列数据预处理...
# 处理完之后,如何分离train和test?
# 方法一、找一个能唯一区分train和test的列;比如id:
x_train = combine[combine['id'].isin(train['id'])]
x_test = combine[combine['id'].isin(test['id'])]
# 方法二、根据拼接时的前后顺序截取:
x_train = combine.iloc[:train.shape[0], :]
x_test = combine.iloc[train.shape[0]:, :]