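There are two Excel sheets: table 1 has 41 columns and table 2 has 42. The tenth column of table 2 is empty, and once it is removed the data layout matches table 1.
Dropping the tenth column with pandas is not enough on its own: the merged table 3 still has 42 columns. The reason is that table 1's column labels run 0-40, while table 2's labels after the drop are 0-8 and 10-41.
Because the two sets of labels differ, the merge takes their union, 0-41, which is 42 columns. The fix is to drop the tenth column and then relabel table 2's columns as 0-40, so that the merged table has 41 columns labeled 0-40.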
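The label mismatch described above can be reproduced on a small scale. The following is a minimal sketch, assuming the tables are combined with `pd.concat`; the frames are synthetic stand-ins (4 and 5 columns instead of 41 and 42) and the names `table1`, `table2`, `misaligned`, and `aligned` are illustrative only.

```python
import pandas as pd

# Synthetic stand-ins: table1 has 4 columns, table2 has 5 with an
# empty column at zero-based position 2.
table1 = pd.DataFrame([[1, 2, 3, 4]], columns=range(4))
table2 = pd.DataFrame([[5, 6, None, 7, 8]], columns=range(5))

# Dropping the empty column leaves a gap in the labels.
table2 = table2.drop(columns=table2.columns[2])
labels_after_drop = list(table2.columns)

# concat aligns on column labels, so the result carries the union of
# both label sets and is one column wider than table1.
misaligned = pd.concat([table1, table2], ignore_index=True)

# Relabeling table2 to match table1 removes the extra column.
table2.columns = table1.columns
aligned = pd.concat([table1, table2], ignore_index=True)

print(labels_after_drop)
print(misaligned.shape, aligned.shape)
```

The same reasoning scales to the 41 and 42 column case: the drop removes the data, but the surviving labels still skip one value until they are reassigned.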
import pandas as pd

# Target column labels: 'Unnamed: 0' through 'Unnamed: 40' (41 labels)
columns_name = [f'Unnamed: {i}' for i in range(41)]
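# file_list is assumed to be defined earlier as a list of Excel file paths
li = []
for i in file_list:
    print(i)
    m = pd.read_excel(i)
    print(m.shape)
    # If a file has 42 columns, drop the empty tenth column
    if m.shape[1] > 41:
        # print(m.shape)
        # Drop the tenth column (zero-based position 9)
        m.drop(m.columns[9], axis=1, inplace=True)
        # Relabel the columns so they match table 1
        m.columns = columns_name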
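
The drop-and-relabel step can also be wrapped in a small function and followed by the merge itself. This is only a sketch under stated assumptions: `normalize` is a hypothetical helper, the merge is assumed to be `pd.concat`, and the two frames are built in memory so the example runs without Excel files. In a real script each frame would come from `pd.read_excel`.

```python
import pandas as pd

EXPECTED_COLS = 41  # width of table 1, as stated in the text
EMPTY_POS = 9       # zero-based position of the empty tenth column
LABELS = [f'Unnamed: {i}' for i in range(EXPECTED_COLS)]


def normalize(frame):
    """Hypothetical helper: drop the empty column if present, then relabel."""
    if frame.shape[1] > EXPECTED_COLS:
        frame = frame.drop(columns=frame.columns[EMPTY_POS])
    frame = frame.copy()  # avoid mutating the caller's frame
    frame.columns = LABELS
    return frame


# Synthetic stand-ins for the two sheets (one row each).
table1 = pd.DataFrame([list(range(41))], columns=LABELS)

row2 = list(range(42))
row2[EMPTY_POS] = None  # the empty tenth column
table2 = pd.DataFrame([row2], columns=[f'Unnamed: {i}' for i in range(42)])

# Normalize every frame, then stack them row-wise.
merged = pd.concat([normalize(t) for t in (table1, table2)],
                   ignore_index=True)
print(merged.shape)
```

Returning a new frame instead of using `inplace=True` keeps the helper free of side effects, which makes it easier to reuse inside a list comprehension like the one above.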