Chapter 4: Combining Multiple Tables (Using merge and append)

Latest recommended article published on 2023-11-16 19:10:34

逸枚俗人

Tags: python

Article link: https://blog.csdn.net/qq_43570534/article/details/124556869

Introduction to Data Processing with Python

Chapter 4: Combining Multiple Tables (Using merge and append)
- Extending along the field axis (more columns): merge
- Extending along the tuple axis (more rows): append

Extending along the field axis (more columns): merge

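import pandas as pd

filepath_01 = r"file01.xlsx"
filepath_02 = r"file02.xlsx"
df_table_01 = pd.read_excel(filepath_01)
df_table_02 = pd.read_excel(filepath_02)

# merge() keeps every column from both tables; overlapping non-key names get a suffix ("_x" and "_y" by default).
# Pick the columns to keep by hand. Assume the first table has columns x1, x2, x3 and the second has y1, y2, y3.
need_column = ["x1", "x2", "y1", "y2"]
df_table_merge = pd.merge(
    left=df_table_01, right=df_table_02,  # left and right are the two tables to merge
    # left_on and right_on name the columns that carry the same meaning in each table; rows are joined on them.
    # They are passed separately because the same field may be named differently in different tables.
    left_on=["x3"], right_on=["y3"],
    # Left outer join: left-table rows whose left_on value has no match in right_on are still kept,
    # so the left table acts as the primary table. pandas defaults to how="inner" when how is omitted.
    how="left",
    # Override the suffix rule for overlapping names. This assumes the right table holds the values we want,
    # so the left table's copy is tagged "_delete" to make it easy to filter out afterwards.
    suffixes=("_delete", ""),
)
result = df_table_merge[need_column]  # keep only the hand-picked columns and drop the redundant ones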
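
To make the effect of how="left" and the custom suffixes visible without any Excel files, here is a minimal sketch on two small in-memory tables. The table names (orders, customers), their columns, and their values are invented for illustration and do not come from file01.xlsx or file02.xlsx.

```python
import pandas as pd

# Hypothetical stand-ins for the two Excel tables (names and values are assumptions).
orders = pd.DataFrame({
    "order_id": [1, 2, 3],
    "customer": ["c1", "c2", "c9"],        # "c9" has no match in the right table
    "name": ["old-a", "old-b", "old-c"],   # stale copy of a column the right table also has
})
customers = pd.DataFrame({
    "cust_id": ["c1", "c2", "c3"],
    "name": ["Ann", "Bob", "Cid"],
})

merged = pd.merge(
    left=orders,
    right=customers,
    left_on=["customer"],      # same meaning, different column names
    right_on=["cust_id"],
    how="left",                # keep every row of the left table
    suffixes=("_delete", ""),  # left "name" becomes "name_delete", right "name" is unchanged
)

# Both key columns and both "name" columns are present after the merge.
print(list(merged.columns))

# Keep only the wanted columns; the unmatched order (customer "c9") has NaN in "name".
result = merged[["order_id", "customer", "name"]]
print(result)
```

Because the key columns have different names, both of them appear in the merged table, which is one more reason to finish with an explicit column selection.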

Extending along the tuple axis (more rows): append

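import pandas as pd

filepath_01 = r"file01.xlsx"
filepath_02 = r"file02.xlsx"
df_table_01 = pd.read_excel(filepath_01)
df_table_02 = pd.read_excel(filepath_02)

# Keep the column names identical in both tables; mismatched names end up as separate columns.
# The result is a new DataFrame, so assign it back, otherwise the original table is unchanged.
# DataFrame.append was removed in pandas 2.0; pd.concat is the equivalent call.
df_table_01 = pd.concat([df_table_01, df_table_02], ignore_index=True)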
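
The two cautions in the comment (matching column names and assigning the result back) can be checked on a small example. This is a sketch on invented in-memory tables (jan, feb are assumed names), and it uses pd.concat, the replacement for DataFrame.append, which was deprecated in pandas 1.4 and removed in pandas 2.0.

```python
import pandas as pd

# Hypothetical monthly tables with identical column names (values are made up).
jan = pd.DataFrame({"id": [1, 2], "amount": [10.0, 20.0]})
feb = pd.DataFrame({"id": [3], "amount": [30.0]})

# Stacking rows returns a new DataFrame; jan itself is not modified.
combined = pd.concat([jan, feb], ignore_index=True)
print(combined)

# If a column name differs (here only by case), the rows no longer line up:
# the result has both "amount" and "Amount", padded with NaN.
feb_renamed = feb.rename(columns={"amount": "Amount"})
mismatched = pd.concat([jan, feb_renamed], ignore_index=True)
print(mismatched)
```

ignore_index=True rebuilds the row index from 0; without it the stacked table would keep each input's original index and contain duplicate labels.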
