python pandas 读入column列标签不在第一行的csv文件，处理并存储为2个excel sheet

最新推荐文章于 2023-09-01 09:05:00 发布

Steven全

最新推荐文章于 2023-09-01 09:05:00 发布

阅读量1.1k

点赞数

分类专栏： excel python command pandas 文章标签： python 开发语言数据挖掘

本文链接：https://blog.csdn.net/weixin_42668334/article/details/124789429

版权

python command 同时被 3 个专栏收录

8 篇文章 0 订阅

订阅专栏

excel

5 篇文章 0 订阅

订阅专栏

pandas

1 篇文章 0 订阅

订阅专栏

最近python pandas读入一个column index不在第一行，而在第10行的csv, 搜索很久，最后找到bug. 发现一个现象，如果选择了第n行做为column label, 在n行以前的数据就自动忽略了。在df中不需要专门制定，也不需要专门删除（drop)，删除和不删结果是一样的。

file_dir = r"D:\datalog\py_tool"
in_file  = "myfile.csv"
fpath_in = os.path.join(file_dir, filenames)
out_file_dir = os.path.join(file_dir, 'PY')                     #原路径下面建子路径"PY"
print("'Step0.0,Check out_file_dir as '", out_file_dir)
print("Step0.1, mission start to process in_file", in_file)
out_file = str(in_file[0:in_file.index('.')]) + '_py.xls'
fpath_out = os.path.join(out_file_dir, out_file)

df = pd.read_csv(fpath_in, header=9)  		
# stdf, column name at line10，python3.7第10行以前自动忽略了。
print("Step0,Initial Print df.head()")
print(df.head(10))
#删除列标题文字中的空格，读入时以 ","号分割不干净
col_names = df.columns.tolist()  # 获取列名字
for index, value in enumerate(col_names):
    col_names[index] = value.replace(" ", "")
df.columns = col_names
print("The First Row of the stdf csv DataFrame is:")
print(col_names)
#
df2 = df.filter(regex='^PWM')  # some column begin with PWM;
print(df2.head(5))
#
writer = pd.ExcelWriter(fpath_out)
df.to_excel(writer, sheet_name="csv_converted")  # mutiple file into .xlsx use ExcelWriter, single DF just ".to_excel".
df2.to_excel(writer, sheet_name="PWM")  # 一次写入2个dataframe；
writer.save()

Steven全

关注

0
点赞
踩
1

收藏

觉得还不错? 一键收藏
0
评论
python pandas 读入column列标签不在第一行的csv文件，处理并存储为2个excel sheet

最近python pandas读入一个column index不在第一行，而在第10行的csv, 搜索很久，最后找到bug. 发现一个现象，如果选择了第n行做为column label, 在n行以前的数据就自动忽略了。在df中不需要专门制定，也不需要专门删除（drop)，删除和不删结果是一样的。file_dir = r"D:\datalog\py_tool"in_file = "myfile.csv"fpath_in = os.path.join(file_dir, filenames)out_
复制链接

扫一扫

专栏目录