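This article shows how to convert between CSV files and ARFF files in Python 3. The data comes from the airline.arff file in the data subdirectory of Weka 3.8 or Weka 3.9. Python code is used to convert airline.arff to airline.csv (in essence, extracting and reorganizing information) and to convert airline.csv to airline1.arff (in essence, concatenating lines of text). Part of airline.arff is shown below: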
%% Monthly totals of international airline passengers (in thousands) for
%% 1949-1960.
@relation airline_passengers
@attribute passenger_numbers numeric
@attribute Date date 'yyyy-MM-dd'
@data
112,1949-01-01
118,1949-02-01
132,1949-03-01
129,1949-04-01
121,1949-05-01
135,1949-06-01
148,1949-07-01
148,1949-08-01
136,1949-09-01
119,1949-10-01
104,1949-11-01
118,1949-12-01
115,1950-01-01
126,1950-02-01
141,1950-03-01
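To make the layout of the sample concrete, here is a minimal sketch of how such a file breaks down into a relation name, attribute declarations, and data rows, and how those pieces can be rearranged into CSV text. The helper names split_arff and rows_to_csv are hypothetical and are not the article's own functions. The sketch uses only the standard library and assumes a simple ARFF file like the airline sample: no sparse rows, no attribute names containing spaces, and no commas inside values.

```python
import csv
import io


def split_arff(text):
    """Split ARFF text into (relation, attributes, rows).

    Assumption: a simple layout like the airline sample (dense rows,
    attribute names without spaces, no commas inside values).
    """
    relation = None
    attributes = []   # (name, type) pairs taken from @attribute lines
    rows = []         # lists of string values taken from the @data section
    in_data = False
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith('%'):
            continue  # skip blank lines and comment lines
        if in_data:
            rows.append(line.split(','))
            continue
        keyword = line.split(None, 1)[0].lower()
        if keyword == '@relation':
            relation = line.split(None, 1)[1]
        elif keyword == '@attribute':
            _, name, attr_type = line.split(None, 2)
            attributes.append((name, attr_type))
        elif keyword == '@data':
            in_data = True
    return relation, attributes, rows


def rows_to_csv(attributes, rows):
    """Build CSV text: a header of attribute names, then the data rows."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator='\n')
    writer.writerow([name for name, _ in attributes])
    writer.writerows(rows)
    return buf.getvalue()


sample = """\
%% Monthly totals of international airline passengers (in thousands) for
%% 1949-1960.
@relation airline_passengers
@attribute passenger_numbers numeric
@attribute Date date 'yyyy-MM-dd'
@data
112,1949-01-01
118,1949-02-01
"""

relation, attributes, rows = split_arff(sample)
csv_text = rows_to_csv(attributes, rows)
print(csv_text)
```

The attribute names become the CSV header row, while the relation name and attribute types have no place in a CSV file. That is why the reverse conversion has to supply them again when it assembles the ARFF header.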
1. Converting an ARFF file to a CSV file
"""Convert an ARFF file to a CSV file."""
import pandas as pd


def arff_to_csv(fpath):
    # Read the ARFF data; reject paths that do not end in .arff
    if not fpath.endswith('.arff'):
        print('the file is not a .arff file')
        return
    f = open(f