Problem:
- Given composition data, use Pandas to compute the correlation coefficients between the components
Code
1. Single file
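import pandas as pd

# Input CSV, one column per component
file_name = './data.csv'
data = pd.read_csv(file_name)
# Pairwise correlation matrix between the columns (Pearson by default)
corr = data.corr()
corr.to_csv('correlation.csv')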
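To see what `data.corr()` returns without needing `data.csv` on disk, here is a minimal sketch on a small in-memory table. The column names (`comp_a`, `comp_b`, `comp_c`) and the values are made up for illustration and are not taken from the original data.

```python
import pandas as pd

# Hypothetical composition table; names and values are invented for this sketch.
data = pd.DataFrame({
    "comp_a": [1.0, 2.0, 3.0, 4.0],
    "comp_b": [2.0, 4.0, 6.0, 8.0],  # exactly 2 * comp_a
    "comp_c": [4.0, 3.0, 2.0, 1.0],  # falls as comp_a rises
})

# Square matrix indexed by column name on both axes
pearson = data.corr()

# Look up a single coefficient by row and column label
a_vs_b = pearson.loc["comp_a", "comp_b"]
a_vs_c = pearson.loc["comp_a", "comp_c"]

print(pearson)
```

The result is a symmetric DataFrame with ones on the diagonal, so `to_csv` writes one row and one column per component.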
2. Multiple files
import os
import pandas as pd

file_name_1 = '../dic1/dic2/name1.csv'
file_name_2 = '../dic1/dic2/name2.csv'
file_name_3 = '../dic1/dic2/name3.csv'
file_name_4 = '../dic1/dic2/name4.csv'
# A list (rather than a set) keeps the files in a fixed processing order
file_names = [file_name_1, file_name_2, file_name_3, file_name_4]
for file_name in file_names:
    data = pd.read_csv(file_name)
    corr = data.corr()
    # Take the base name (e.g. name1.csv) regardless of directory depth
    out_name = os.path.basename(file_name)
    corr.to_csv('./correlation_' + out_name)
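When the number of files grows, listing each path by hand becomes tedious. One possible variation, sketched below, collects every CSV in a folder with `pathlib`. The helper name `correlate_folder` and its `in_dir` / `out_dir` parameters are hypothetical and not part of the original code.

```python
from pathlib import Path

import pandas as pd


def correlate_folder(in_dir, out_dir="."):
    """Write correlation_<name>.csv for every CSV found directly in in_dir.

    Hypothetical helper: returns the list of output paths it wrote.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    # sorted() makes the processing order deterministic
    for csv_path in sorted(Path(in_dir).glob("*.csv")):
        corr = pd.read_csv(csv_path).corr()
        out_path = out_dir / f"correlation_{csv_path.name}"
        corr.to_csv(out_path)
        written.append(out_path)
    return written
```

For the paths used above, the call would look like `correlate_folder('../dic1/dic2', '.')`. Writing the results to a different folder than the inputs is the safer choice, since otherwise a second run would also pick up the `correlation_*.csv` files as inputs.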