This article is based on 《机器学习算法原理与编程实践》 by 郑捷, Chapter 1, Section 4.
Read this data in Python and convert it to an array. The code is as follows:
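import os
import numpy as np

# Convert a data file to a matrix.
# path: path to the data file
# delimiter: field separator within each line
def file2array(path, delimiter):
    recordlist = []
    # The with block closes the file even if reading fails
    with open(path, 'r', encoding='utf-8') as fp:
        content = fp.read()  # content is now a single string holding the whole file
    # Split into a list of rows; splitlines() takes no separator argument: it breaks on
    # every line boundary (\n, \r\n, \r, ...) and drops the line endings by default
    rowlist = content.splitlines()
    # Walk the rows, skip blank ones, and split each on the delimiter into a list of string fields
    recordlist = [row.split(delimiter) for row in rowlist if row.strip()]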
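At this point `recordlist` is a list of lists of strings, not yet an array. As a minimal sketch of the remaining conversion, the example below applies the same split-and-filter logic to an in-memory string and then builds a NumPy array from it. The helper name `rows_to_array`, its `dtype` parameter, and the sample text are hypothetical, introduced only for illustration; it assumes every non-blank row has the same number of fields and that every field is numeric.

```python
import numpy as np

def rows_to_array(text, delimiter, dtype=float):
    """Hypothetical helper: parse delimited text into a 2-D NumPy array."""
    # Same logic as the listing above: split into rows, skip blank rows,
    # then split each row into string fields.
    records = [row.split(delimiter) for row in text.splitlines() if row.strip()]
    # Build a string array first, then cast to the requested dtype.
    # Assumes rectangular, numeric data; ragged rows or non-numeric
    # fields raise an error here.
    return np.array(records).astype(dtype)

# Made-up sample: two tab-separated data rows with a blank line between them.
sample = "1.0\t2.0\t3.0\n\n4.0\t5.0\t6.0\n"
arr = rows_to_array(sample, "\t")
print(arr.shape)
```

Casting in a separate step keeps the parsing and the type conversion apart, so a file with text columns could be loaded with `dtype=str` and converted column by column later.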