用python划分数据集
对已经处理好的数据集,根据7:3的规模划分为训练集和测试集:
with open('./data/collection_labeled.txt',"r",encoding='utf-8') as file_object:
lines= file_object.readlines()
sum=len(lines)
num = sum * 7 / 10
i=0
train=open('./data/train_data.txt', 'w',encoding='utf-8'
原创
2021-04-25 20:20:51 ·
407 阅读 ·
0 评论