[Cross-Validation] k-Fold Cross-Validation: Splitting a Dataset Read from a File

import os

import pandas as pd
from sklearn.model_selection import KFold

# Directory that holds the combined train+validation file; one subfolder per fold is created here
base_dir = r"F:\PaperCode\Mypaper_python_code\data\dataset_split"
data = pd.read_csv(os.path.join(base_dir, "data_trainANDval.tsv"), sep="\t")

X = data.iloc[:, 0]  # first column
y = data.iloc[:, 1]  # second column

# print(X)
# print(type(X))


kf = KFold(n_splits=5, shuffle=False)  # initialize KFold: 5 contiguous folds, no shuffling

# Hold the train/test split of each of the 5 folds
X_train_files = []
y_train_files = []
X_test_files = []
y_test_files = []

for p, (train_index, test_index) in enumerate(kf.split(X)):  # call split() to partition the data
    # print('train_index:%s , test_index: %s ' % (train_index, test_index))
    # split() yields positional indices, so select rows with .iloc rather than by label
    X_train, X_test = X.iloc[train_index], X.iloc[test_index]
    y_train, y_test = y.iloc[train_index], y.iloc[test_index]

    # Two ways to keep the result:
    # 1. Append to the lists
    X_train_files.append(X_train)
    y_train_files.append(y_train)
    X_test_files.append(X_test)
    y_test_files.append(y_test)

    # 2. Write to TSV files in a per-fold folder (0, 1, ..., 4), creating it if it is missing
    fold_dir = os.path.join(base_dir, str(p))
    os.makedirs(fold_dir, exist_ok=True)
    X_train.to_csv(os.path.join(fold_dir, "X_train.tsv"), sep="\t", index=False)
    y_train.to_csv(os.path.join(fold_dir, "y_train.tsv"), sep="\t", index=False)
    X_test.to_csv(os.path.join(fold_dir, "X_test.tsv"), sep="\t", index=False)
    y_test.to_csv(os.path.join(fold_dir, "y_test.tsv"), sep="\t", index=False)


# print(y_test_files)
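
A quick way to check what a split like the one above produces is to run it on a small in-memory table. The following is only a sketch: the toy DataFrame and its column names "text" and "label" are assumptions standing in for the TSV file, which is not shown in the post. It checks that the train and test positions of a fold never overlap and that every row lands in a test fold exactly once.

```python
import pandas as pd
from sklearn.model_selection import KFold

# Hypothetical toy data standing in for the TSV file: 10 rows, two columns.
data = pd.DataFrame({
    "text": [f"sample {i}" for i in range(10)],
    "label": [0, 1] * 5,
})
X = data.iloc[:, 0]
y = data.iloc[:, 1]

kf = KFold(n_splits=5, shuffle=False)

test_positions = []
for fold, (train_index, test_index) in enumerate(kf.split(X)):
    # Within one fold, the train and test positions must not overlap.
    assert set(train_index).isdisjoint(test_index)
    test_positions.extend(test_index.tolist())
    print(f"fold {fold}: train={len(train_index)} rows, test={len(test_index)} rows")

# Across all folds, each row should be used as test data exactly once.
covers_all_rows = sorted(test_positions) == list(range(len(X)))
print("every row tested exactly once:", covers_all_rows)
```

Because shuffle=False cuts the file into contiguous blocks, a file sorted by label can yield folds with very different label proportions. If that is a concern, KFold(n_splits=5, shuffle=True, random_state=...) or StratifiedKFold are the usual alternatives in scikit-learn.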

The references describe cross_val_score as scikit-learn's function for running cross-validation. In k-fold cross-validation the dataset is split into K folds; in each round, K-1 folds are used for training and the remaining fold for testing, so every fold serves as the test set once. The cv parameter sets the number of folds. An integer cv must be at least 2 and cannot exceed the number of samples; when the estimator is a classifier, an integer cv produces stratified folds. The scoring parameter selects the evaluation metric, for example accuracy, F1, or precision. If the dataset is small, each test fold contains only a few samples, so the per-fold scores are subject to considerable chance variation. [1][2][3]

The references do not discuss k-fold cross-validation by name, but cross_val_score performs exactly that when cv is set to K: it builds the K splits, fits the estimator on each training part, and returns the K test scores. It is therefore a convenience wrapper around k-fold cross-validation, whereas KFold only generates the split indices.

References
[1] sklearn cross_val_score cross-validation. https://blog.csdn.net/qq_43592352/article/details/120812580
[2] Usage and parameters of the sklearn cross-validation function cross_val_score. https://blog.csdn.net/worther/article/details/126909270
[3] cross_val_score cross-validation and K-fold cross-validation (copied from other sources, kept for my own reference). https://blog.csdn.net/weixin_34260071/article/details/114359870
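
The cv and scoring parameters of cross_val_score discussed above can be tried out with a short sketch. The iris dataset and the LogisticRegression model are assumptions chosen only to keep the example self-contained; they are not taken from the post.

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import KFold, cross_val_score

# Illustrative data and model (assumptions, not from the original post).
X, y = load_iris(return_X_y=True)
model = LogisticRegression(max_iter=1000)

# An integer cv asks for that many folds; for a classifier the folds are stratified.
scores = cross_val_score(model, X, y, cv=5, scoring="accuracy")
print("accuracy per fold:", scores)
print("mean accuracy:", scores.mean())

# A splitter object can be passed instead of an integer to control the split explicitly.
kf = KFold(n_splits=5, shuffle=True, random_state=0)
f1_scores = cross_val_score(model, X, y, cv=kf, scoring="f1_macro")
print("macro F1 per fold:", f1_scores)
```

Each call returns one score per fold, so reporting the mean together with the spread gives a better picture than any single fold, especially on small datasets.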