【笔记10】out-of-sample testing 和 in-sample testing

train, validation and test

train_validation_test
通常分为train和test,在train中分出一份validation,第二步骤重复n次(通常取10)。

Generally splits are done like this:
a) Train
b) Test
Generally, the train data is then split in n parts. n−1 of them are used for training and remaining 1 is used for validation. And, this process is repeated until all the n parts become validation sets once.

out of sample and in sample

\out of sample datain sample data
trainnoyes
validationnoyes
testyesno

in sample testing <-> purpose: high train accuracy
out of sample testing <-> purpose: high test accuracy

从监督学习看testing:
  1. 回归:分为训练集和测试集,(X, y)作为pair送入学习,得到函数或神经网络后,测试为input(X)-ouput(y)。因为通常加入正则项,因此训练准确率不会是百分百,但误差项会达到最小,计算error项就是一个in sample testing;利用得到的模型,预测训练集中没有的y值,就是out of sample testing,可能有`部分``X在训练集中出现过。
  2. 分类:(X, label)作为pair送入学习,测试为input(X)-ouput(label)。计算error项就是一个in sample testing;利用得到的模型,预测测试集label,就是out of sample testing

References:

  1. Parameter Optimization in Algorithmic Finance Part I: Walk forward optimization
  2. Out of Sample and In Sample testing
  3. in-sample data vs out-of-sample data
  4. Difference between “in-sample” and “pseudo out-of-sample” forecasts
  • 11
    点赞
  • 14
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值