Training set - Validation set - Test set - Development set (dev set)
1. Training set - Validation set - Test set
- Training set: The data used to train the model. It is fed into a learning algorithm, which produces a model that maps inputs to outputs.
- Validation set: Typically smaller than the training set. It is used to evaluate models trained with different hyperparameter values, and to detect overfitting during training.
- Test set: Used to estimate the final performance of a model after hyperparameter tuning, on data that played no part in training or tuning. It is also useful for comparing how different kinds of models (SVMs, neural networks, random forests…) perform against each other.
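The three sets described above can be produced by shuffling the data once and cutting it into three parts. The following is a minimal sketch using only the Python standard library; the function name `split_dataset` and the 60/20/20 proportions are assumptions chosen for illustration, not something the referenced article prescribes.

```python
import random

def split_dataset(samples, val_fraction=0.2, test_fraction=0.2, seed=0):
    """Shuffle samples and split them into (train, validation, test) lists.

    The fractions are illustrative defaults, not a recommendation.
    """
    if not 0 <= val_fraction + test_fraction < 1:
        raise ValueError("val_fraction + test_fraction must be in [0, 1)")
    shuffled = list(samples)
    # A seeded generator keeps the split reproducible between runs.
    random.Random(seed).shuffle(shuffled)
    n_val = int(len(shuffled) * val_fraction)
    n_test = int(len(shuffled) * test_fraction)
    validation = shuffled[:n_val]
    test = shuffled[n_val:n_val + n_test]
    train = shuffled[n_val + n_test:]
    return train, validation, test

data = list(range(100))
train, validation, test = split_dataset(data)
print(len(train), len(validation), len(test))
```

Every sample ends up in exactly one of the three lists, so the validation and test sets never overlap with the data the model is trained on.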
2. Development set (dev set)
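The development set (dev set) is another name for the hold-out cross-validation set: data held out from training and used to compare models and tune hyperparameters. It plays the same role as the validation set described in section 1.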
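As a rough illustration of how a dev set is used, the sketch below picks a hyperparameter on the dev set and only then measures performance on the test set. Everything in it is an assumption made for the example: the toy 1-D data, the simple k-nearest-neighbours classifier, the candidate values of `k`, and the helper names `knn_predict` and `accuracy`.

```python
import random

def knn_predict(train, x, k):
    """Predict the label of x by majority vote among its k nearest training points."""
    neighbours = sorted(train, key=lambda point: abs(point[0] - x))[:k]
    votes = sum(label for _, label in neighbours)
    return 1 if 2 * votes > len(neighbours) else 0

def accuracy(train, evaluation, k):
    """Fraction of evaluation points whose label is predicted correctly."""
    correct = sum(knn_predict(train, x, k) == label for x, label in evaluation)
    return correct / len(evaluation)

# Toy data (made up for illustration): label is 1 when x > 0.5, with 10% label noise.
rng = random.Random(0)
points = []
for _ in range(300):
    x = rng.random()
    label = int(x > 0.5)
    if rng.random() < 0.1:
        label = 1 - label
    points.append((x, label))

# Hold out a dev set and a test set; the rest is the training set.
train, dev, test = points[:180], points[180:240], points[240:]

# Use the dev set only to choose the hyperparameter k.
candidate_ks = [1, 5, 15]
dev_scores = {k: accuracy(train, dev, k) for k in candidate_ks}
best_k = max(dev_scores, key=dev_scores.get)

# Touch the test set once, after k has been fixed.
test_score = accuracy(train, test, best_k)
print("dev accuracy per k:", dev_scores)
print("chosen k:", best_k, "test accuracy:", test_score)
```

Because `k` is chosen to do well on the dev set, the dev accuracy tends to be optimistic for the chosen model, which is why the final number is taken from the untouched test set.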
References
https://www.brainstobytes.com/test-training-and-validation-sets/