验证集和测试集的区别

最新推荐文章于 2024-09-13 22:01:30 发布

lyy8021

最新推荐文章于 2024-09-13 22:01:30 发布

阅读量389

点赞数 3

文章标签：深度学习人工智能

本文链接：https://blog.csdn.net/it_lyy_/article/details/139297417

版权

问：验证集和测试集的区别？
答：
参考stackoverflow的一个回答，写得很透彻。

参考链接： https://stackoverflow.com/questions/2976452/whats-is-the-difference-between-train-validation-and-test-set-in-neural-netwo

The training and validation sets are used during training.

for each epoch
    for each training data instance
        propagate error through the network
        adjust the weights
        calculate the accuracy over training data
    for each validation data instance
        calculate the accuracy over the validation data
    if the threshold validation accuracy is met
        exit training
    else
        continue training

Once you’re finished training, then you run against your testing set and verify that the accuracy is sufficient.

Training Set: this data set is used to adjust the weights on the neural network.

Validation Set: this data set is used to minimize overfitting. You’re not adjusting the weights of the network with this data set, you’re just verifying that any increase in accuracy over the training data set actually yields an increase in accuracy over a data set that has not been shown to the network before, or at least the network hasn’t trained on it (i.e. validation data set). If the accuracy over the training data set increases, but the accuracy over the validation data set stays the same or decreases, then you’re overfitting your neural network and you should stop training.

Testing Set: this data set is used only for testing the final solution in order to confirm the actual predictive power of the network.

问：数据增强是在数据划分之后还是之前？
答：