训练集、验证集、测试集

最新推荐文章于 2024-05-15 10:06:12 发布

是晨星啊

最新推荐文章于 2024-05-15 10:06:12 发布

阅读量742

点赞数

分类专栏：机器学习

机器学习专栏收录该内容

34 篇文章 1 订阅

订阅专栏

validation_data: tuple (x_val, y_val) or tuple (x_val, y_val, val_sample_weights) on which to evaluate the loss and any model metrics at the end of each epoch. The model will not be trained on this data. This will override validation_split.
验证数据集是用来对每一个epoch都会做一次评估，模型在验证数据集上不进行训练
“validation loss is the value of cost function for cross validation set and loss is the value of cost function for training set.”
val_loss is the value of cost function for your cross validation data and loss is the value of cost function for your training data. On validation data, neurons using drop out do not drop random neurons. The reason is that during training we use drop out in order to add some noise for avoiding over-fitting. During calculating cross validation, we are in recall phase and not in training phase. We use all the capabilities of the network.

在训练中我们使用dropout是为了增加一些噪声，避免过度拟合。
而在验证数据上，使用dropout的神经元不会随机丢弃神经元。

Thanks to one of our dear friends, I quote and explain the contents from here which are definitely useful.

validation_split: Float between 0 and 1. Fraction of the training data to be used as validation data. The model will set apart this fraction of the training data, will not train on it, and will evaluate the loss and any model metrics on this data at the end of each epoch. The validation data is selected from the last samples in the x and y data provided, before shuffling.

validation_data: tuple (x_val, y_val) or tuple (x_val, y_val, val_sample_weights) on which to evaluate the loss and any model metrics at the end of each epoch. The model will not be trained on this data. This will override validation_split.

As you can see

fit(self, x=None, y=None, batch_size=None, epochs=1, verbose=1, callbacks=None, validation_split=0.0, validation_data=None, shuffle=True, class_weight=None, sample_weight=None, initial_epoch=0, steps_per_epoch=None, validation_steps=None)
fit method used in Keras has a parameter named validation_split, which specifies the percentage of data used for evaluating the model which is created after each epoch. After evaluating the model using this amount of data, that will be reported by val_loss if you’ve set verbose to 1; moreover, as the documentation clearly specifies, you can use either validation_data or validation_split. Cross validation data is used to investigate whether your model over-fits the data or does not. This is what we can understand whether our model has generalization capability or not.

Keras difference beetween val_loss and loss during training
https://datascience.stackexchange.com/questions/25267/keras-difference-beetween-val-loss-and-loss-during-training?rq=1

是晨星啊

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
训练集、验证集、测试集

validation_data: tuple (x_val, y_val) or tuple (x_val, y_val, val_sample_weights) on which to evaluate the loss and any model metrics at the end of each epoch. The model will not be trained on this da...
复制链接

扫一扫