kaggle_course_overfitting & early-stopping

最新推荐文章于 2022-11-16 12:45:42 发布

qq_42839893

最新推荐文章于 2022-11-16 12:45:42 发布

阅读量484

点赞数

分类专栏：深度学习文章标签： tensorflow 机器学习深度学习可视化神经网络

本文链接：https://blog.csdn.net/qq_42839893/article/details/112533266

版权

本文探讨了过拟合和早停策略在深度学习中的作用。通过学习曲线解释模型在训练和验证集上的损失，指出模型容量与过拟合、欠拟合之间的关系。介绍了增加模型容量的方法，并详细说明了早停策略如何防止过拟合，同时避免欠拟合，以找到最佳平衡点。最后，提供了一个数据预处理、模型构建和使用早停策略训练的实例。

摘要由CSDN通过智能技术生成

overfitting & early-stopping

Introduction

Recall from the example in the previous lesson that Keras will keep a history of the training and validation loss over the epochs that it is training the model. In this lesson, we’re going to learn how to interpret these learning curves and how we can use them to guide model development. In particular, we’ll examine at the learning curves for evidence of underfitting and overfitting and look at a couple of strategies for correcting it.

Interpreting the Learning Curves

You might think about the information in the training data as being of two kinds: signal and noise. The signal is the part that generalizes, the part that can help our model make predictions from new data. The noise is that part that is only true of the training data; the noise is all of the random fluctuation that comes from data in the real-world or all of the incidental, non-informative patterns that can’t actually help the model make predictions. The noise is the part might look useful but really isn’t.

We train a model by choosing weights or parameters that minimize the loss on a training set. You might know, however, that to accurately assess a model’s performance, we need to evaluate it on a new set of data, the validation data. (You could see our lesson on model validation in Introduction to Machine Learning for a review.)

When we train a model we’ve been plotting the loss on the training set epoch by epoch. To this we’ll add a plot the validation data too. These plots we call the learning curves. To train deep learning models effectively, we need to be able to interpret them.