Overfitting and Underfitting
Model capacity
As a model's complexity grows, so does its expressive power. Take polynomials as an example:

$$y = \beta_0 + \beta_1 x + \beta_2 x^2 + \dots + \beta_n x^n + \epsilon$$
As the degree increases, the family of representable functions grows, and with it the model's capacity.
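A quick sketch of this (assuming NumPy, with a hypothetical noisy sine target): higher-degree polynomials always achieve lower training error, which is exactly the capacity effect described above.

```python
import numpy as np

# Noisy ground truth: y = sin(3x) + noise (an illustrative choice)
rng = np.random.default_rng(0)
x = np.linspace(-1, 1, 20)
y = np.sin(3 * x) + rng.normal(scale=0.1, size=x.shape)

for degree in (1, 3, 9):
    coeffs = np.polyfit(x, y, deg=degree)                    # fit beta_0..beta_n
    train_mse = np.mean((np.polyval(coeffs, x) - y) ** 2)    # training error
    print(f"degree={degree}  train MSE={train_mse:.4f}")
```

Training MSE drops monotonically with degree; whether the high-degree fit generalizes is a separate question, which is where overfitting enters.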
Underfitting
- model capacity < ground truth
- train loss/acc is bad
- test loss/acc is bad as well
Overfitting
- model capacity > ground truth
- train loss/acc is good
- test loss/acc is bad
- generalization performance is bad
Detect and Reduce
Splitting
- Train/Val/Test Set
import tensorflow as tf
from tensorflow.keras import datasets

(x, y), (x_test, y_test) = datasets.mnist.load_data()
x_train, x_val = tf.split(x, num_or_size_splits=[50000, 10000])
y_train, y_val = tf.split(y, num_or_size_splits=[50000, 10000])
K-fold cross-validation
- randomly sample 1/k as val dataset
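The fold bookkeeping can be sketched by hand (a minimal NumPy version, indices only; the helper name `k_fold_indices` is made up here, no model training shown):

```python
import numpy as np

def k_fold_indices(n_samples, k, seed=0):
    """Shuffle sample indices and yield (train_idx, val_idx) for each of k folds."""
    idx = np.random.default_rng(seed).permutation(n_samples)
    folds = np.array_split(idx, k)
    for i in range(k):
        val_idx = folds[i]                                     # 1/k held out
        train_idx = np.concatenate(folds[:i] + folds[i + 1:])  # the rest trains
        yield train_idx, val_idx

for train_idx, val_idx in k_fold_indices(10, k=5):
    print(len(train_idx), len(val_idx))
```

Each sample lands in the validation set exactly once across the k folds.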
network.fit(tf.cast(x, dtype=tf.float32)/255., tf.one_hot(tf.cast(y, dtype=tf.int32), depth=10), batch_size=64, epochs=10, validation_split=0.1, validation_freq=2)
Note that the validation_split argument cannot be used when the input is a Dataset object.
As the official docs put it:
The argument validation_split (generating a holdout set from the training data) is not supported when training from Dataset objects, since this feature requires the ability to index the samples of the datasets, which is not possible in general with the Dataset API.
For more, see the official guide on training and evaluation.
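With Dataset inputs, the workaround is to build an explicit validation Dataset and pass it via validation_data. A sketch (the helper `make_ds` is an assumption; `x_train`/`y_train`, `x_val`/`y_val` come from the split shown earlier):

```python
import tensorflow as tf

def make_ds(x, y, batch_size=64):
    """Normalize images, one-hot the labels, and batch into a tf.data.Dataset."""
    x = tf.cast(x, tf.float32) / 255.
    y = tf.one_hot(tf.cast(y, tf.int32), depth=10)
    return tf.data.Dataset.from_tensor_slices((x, y)).batch(batch_size)

# train_ds = make_ds(x_train, y_train)
# val_ds = make_ds(x_val, y_val)
# network.fit(train_ds, epochs=10, validation_data=val_ds, validation_freq=2)
```

validation_data accepts a Dataset directly, so no indexing into the training data is needed.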
Reduce
- More Data
- Constrain model complexity
- Shallow
- Regularization
- Dropout
- Data augmentation
- Early Stopping
Regularization
$$L_1: J(\theta) = \frac{1}{m}\sum{loss} + \lambda\sum_{i=1}^{n}{|\theta_i|}$$

$$L_2: J(\theta) = \frac{1}{m}\sum{loss} + \lambda\sum_{i=1}^{n}{\theta_i^2}$$
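Concretely, the two penalty terms are simple functions of the weights. A NumPy sketch with made-up weights `theta` and the same lambda used in the Keras line below:

```python
import numpy as np

theta = np.array([0.5, -1.0, 2.0])   # illustrative weight vector
lam = 0.001

l1_penalty = lam * np.sum(np.abs(theta))   # lambda * sum |theta_i|
l2_penalty = lam * np.sum(theta ** 2)      # lambda * sum theta_i^2
print(l1_penalty, l2_penalty)              # ~0.0035 and ~0.00525
```

L1 pushes weights toward exact zeros (sparsity); L2 shrinks them smoothly toward zero.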
layers.Dense(256, kernel_regularizer=keras.regularizers.l2(0.001), activation='relu'),
Early Stopping
Stop training when performance on the validation set peaks.
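In Keras this is the EarlyStopping callback: watch a validation metric, stop after it stops improving, and roll back to the best weights. A sketch (the patience value is an arbitrary choice):

```python
from tensorflow import keras

early_stop = keras.callbacks.EarlyStopping(
    monitor='val_accuracy',       # the validation metric to watch
    patience=3,                   # tolerate 3 epochs without improvement
    restore_best_weights=True,    # roll back to the peak checkpoint
)
# network.fit(x_train, y_train, validation_data=(x_val, y_val),
#             epochs=100, callbacks=[early_stop])
```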
Dropout
Randomly drop connections during training.
layers.Dropout(0.5)
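Note that dropout is only active during training; at inference the layer is an identity map (the kept units are scaled by 1/(1-rate) during training so the expected activation is unchanged). A small sketch showing the training flag:

```python
import tensorflow as tf

layer = tf.keras.layers.Dropout(0.5)
x = tf.ones((1, 10))

train_out = layer(x, training=True)    # roughly half the units zeroed, the rest scaled to 2
infer_out = layer(x, training=False)   # passes through unchanged
print(train_out.numpy())
print(infer_out.numpy())
```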