keras重载继续训练的问题

问题

colab的时间有限额,被中断后,要重新连接,加载模型继续训练。出现的问题是,每次重新加载模型后,训练开始的loss都会比中断前的loss大很大,训练几个batch后,loss会慢慢降下来。

原因

重新加载的代码有问题,模型优化器状态被重初始化了。

原来错误的步骤
  • 定义并编译模型
  • 加载权重(load_weights)
  • 训练
  • 保存模型及权重
  • 连接中断
  • 重新连接,回到第一步

参考stackoverflow,

When to use?

If you’re using compile, surely it must be after load_model(). After all, you need a model to compile. (PS: load_model automatically compiles the model with the optimizer that was saved along with the model)

What does compile do?
Compile defines the loss function, the optimizer and the metrics. That’s all.
It has nothing to do with the weights and you can compile a model as many times as you want without causing any problem to pretrained weights.
You need a compiled model to train (because training uses the loss function and the optimizer). But it’s not necessary to compile a model for predicting.

Do you need to use compile more than once?
Only if:

  • You want to change one of these:
    - Loss function
    - Optimizer / Learning rate
    - Metrics
    - The trainable property of some layer
  • You loaded (or created) a model that is not compiled yet. Or your load/save method didn’t consider the previous compilation.

Consequences of compiling again:
If you compile a model again, you will lose the optimizer states. This means that your training will suffer a little at the beginning until it adjusts the learning rate, the momentums, etc. But there is absolutely no damage to the weights (unless, of course, your initial learning rate is so big that the first training step wildly changes the fine tuned weights).

错误点在于:每次重新编译了模型,然后加载的是权重。编译后模型优化器的初始状态发生了变化。

正确的步骤
  • 定义模型(第一次训练要编译模型)
  • 加载整个模型(keras.models.load_model)
  • 训练
  • 保存模型及权重
  • 连接中断
  • 重新连接,回到第一步

这个问题比较另类,只是自己记录下。

自定义metrics出现的问题

模型定义中的代码,

def r_square(y_true, y_pred):
	from keras import backend as K
	SS_res =  K.sum(K.square(y_true - y_pred)) 
	SS_tot = K.sum(K.square(y_true - K.mean(y_true))) 
	return (1 - SS_res/(SS_tot + K.epsilon()))

model.compile(loss = "mse", optimizer = opt, metrics = [r_square])

加载模型中出现报错,

ValueError: Unknown metric function: r_square. Please ensure this object is passed to the `custom_objects` argument. See https://www.tensorflow.org/guide/keras/save_and_serialize#registering_the_custom_object for details.

原因是使用了自定义的metric函数,参看stackoverflow,和tensorflow官方文档
需要使用custome_objects参数,解决方法如下,

model = keras.models.load_model(model_file, custom_objects={"r_square":r_square})
  • 1
    点赞
  • 3
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值