Building Models with TensorFlow (Saving and Loading Models), Part 4

Latest recommended article published on 2023-09-01 10:50:19

superY25

Category: Artificial Intelligence. Tags: tensorflow, model building, ModelCheckpoint

Article link: https://blog.csdn.net/superY_26/article/details/124019726

Overview

This article covers how to save and load a model while it is being built and trained.
It relies mainly on TensorFlow's tf.keras.callbacks.ModelCheckpoint callback.

Content

import os
import tensorflow as tf
from tensorflow import keras

print(tf.version.VERSION)

# Load the data used to train the model
(train_images, train_labels), (test_images, test_labels) = tf.keras.datasets.mnist.load_data()

train_labels = train_labels[:1000]
test_labels = test_labels[:1000]

train_images = train_images[:1000].reshape(-1, 28 * 28) / 255.0
test_images = test_images[:1000].reshape(-1, 28 * 28) / 255.0

# Define a model
def create_model():
  model = tf.keras.models.Sequential([
    keras.layers.Dense(512, activation='relu', input_shape=(784,)),
    keras.layers.Dropout(0.2),
    keras.layers.Dense(10)
  ])

  model.compile(optimizer='adam',
                loss=tf.losses.SparseCategoricalCrossentropy(from_logits=True),
                metrics=[tf.metrics.SparseCategoricalAccuracy()])

  return model

# Create a basic model instance
model = create_model()
# Display the model's architecture
model.summary()
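
Because the last Dense layer has no activation and the loss is built with from_logits=True, the model outputs raw logits rather than probabilities. The following is a minimal NumPy sketch of turning logits into class probabilities; the softmax helper and the sample values are illustrative assumptions, not part of the original post.

```python
import numpy as np

def softmax(logits):
    """Convert raw logits to probabilities along the last axis."""
    # Subtract the row maximum for numerical stability.
    shifted = logits - np.max(logits, axis=-1, keepdims=True)
    exp = np.exp(shifted)
    return exp / np.sum(exp, axis=-1, keepdims=True)

# Made-up logits for two samples and three classes.
logits = np.array([[2.0, 1.0, 0.1],
                   [0.0, 0.0, 0.0]])
probs = softmax(logits)

print(probs.sum(axis=-1))     # each row sums to 1
print(probs.argmax(axis=-1))  # predicted class index per sample
```

The predicted class is the same whether you take the argmax of the logits or of the probabilities, so this conversion is only needed when you want actual probability values.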

During training, we save the weights by defining a tf.keras.callbacks.ModelCheckpoint callback.

checkpoint_path = "training_1/cp.ckpt"
checkpoint_dir = os.path.dirname(checkpoint_path)

# Create a callback that saves the model's weights
cp_callback = tf.keras.callbacks.ModelCheckpoint(filepath=checkpoint_path,
                                                 save_weights_only=True,
                                                 verbose=1)

# Train the model with the new callback
model.fit(train_images, 
          train_labels,  
          epochs=10,
          validation_data=(test_images, test_labels),
          callbacks=[cp_callback]) 
# This may generate warnings related to saving the state of the optimizer.
# These warnings (and similar warnings throughout this notebook)
# are in place to discourage outdated usage, and can be ignored.

This creates a set of TensorFlow checkpoint files that are updated at the end of every epoch.

os.listdir(checkpoint_dir) # ['checkpoint', 'cp.ckpt.index', 'cp.ckpt.data-00000-of-00001']
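
To see where these three kinds of files come from, here is a small sketch that writes a checkpoint by hand. It uses tf.train.Checkpoint directly instead of the Keras callback, which is a swapped-in mechanism chosen only to keep the example short; the single variable standing in for model weights and the temporary directory are assumptions for illustration.

```python
import os
import tempfile

import tensorflow as tf

# Hypothetical stand-in for a model's weights: one small variable.
weights = tf.Variable([1.0, 2.0, 3.0])
ckpt = tf.train.Checkpoint(weights=weights)

ckpt_dir = tempfile.mkdtemp()
prefix = os.path.join(ckpt_dir, "cp")

# save() appends a counter to the prefix and also writes the
# 'checkpoint' bookkeeping file that records the latest save.
saved_path = ckpt.save(file_prefix=prefix)
files = sorted(os.listdir(ckpt_dir))
print(files)

# latest_checkpoint reads the bookkeeping file to find the newest prefix.
latest = tf.train.latest_checkpoint(ckpt_dir)
print(latest)

# Restore the saved values into a fresh variable.
restored = tf.Variable([0.0, 0.0, 0.0])
tf.train.Checkpoint(weights=restored).restore(latest)
print(restored.numpy())
```

The .index file maps variable names to their location, the .data-* shard holds the values, and the checkpoint file is what tf.train.latest_checkpoint consults.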

As long as two models have the same architecture, they can share saved weights. The following test demonstrates this:

# Build a fresh model and evaluate it directly; its accuracy is at chance level because the weights are still random
# Create a basic model instance
model = create_model()

# Evaluate the model
loss, acc = model.evaluate(test_images, test_labels, verbose=2)
print("Untrained model, accuracy: {:5.2f}%".format(100 * acc))  # Untrained model, accuracy:  4.10%

# Loads the weights
model.load_weights(checkpoint_path)

# Re-evaluate the model
loss, acc = model.evaluate(test_images, test_labels, verbose=2)
print("Restored model, accuracy: {:5.2f}%".format(100 * acc)) # Restored model, accuracy: 86.40%
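
The same idea can be checked in memory, without touching the file system. This is a sketch under assumptions: the tiny build_small_model architecture and the random input are made up for illustration, and get_weights/set_weights are used in place of the checkpoint file.

```python
import numpy as np
import tensorflow as tf

def build_small_model():
    # Hypothetical tiny architecture; only the matching structure matters.
    return tf.keras.Sequential([
        tf.keras.Input(shape=(4,)),
        tf.keras.layers.Dense(8, activation="relu"),
        tf.keras.layers.Dense(3),
    ])

source = build_small_model()
target = build_small_model()

x = np.random.rand(5, 4).astype("float32")

# Two freshly initialized models normally disagree on their outputs.
same_before = np.allclose(source.predict(x, verbose=0),
                          target.predict(x, verbose=0))

# Copy the weights across; shapes line up because the structures match.
target.set_weights(source.get_weights())
same_after = np.allclose(source.predict(x, verbose=0),
                         target.predict(x, verbose=0))

print(same_before, same_after)
```

If the two architectures differed, for example in the number of units of a layer, set_weights would raise an error because the weight shapes would no longer match.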

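tf.keras.callbacks.ModelCheckpoint accepts several parameters that let you control how and when checkpoints are written, for example a unique file name per checkpoint and a custom save frequency.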

# Include the epoch in the file name (uses `str.format`)
checkpoint_path = "training_2/cp-{epoch:04d}.ckpt"
checkpoint_dir = os.path.dirname(checkpoint_path)

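import math

batch_size = 32

# An integer save_freq counts batches, not epochs, so compute batches per epoch
n_batches = math.ceil(len(train_images) / batch_size)

# Create a callback that saves the model's weights every 5 epochs
cp_callback = tf.keras.callbacks.ModelCheckpoint(
    filepath=checkpoint_path, 
    verbose=1, 
    save_weights_only=True,
    save_freq=5*n_batches) # save once every 5 epochs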

# Create a new model instance
model = create_model()

# Save the weights using the `checkpoint_path` format
model.save_weights(checkpoint_path.format(epoch=0))

# Train the model with the new callback
model.fit(train_images, 
          train_labels,
          epochs=50, 
          batch_size=batch_size, 
          callbacks=[cp_callback],
          validation_data=(test_images, test_labels),
          verbose=0)

# Get the most recent checkpoint.
latest = tf.train.latest_checkpoint(checkpoint_dir)

# Create a new model instance
model = create_model()

# Load the previously saved weights
model.load_weights(latest)

# Re-evaluate the model
loss, acc = model.evaluate(test_images, test_labels, verbose=2)
print("Restored model, accuracy: {:5.2f}%".format(100 * acc)) # Restored model, accuracy: 87.80%
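
When save_freq is an integer, ModelCheckpoint counts batches rather than epochs, so "every 5 epochs" has to be expressed as a number of batches. The sketch below works through that arithmetic and the file name pattern for this example's numbers (1000 training samples, batch size 32); the two helper functions are hypothetical names introduced for illustration.

```python
import math

def batches_per_epoch(num_samples, batch_size):
    # The last, partially filled batch still counts as one batch.
    return math.ceil(num_samples / batch_size)

def save_freq_for_epochs(num_samples, batch_size, every_n_epochs):
    # Integer save_freq is measured in batches seen by the callback.
    return every_n_epochs * batches_per_epoch(num_samples, batch_size)

n_batches = batches_per_epoch(1000, 32)
save_freq = save_freq_for_epochs(1000, 32, every_n_epochs=5)
print(n_batches, save_freq)  # 32 160

# The {epoch:04d} placeholder is filled in with str.format.
pattern = "training_2/cp-{epoch:04d}.ckpt"
names = [pattern.format(epoch=e) for e in (0, 5, 50)]
print(names)
```

Note that the batch size and the number of batches per epoch are both 32 here, which is a coincidence of the numbers chosen; with a different dataset size or batch size the two would differ.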
