Transfer learning.
Create a new DNN that reuses all the pretrained hidden layers of the previous model, freezes them, and replaces the softmax output layer with a fresh new one.
Train this new DNN on digits 5 to 9, using only 100 images per digit, and time how long it takes. Despite this small number of examples, can you achieve high precision?
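One way to build this model with tf.keras is sketched below. The file name my_mnist_model_0_to_4.h5 for the model pretrained on digits 0 to 4 is an assumption (any saved Keras model trained on digits 0 to 4 would do); the essential steps are reusing every layer except the old output layer, freezing the reused layers, and adding a fresh 5-unit softmax output.

# Load the model pretrained on digits 0 to 4 (hypothetical file name).
pretrained_model = tf.keras.models.load_model("my_mnist_model_0_to_4.h5")

# Reuse every layer except the old softmax output, and freeze them all.
new_model = tf.keras.models.Sequential(pretrained_model.layers[:-1])
for layer in new_model.layers:
    layer.trainable = False

# Fresh softmax output layer: 5 classes (digits 5 to 9, remapped to 0 to 4).
new_model.add(tf.keras.layers.Dense(5, activation="softmax"))

new_model.compile(loss="sparse_categorical_crossentropy",
                  optimizer=tf.keras.optimizers.SGD(learning_rate=1e-3),
                  metrics=["accuracy"])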
import tensorflow as tf
import numpy as np
from datetime import datetime
import os
import time
def shuffle_batch(X, y, batch_size):
    # Shuffle the indices once per epoch, then yield mini-batches.
    rnd_idx = np.random.permutation(len(X))
    n_batches = len(X) // batch_size
    for batch_idx in np.array_split(rnd_idx, n_batches):
        X_batch, y_batch = X[batch_idx], y[batch_idx]
        yield X_batch, y_batch
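Used on its own (once the arrays loaded below exist), this generator can drive a manual training loop; the batch size here is illustrative:

for X_batch, y_batch in shuffle_batch(X_train, y_train, batch_size=20):
    pass  # one training step per mini-batch would go here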
# Load MNIST, flatten each image to 784 features, and scale pixels to [0, 1].
(X_train, y_train), (X_test, y_test) = tf.keras.datasets.mnist.load_data()
X_train = X_train.astype(np.float32).reshape(-1, 28*28) / 255.0
X_test = X_test.astype(np.float32).reshape(-1, 28*28) / 255.0
y_train = y_train.astype(np.int32)
y_test = y_test.astype(np.int32)

# Set aside the first 5,000 training images as a validation set.
X_valid, X_train = X_train[:5000], X_train[5000:]
y_valid, y_train = y_train[:5000], y_train[5000:]
# Keep only digits 5 to 9, and remap their labels to the range 0 to 4.
X_train = X_train[y_train > 4]
y_train = y_train[y_train > 4] - 5
X_valid = X_valid[y_valid > 4]
y_valid = y_valid[y_valid > 4] - 5
X_test = X_test[y_test > 4]
y_test = y_test[y_test > 4] - 5
# Shuffle, then keep only 100 training images per digit (500 in total),
# as the exercise requires.
rnd_idx = np.random.permutation(len(X_train))
X_train, y_train = X_train[rnd_idx], y_train[rnd_idx]
keep_idx = np.concatenate([np.where(y_train == label)[0][:100]
                           for label in np.unique(y_train)])
X_train, y_train = X_train[keep_idx], y_train[keep_idx]
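With the reduced training set in place, the new DNN can be trained and timed. Below is a minimal sketch that assumes the new_model built earlier and reuses the shuffle_batch helper; n_epochs and batch_size are illustrative choices, and test-set accuracy is used as the measure of how well the model does on the question asked above.

# Train the frozen-feature model on the 500-image training set and time it.
n_epochs = 100
batch_size = 20

t0 = time.time()
for epoch in range(n_epochs):
    for X_batch, y_batch in shuffle_batch(X_train, y_train, batch_size):
        new_model.train_on_batch(X_batch, y_batch)
print("Total training time: {:.1f}s".format(time.time() - t0))

# Evaluate on the digits 5-9 test set.
test_loss, test_acc = new_model.evaluate(X_test, y_test, verbose=0)
print("Test accuracy: {:.2%}".format(test_acc))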