mnist学习实例(2)

最新推荐文章于 2022-08-18 09:22:22 发布

安安爸Chris

最新推荐文章于 2022-08-18 09:22:22 发布

阅读量252

点赞数

分类专栏：深度学习文章标签： mnist

本文链接：https://blog.csdn.net/mimiduck/article/details/115097188

版权

深度学习专栏收录该内容

28 篇文章 1 订阅

订阅专栏

环境: Ubuntu 18.04, tensorflow 2.4.1

该版本是全连接网络的优化版本，采用了卷积神经网络，参考。
可以看到BATCH_SIZE没有改动，而EPOCH明显减少。
但是EPOCH一轮的时间明显增长了很多。

代码

import tensorflow as tf
import numpy as np
from tensorflow import keras

EPOCH = 15
BATCH_SIZE = 128
VERBOSE = 1
NUM_CLASSES = 10


# the data, split between train and test sets
(x_train, y_train), (x_test, y_test) = keras.datasets.mnist.load_data()

# Scale images to the [0, 1] range
x_train = x_train.astype("float32") / 255
x_test = x_test.astype("float32") / 255
# Make sure images have shape (28, 28, 1)
x_train = np.expand_dims(x_train, -1)
x_test = np.expand_dims(x_test, -1)
print("x_train shape:", x_train.shape)
print(x_train.shape[0], "train samples")
print(x_test.shape[0], "test samples")

# convert class vectors to binary class matrices
y_train = keras.utils.to_categorical(y_train, NUM_CLASSES)
y_test = keras.utils.to_categorical(y_test, NUM_CLASSES)


## build network
model = tf.keras.Sequential(
    [
        tf.keras.Input(shape=(28, 28, 1)),
        tf.keras.layers.Conv2D(32, kernel_size=(3, 3), activation="relu"),
        tf.keras.layers.MaxPooling2D(pool_size=(2, 2)),
        tf.keras.layers.Conv2D(64, kernel_size=(3, 3), activation="relu"),
        tf.keras.layers.MaxPooling2D(pool_size=(2, 2)),
        tf.keras.layers.Flatten(),
        tf.keras.layers.Dropout(0.5),
        tf.keras.layers.Dense(NUM_CLASSES, activation="softmax"),
    ]
)

model.summary()

## compile network
model.compile(loss="categorical_crossentropy", optimizer="adam", metrics=["accuracy"])
model.fit(x_train, y_train, batch_size=BATCH_SIZE, epochs=EPOCH, validation_split=0.1)

## validation  0.991
val_loss,val_acc = model.evaluate(x_test, y_test, verbose=VERBOSE)
print("Test loss: ", val_loss)
print("Test accuracy: ", val_acc)

model summary解读

model summary

输入层是（，28，28，1）4维张量，分别是数量，长度，宽度，通道（灰度）

第一层conv2d

kernel_size=(3,3)
因为输入时长宽相等，我们就看一边。没有指定strides，默认为1，
$(28 - 3) / 1 + 1 = 26$
filter=32
表示有32个kernel，每个kernel在原张量上卷积后，都会有一个结果，所以最后一维就是32.
这个就是第一层conv2dshape（,26,26,32)的由来。
param个数
$((核宽 * 核高) * 通道数 + 1) * 卷积核数$ 注：+1是因为一个bias
$((3 * 3) * 1 + 1) * 32 = 320$

第二层pooling

它主要是做收缩，所以不产生新的param。
pool_size=(2, 2)，所以shape变成 （,13,13,32)

第三层conv2d

kernel_size=(3, 3)
$(13 - 3) / 1 + 1 = 11$
filter=64
所以shape为（,11,11,64)
param个数
$（（核宽 * 核高） * 通道数 + 1 ） * 卷积核数$ 注：这一层的通道数就是上层卷积核数
$((3 * 3) * 32 + 1) * 64 = 18496$