改善手写数字识别的准确率过拟合

最新推荐文章于 2022-04-01 22:36:21 发布

仓鼠球球－O－

最新推荐文章于 2022-04-01 22:36:21 发布

阅读量512

点赞数

本文链接：https://blog.csdn.net/weixin_43873671/article/details/113353237

版权

该博客介绍了如何通过在神经网络中添加L1正则化和Dropout层来缓解过拟合问题。首先，代码展示了如何在Keras中构建一个包含正则化的两层全连接网络，接着应用Dropout层以随机关闭一部分神经元。模型经过编译和训练后，在MNIST数据集上进行了评估，结果显示了正则化和Dropout对提高模型泛化能力的效果。

摘要由CSDN通过智能技术生成

在隐藏层添加正则化，并使得部分神经元丧失功能，可以改善准确率过拟合

network.add(layers.Dense(units=128, activation='relu', input_shape=(28*28, ),kernel_regularizer=regularizers.l1(0.0001)))# L1范式正则
network.add(layers.Dropout(0.01)) #以百分之一的概率使得神经元丧失功能

全部代码

#第二次优化识别，改善过拟合（添加正则化，且使部分网络丧失功能）
from keras.utils import to_categorical
from keras import models, layers, regularizers
from keras.optimizers import RMSprop
from keras.datasets import mnist
import matplotlib.pyplot as plt
# import torch
#加载数据集
(train_images,train_labels),(test_images,test_labels)=mnist.load_data()#归一化处理，注意必须进行归一化操作，否则准确率非常低，图片和标签
# print(train_images.shape,test_images.shape)
# print(train_images[0])
# print(train_labels[0])
# plt.figure()
# plt.imshow(train_images[0])
# plt.show()
#将图片由二维铺开成一维
train_images=train_images.reshape((60000,28*28)).astype('float')  #将28*28的二维数组转变为784的一维数组，浮点数类型
test_images=test_images.reshape((10000,28*28)).astype ('float')
train_labels=to_categorical(train_labels)  #to_categorical就是将类别向量转换为二进制（只有0和1）的矩阵类型表示
test_labels=to_categorical(test_labels)
#print(train_labels[0])

#搭建神经网络
network = models.Sequential()   #选用的是Sequential 序贯模型
network.add(layers.Dense(units=128, activation='relu', input_shape=(28*28, ),kernel_regularizer=regularizers.l1(0.0001)))#添加一个(隐藏层)全连接层，神经元为15，激活函数是relu线性整流函数,输入形状为28*28
network.add(layers.Dropout(0.01)) #以百分之一的概率使得神经元丧失功能
network.add(layers.Dense(units=32, activation='relu',kernel_regularizer=regularizers.l1(0.0001)))#添加一个(隐藏层)全连接层，神经元为15，激活函数是relu线性整流函数,输入形状为28*28
network.add(layers.Dropout(0.01))
network.add(layers.Dense(units=10, activation='softmax'))#添加一个(输出层)全连接层，神经元为10，激活函数为softmax(Softmax 具有更好的解释性，包含属于猫的这一类的特征越多，输出为猫的概率就越大)

print(network.summary())  #查看神经网络model结构


#神经网络的训练
# 编译步骤，损失函数是模型优化的目标，优化器使用RMSporp,学习率为0.001，损失函数是categorical_crossentropy，评价函数为accuracy准确率
network.compile(optimizer=RMSprop(lr=0.001), loss='categorical_crossentropy', metrics=['accuracy'])
# 训练网络，用fit函数（model.fit()方法用于执行训练过程）, epochs表示训练多少个回合， batch_size表示每次训练给多大的数据，一次训练所选取的样本数。
network.fit(train_images, train_labels, epochs=22, batch_size=128, verbose=2)  #verbose：日志显示 0 为不在标准输出流输出日志信息 1 为输出进度条记录2 为每个epoch输出一行记录
print(network.summary())  #查看神经网络model结构

# 测试集上测试模型性能
y_pre = network.predict(test_images[:5])  #预测前五张图片的，model.predict 实际预测，其输出是目标值，根据输入数据预测。
print(y_pre, test_labels[:5])
test_loss, test_accuracy = network.evaluate(test_images, test_labels)  #model.evaluate函数预测给定输入的输出
print("test_loss:", test_loss, "    test_accuracy:", test_accuracy)

更改之前

在这里插入图片描述

更改之后

仓鼠球球－O－

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
打赏
0
评论
改善手写数字识别的准确率过拟合

在隐藏层添加正则化，并使得部分神经元丧失功能，可以改善准确率过拟合network.add(layers.Dense(units=128, activation='relu', input_shape=(28*28, ),kernel_regularizer=regularizers.l1(0.0001)))# L1范式正则network.add(layers.Dropout(0.01)) #以百分之一的概率使得神经元丧失功能全部代码#第二次优化识别，改善过拟合（添加正则化，且使部分网络丧失功能）
复制链接

扫一扫