逻辑回归softmax神经网络实现手写数字识别(cs)

最新推荐文章于 2024-04-27 15:32:49 发布

marsggbo

最新推荐文章于 2024-04-27 15:32:49 发布

阅读量2.5k

点赞数

文章标签：神经网络机器学习 python softmax 逻辑回归

本文链接：https://blog.csdn.net/marsggbo/article/details/77506740

版权

本文详细介绍了使用逻辑回归softmax神经网络实现手写数字识别的过程，包括数据预处理、算法原理、模型训练、前向反向传播、验证测试以及梯度下降算法的推导。通过TensorFlow读取MNIST数据集，将数据分为训练集、验证集和测试集，并展示了部分手写数字。最后，作者测试了模型性能，发现识别准确性有待提升。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

逻辑回归softmax神经网络实现手写数字识别全过程

项目已上传至github：marsggbo/MLinAction

1 - 导入模块

import numpy as np
import matplotlib.pyplot as plt
from ld_mnist import load_digits

%matplotlib inline

2 - 导入数据及数据预处理

mnist = load_digits()

Extracting C:/Users/marsggbo/Documents/Code/ML/TF Tutorial/data/MNIST_data\train-images-idx3-ubyte.gz
Extracting C:/Users/marsggbo/Documents/Code/ML/TF Tutorial/data/MNIST_data\train-labels-idx1-ubyte.gz
Extracting C:/Users/marsggbo/Documents/Code/ML/TF Tutorial/data/MNIST_data\t10k-images-idx3-ubyte.gz
Extracting C:/Users/marsggbo/Documents/Code/ML/TF Tutorial/data/MNIST_data\t10k-labels-idx1-ubyte.gz

print("Train: "+ str(mnist.train.images.shape))
print("Train: "+ str(mnist.train.labels.shape))
print("Test: "+ str(mnist.test.images.shape))
print("Test: "+ str(mnist.test.labels.shape))

Train: (55000, 784)
Train: (55000, 10)
Test: (10000, 784)
Test: (10000, 10)

mnist数据采用的是TensorFlow的一个函数进行读取的，由上面的结果可以知道训练集数据X_train有55000个，每个X的数据长度是784（28*28）。

另外由于数据集的数量较多，所以TensorFlow提供了批量提取数据的方法，从而大大提高了运行速率，方法如下：

x_batch, y_batch = mnist.train.next_batch(100)
print(x_batch.shape)
print(y_batch.shape)

>>>
(100, 784)
(100, 10)

x_train, y_train, x_test, y_test = mnist.train.images, mnist.train.labels, mnist.test.images, mnist.test.labels

因为训练集的数据太大，所以可以再划分成训练集，验证集，测试集，比例为6:2:2

x_train_batch, y_train_batch = mnist.train.next_batch(30000)
x_cv_batch, y_cv_batch = mnist.train.next_batch(15000)
x_test_batch, y_test_batch = mnist.train.next_batch(10000)
print(x_train_batch.shape)
print(y_cv_batch.shape)
print(y_test_batch.shape)

(30000, 784)
(15000, 10)
(10000, 10)

展示手写数字

nums = 6
for i in range(1,nums+1):
    plt.subplot(1,nums,i)
    plt.imshow(x_train[i].reshape(28,28), cmap="gray")

这里写图片描述

3 - 算法介绍

3.1 算法

对单个样本数据 $x^{(i)}$ :

z (i) = w T x (i) + b (1)

最低0.47元/天解锁文章