Multilayer Perceptron
多层感知机（一个隐藏层）
代码 (Code)
import d2lzh as d2l
from mxnet import gluon, init, autograd
from mxnet.gluon import loss as gloss, nn
# Train a multilayer perceptron with one hidden layer on Fashion-MNIST.
batch_size = 256
train_iter, test_iter = d2l.load_data_fashion_mnist(batch_size)

# Model: flatten 28x28 image -> 256-unit ReLU hidden layer -> 10-way output.
net = nn.Sequential()
net.add(nn.Flatten())
net.add(nn.Dense(256, activation='relu'))
net.add(nn.Dense(10))
net.initialize(init.Normal(sigma=0.01))  # small Gaussian weight init

# Loss: fused softmax + cross-entropy (numerically stabler than doing them separately).
loss = gloss.SoftmaxCrossEntropyLoss()
# Optimizer: plain SGD, learning rate 0.1.
trainer = gluon.Trainer(net.collect_params(), 'sgd', {'learning_rate': 0.1})

num_epochs = 5
for epoch in range(num_epochs):
    train_l_sum, train_acc_sum, n = 0.0, 0.0, 0
    for X, y in train_iter:
        # Record the forward pass so autograd can differentiate it.
        with autograd.record():
            y_hat = net(X)
            l = loss(y_hat, y).sum()
        l.backward()  # backpropagate
        # step(batch_size) divides the summed gradient by the batch size.
        trainer.step(batch_size)
        y = y.astype('float32')  # labels come in as int; cast for the == compare
        train_l_sum += l.asscalar()
        train_acc_sum += (y_hat.argmax(axis=1) == y).sum().asscalar()
        n += y.size
    # Evaluate accuracy on the test set after each epoch.
    test_acc_sum, test_n = 0.0, 0
    for test_X, test_y in test_iter:
        test_y = test_y.astype('float32')
        test_acc_sum += (net(test_X).argmax(axis=1) == test_y).sum().asscalar()
        test_n += test_y.size
    test_acc = test_acc_sum / test_n
    print('epoch {}, loss {:.4f}, train acc {:.3f}, test_acc {:.3f}'.format(
        epoch + 1, train_l_sum / n, train_acc_sum / n, test_acc))
结果 (Results)
epoch 1, loss 1.0439, train acc 0.636, test_acc 0.758
epoch 2, loss 0.6027, train acc 0.788, test_acc 0.820
epoch 3, loss 0.5202, train acc 0.818, test_acc 0.837
epoch 4, loss 0.4866, train acc 0.829, test_acc 0.844
epoch 5, loss 0.4600, train acc 0.839, test_acc 0.849