动手学深度学习Pytorch 4.4练习

最新推荐文章于 2024-07-26 01:45:52 发布

听微雨

最新推荐文章于 2024-07-26 01:45:52 发布

阅读量300

点赞数

文章标签：深度学习 pytorch 人工智能

本文链接：https://blog.csdn.net/m0_54136420/article/details/132190079

版权

1.这个多项式回归问题可以准确地解出吗？提⽰：使⽤线性代数。
可以,把多项式问题，用matlab的str2sym表示出来，再用solve求解。

2.考虑多项式的模型选择。

1. 绘制训练损失与模型复杂度（多项式的阶数）的关系图。观察到了什么？需要多少阶的多项式才能将训练损失减少到0?
画图代码（阶数1-100）：

# 记得把max_degree改为100

# 把train改成这个函数
def trainLossComplex(train_features,test_features,train_labels,test_labels,
          num_epochs=1000):
    loss=nn.MSELoss(reduction='none')
    input_shape=train_features.shape[-1]
    # 不设置偏置，因为我们已经在多项式中实现了它
    net = nn.Sequential(nn.Linear(input_shape,1,bias=False))
    batch_size=min(10,train_labels.shape[0])
    train_features=torch.tensor(train_features, dtype=torch.float32)
    test_features=torch.tensor(test_features, dtype=torch.float32)
    train_iter=d2l.load_array((train_features,train_labels.reshape(-1,1)),
                            batch_size)
    test_iter=d2l.load_array((test_features,test_labels.reshape(-1,1)),batch_size,
                            is_train=False)
    trainer=torch.optim.SGD(net.parameters(),lr=0.01)
    for epoch in range(num_epochs):
        d2l.train_epoch_ch3(net,train_iter,loss,trainer)
    return evaluate_loss(net,train_iter,loss),evaluate_loss(net,test_iter,loss)

trainLoss=[]
textLoss=[]
x=np.arange(1,100)
for i in np.arange(1,100):
    train_loss,text_loss=trainLossComplex(poly_features[:n_train, :i], poly_features[n_train:,:i],
        labels[:n_train],labels[n_train:])
    trainLoss.append(train_loss)
    textLoss.append(text_loss)

d2l.plot(x, y, xlabel='degree', ylabel='train_loss', legend=None, xlim=None,
         ylim=[1e-3+0.007,1*1e-2+0.005], xscale='linear', yscale='linear',
         fmts=('-', 'm--', 'g-.', 'r:'), figsize=(3.5, 2.5), axes=None)

在这里插入图片描述
从图中看，1-100的阶数的多项式都不能把训练损失减少到0

2. 在这种情况下绘制测试的损失图。

d2l.plot(x, textLoss, xlabel='degree', ylabel='train_loss', legend=None, xlim=None,
         ylim=[1e-3,1.2], xscale='linear', yscale='linear',
         fmts=('-', 'm--', 'g-.', 'r:'), figsize=(3.5, 2.5), axes=None)