Pytorch学习笔记：model.train()和model.eval()

最新推荐文章于 2024-07-20 17:13:41 发布

豆爸OS

最新推荐文章于 2024-07-20 17:13:41 发布

阅读量1.6k

点赞数 1

分类专栏： Pytorch学习笔记文章标签： pytorch 深度学习机器学习

本文链接：https://blog.csdn.net/weixin_45383706/article/details/122886251

版权

Pytorch学习笔记专栏收录该内容

6 篇文章

订阅专栏

文章目录

前言
1.model.train()
2.model.eval()
总结

前言

训练时的model.train()和测试时的model.eval()分别做了什么？

1.model.train()

源码如下：

def train(self: T, mode: bool = True) -> T:
    r"""Sets the module in training mode.

    This has any effect only on certain modules. See documentations of
    particular modules for details of their behaviors in training/evaluation
    mode, if they are affected, e.g. :class:`Dropout`, :class:`BatchNorm`,
    etc.

    Args:
        mode (bool): whether to set training mode (``True``) or evaluation
                     mode (``False``). Default: ``True``.

    Returns:
        Module: self
    """
    self.training = mode
    for module in self.children():
        module.train(mode)
    return self

2.model.eval()

源码如下：

def eval(self: T) -> T:
    r"""Sets the module in evaluation mode.

    This has any effect only on certain modules. See documentations of
    particular modules for details of their behaviors in training/evaluation
    mode, if they are affected, e.g. :class:`Dropout`, :class:`BatchNorm`,
    etc.

    This is equivalent with :meth:`self.train(False) <torch.nn.Module.train>`.

    Returns:
        Module: self
    """
    return self.train(False)

总结

提示：这里对文章进行总结：

简单来说，是设置了训练或者测试模式，定义模型是否需要学习。对部分层有影响，如Dropout和BN。具体影响如下：
Dropout: 训练过程中，为防止模型过拟合，增加其泛化性，会随机屏蔽掉一些神经元，相当于输入每次走过不同的“模型”。测试模式时，所有神经元共同作用，类似于boosting。
BN: 训练过程中，模型每次处理一个minibatch数据，BN根据一个minibatch来计算mean和std后做归一化处理，这也是为什么模型的性能和minibatch的大小关系很大（后续也有系列文章来解决BN在小minibatch下表现不佳的问题）。测试时，BN会利用训练时得到的参数来处理测试数据。如果不设置model.eval()，输入单张图像，会报错。