RuntimeError: weight tensor should be defined either for all or no classes

翰墨大人

已于 2022-09-20 10:50:10 修改

阅读量4.6k

点赞数 3

分类专栏： pytorch踩坑文章标签：深度学习 python 人工智能

于 2022-09-20 10:46:37 首次发布

本文链接：https://blog.csdn.net/qq_43733107/article/details/126947581

版权

pytorch踩坑专栏收录该内容

33 篇文章 6 订阅

订阅专栏

今天在添加网络中添加深监督的时候遇到了这个错误，先看出现错误的位置，是在计算交叉熵的时候：

    def forward(self, inputs_scales, targets_scales):
        losses = []
        for inputs, targets in zip(inputs_scales, targets_scales):
            # mask = targets > 0
            targets_m = targets.clone() #深拷贝
            targets_m -= 1 #减去
            loss_all = self.ce_loss(inputs, targets_m.long())

在网上冲浪一个小时，发现了各路大神解决方案：参考方案

1：检查输出预测图与标签的尺寸是否一样

2：没有区分背景及边缘：

3：分割类别与分割标签种类不一样：

我想在深监督中，使用多尺度标签进行监督，在decoder中输出经过self.side_output,得到了一个通道为37的语义分割图。

self.side_output = nn.Conv2d(in_channels//2,
                                     out_channels=37,
                                     kernel_size=1)

在图像的预处理中，我们对标签进行缩放，使标签的大小和decoder输出的大小一样，这样才可以进行损失计算。

# 
class MultiScaleLabel:
    def __init__(self, downsampling_rates=None):
        if downsampling_rates is None:
            self.downsampling_rates = [8, 4, 2]
        else:
            self.downsampling_rates = downsampling_rates

    def __call__(self, sample):
        label = sample['label']

        h, w = label.shape

        sample['label_down'] = dict()

        # Nearest neighbor interpolation
        for rate in self.downsampling_rates:
            label_down = cv2.resize(label.numpy(), (w // rate, h // rate),
                                    interpolation=cv2.INTER_NEAREST)
            sample['label_down'][rate] = torch.from_numpy(label_down)

        return sample

而在NYUDepth-v2中，标签的类别为40，我的分割类别只有37，因此问题就出现了。即上面的解决办法3。

所以我们将输出的分割类别调整37为标签类别40就可以了。

self.side_output = nn.Conv2d(in_channels//2,
                                     out_channels=40,
                                     kernel_size=1)