在语义分割过程中,进行二分类和多分类时,如何使用dice系数以及相应的代码
1. dice系数的基础知识
参考链接:https://www.aiuai.cn/aifarm1159.html
2. 语义分割
参考链接:https://editor.csdn.net/md?not_checkout=1&spm=1000.2115.3001.4503&articleId=127461922
语义分割的目标是:将一张RGB图像或是灰度图作为输入,输出的是分割图,其中每一个像素包含了其类别的标签。
从图中可以发现,每一个类用相同的像素值表示,因此在设计多分类的dice系数时,可以采用对每个类(像素值),进行一次dice计算
3. 代码
二分类时,采用 Sigmoid 与 BCELoss 的组合
# Compute the Dice coefficient (binary segmentation).
def meandice(pred, label):
    """Return the Dice coefficient of a binarized prediction vs. its mask.

    Args:
        pred:  prediction tensor of shape (B, ...) with values in {0, 1}
               (e.g. after thresholding sigmoid probabilities at 0.5).
        label: ground-truth mask tensor, same shape as ``pred``, values in {0, 1}.

    Returns:
        Scalar tensor: (2*|P∩L| + eps) / (|P| + |L| + eps), aggregated over
        the whole batch. ``eps`` keeps the ratio defined (and equal to 1)
        when both prediction and label are empty.
    """
    smooth = 1e-6  # numerical-stability / empty-mask term
    # Flatten everything after the batch dimension; the sums below then
    # aggregate over the entire batch at once.
    pred_flat = pred.contiguous().view(pred.shape[0], -1)
    label_flat = label.contiguous().view(label.shape[0], -1)
    intersection = (pred_flat * label_flat).sum()
    return (2. * intersection + smooth) / (pred_flat.sum() + label_flat.sum() + smooth)
# Loss: BCELoss expects probabilities, so the model output must go through
# a Sigmoid first. (nn.BCEWithLogitsLoss on the raw logits would be the
# more numerically stable alternative.)
criterion = torch.nn.BCELoss()  # define loss and cross entropy

# Best-checkpoint bookkeeping.
bestdice = 0
bestepoch = 0

sigmoid = nn.Sigmoid()  # hoisted: no need to rebuild the module every batch
for epoch in range(1000):
    # ---- training ----
    # scheduler.step(epoch)
    cnt = 0
    losssum = 0
    model.train()
    for image, label in tqdm(dataloader):
        optimizer.zero_grad()
        image, label = image.to(device), label.to(device)
        # Collapse every non-target class to 0 so the mask is strictly {0, 1}.
        label[label != 1] = 0
        out = sigmoid(model(image))
        loss = criterion(out, label)
        loss.backward()
        optimizer.step()
        cnt += 1
        losssum += loss.item()  # .item() detaches: avoids retaining the graph
    print('Epoch {0},train_loss {1}'.format(epoch, losssum / cnt))

    # ---- validation ----
    with torch.no_grad():
        model.eval()
        print("validating....")
        for image_t, label_t in tqdm(dataloader_test):
            image_t, label_t = image_t.to(device), label_t.to(device)
            label_t[label_t != 1] = 0  # keep only the foreground class
            # BUG FIX: the original thresholded the RAW logits at 0.5, while
            # training thresholds probabilities — apply the sigmoid here too.
            out_t = sigmoid(model(image_t))
            # Threshold at 0.5 to obtain a hard {0, 1} mask; >= also covers
            # the value 0.5 exactly, which the original's >/< pair missed.
            pred_t = (out_t >= 0.5).float()
            rawdice = meandice(pred_t, label_t)
多分类时,采用 Softmax 与 CrossEntropyLoss(CELoss)的组合
# Compute the Dice coefficient (multi-class segmentation).
def meandice(pred, label, num_classes=4):
    """Return the mean Dice coefficient over foreground classes 1..num_classes.

    For each class id ``i`` the prediction and label maps are binarized
    (``pred == i`` / ``label == i``) and a per-class Dice score is computed;
    the scores are averaged over the ``num_classes`` foreground classes.

    Args:
        pred:  predicted label map of integer class ids (e.g. argmax output),
               shape (B, ...).
        label: ground-truth label map, same shape as ``pred``.
        num_classes: number of foreground classes to evaluate. Defaults to 4,
            preserving the original hard-coded ``range(1, 5)``.
            NOTE(review): the training code below says labels take values
            1..5, so callers there probably want ``num_classes=5`` — confirm
            against the dataset.

    Returns:
        Scalar tensor: mean per-class Dice. A class absent from both maps
        contributes a score of 1 (smooth/smooth).
    """
    sumdice = 0
    smooth = 1e-6  # numerical-stability / empty-class term
    for i in range(1, num_classes + 1):
        pred_bin = ((pred == i) * 1).contiguous().view(pred.shape[0], -1)
        label_bin = ((label == i) * 1).contiguous().view(label.shape[0], -1)
        intersection = (pred_bin * label_bin).sum()
        sumdice += (2. * intersection + smooth) / (pred_bin.sum() + label_bin.sum() + smooth)
    return sumdice / num_classes
for epoch in range(1000):
    # ---- training ----
    # scheduler.step(epoch)
    cnt = 0
    losssum = 0
    model.train()
    for image, label in tqdm(dataloader):
        optimizer.zero_grad()
        # CrossEntropyLoss expects integer class indices, hence .long().
        image, label = image.to(device), label.to(device).long()
        out = model(image)  # [B, 6, 256, 256]: 5 foreground classes + background
        # BUG FIX: nn.CrossEntropyLoss applies log-softmax internally, so the
        # RAW logits must be passed directly. The original applied nn.Softmax
        # first, which double-normalizes and degrades the gradients.
        loss = criterion(out, label)  # label [B, 256, 256], out [B, 6, 256, 256]
        loss.backward()
        optimizer.step()
        cnt += 1
        losssum += loss.item()  # .item() detaches: avoids retaining the graph
    print('Epoch {0},train_loss {1}'.format(epoch, losssum / cnt))
    # '''
    dicecat = []
    resultcat = []
    imgcat = []
    # ---- validation ----
    with torch.no_grad():
        model.eval()
        print("validating....")
        for image_t, label_t in tqdm(dataloader_test):
            image_t, label_t = image_t.to(device), label_t.to(device)
            out_t = model(image_t)  # [B, 6, 256, 256]
            # argmax over the channel dim yields a [B, 256, 256] map of class
            # ids 0..5; softmax is monotone, so it is unnecessary before argmax.
            prediction = torch.argmax(out_t, dim=1)
            rawdice = meandice(prediction, label_t)  # labels take values 1..5