论文阅读笔记：From Image-level to Pixel-level Labeling with Convolutional Networks

最新推荐文章于 2022-09-19 15:46:46 发布

忘泪

最新推荐文章于 2022-09-19 15:46:46 发布

阅读量442

点赞数

分类专栏：论文阅读文章标签： MIL Weakly-supervised Learning

本文链接：https://blog.csdn.net/wl1710582732/article/details/102550255

版权

论文阅读专栏收录该内容

10 篇文章 1 订阅

订阅专栏

论文阅读笔记：From Image-level to Pixel-level Labeling with Convolutional Networks

Introduce

Task

Weakly supervised learning for image semantic segmentation, only use image class label.

Contribution

combine MIL(Multiple Instance Learning) with CNN
performance state of the art

Framework

Train

在这里插入图片描述 For input image $I:3\times h \times w$ , pass a backbone(i.e. Overfeat + Segmentation Net), output feature maps $\times h^{o} \times w^{o}$ , then $Y$ pass a LSE(Log-Sum-Exp) pooling, output $\times 1 \times 1$ . Finally compute a softmax cross entrophy loss for $s$ , gradients backpropagation to train backbone.

Inference

在这里插入图片描述
$p_{i,j}(k)$ be the $Y$ for location $(i, j)$ and $k^{th}$ class label. ILP $p (k)$ be the $s$ by softmax.
$\widehat{y}_{i,j}=p_{i,j}(k|I) \times p(k|I)$
Finally, $\widehat{y}_{i,j}$ pass a interpolation to restore input image resolution. Then use a threshold(Smoothing Prior) to get the final segmentation results.

Log-Sum-Exp(LSE)

$s^k = \frac{1}{r}\log \left[ \frac{1}{h^o w^o} \sum\limits_{i.j} exp\left( r s_{i,j}^k \right)\right]$
LSE is a pooling method for $\times h^{o} \times w^{o}$ to $\times 1 \times 1$ , it is more smooth. When $s$ is high LSE similar to max pooling, $r$ low LSE similar to average pooling.

在这里插入图片描述
For accuracy, performance be more high compare to max pooling and sum pooling.

Summary

LSE is smooth pooling than max and average pooling. Maybe it is useful.

忘泪

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
论文阅读笔记：From Image-level to Pixel-level Labeling with Convolutional Networks

论文阅读笔记：From Image-level to Pixel-level Labeling with Convolutional NetworksIntroduceTaskWeakly supervised learning for image semantic segmentation, only use image class label.ContributionCombine...
复制链接

扫一扫