论文阅读笔记:From Image-level to Pixel-level Labeling with Convolutional Networks

论文阅读笔记:From Image-level to Pixel-level Labeling with Convolutional Networks

Introduce

Task

Weakly supervised learning for image semantic segmentation, only use image class label.

Contribution

  • combine MIL(Multiple Instance Learning) with CNN
  • performance state of the art

Framework

Train

在这里插入图片描述For input image I : 3 × h × w I:3\times h \times w I:3×h×w, pass a backbone(i.e. Overfeat + Segmentation Net), output feature maps Y : ( ∣ C ∣ + 1 ) × h o × w o Y:(|C|+1) \times h^{o} \times w^{o} Y:(C+1)×ho×wo, then Y Y Y pass a LSE(Log-Sum-Exp) pooling, output s : ( ∣ C ∣ + 1 ) × 1 × 1 s:(|C|+1) \times 1 \times 1 s:(C+1)×1×1. Finally compute a softmax cross entrophy loss for s s s, gradients backpropagation to train backbone.

Inference

在这里插入图片描述
p i , j ( k ) p_{i,j}(k) pi,j(k) be the Y Y Y for location ( i , j ) (i,j) (i,j) and k t h k^{th} kth class label. ILP p ( k ) p(k) p(k) be the s s s by softmax.
y ^ i , j = p i , j ( k ∣ I ) × p ( k ∣ I ) \widehat{y}_{i,j}=p_{i,j}(k|I) \times p(k|I) y i,j=pi,j(kI)×p(kI)
Finally, y ^ i , j \widehat{y}_{i,j} y i,j pass a interpolation to restore input image resolution. Then use a threshold(Smoothing Prior) to get the final segmentation results.

Log-Sum-Exp(LSE)

s k = 1 r log ⁡ [ 1 h o w o ∑ i . j e x p ( r s i , j k ) ] s^k = \frac{1}{r}\log \left[ \frac{1}{h^o w^o} \sum\limits_{i.j} exp\left( r s_{i,j}^k \right)\right] sk=r1log[howo1i.jexp(rsi,jk)]
LSE is a pooling method for Y : ( ∣ C ∣ + 1 ) × h o × w o Y:(|C|+1) \times h^{o} \times w^{o} Y:(C+1)×ho×wo to s : ( ∣ C ∣ + 1 ) × 1 × 1 s:(|C|+1) \times 1 \times 1 s:(C+1)×1×1, it is more smooth. When s s s is high LSE similar to max pooling, r r r low LSE similar to average pooling.

在这里插入图片描述
For accuracy, performance be more high compare to max pooling and sum pooling.

Summary

  • LSE is smooth pooling than max and average pooling. Maybe it is useful.
  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值