deeplabv3+ 笔记

Deeplab v3+
Abstract
(1)Deeplab v3+ adds a decoder modeule to refine the segmentation result along object boundaries.
(2)Explore the Xception model and apply the depthwise separable convolution to both Atrous Spatial Pyramid Pooling and decoder modules
(3)好句:encoder multi-scale contextual information by probing the incoming features with filters or pooling operations at multiple rates and multiple effective fields-of-view. Network can capture sharper object boundaries by recovering the spatial information
Introduce
(1)in order to capture the contextual information at multiple scales, deeplabv3 applies several parallel atrous conv with different rates(ASPP)
(2)Information related to object boundaries miss due to pool or conv with striding operations, So apply atrous conv can alleviate
(3)Deeplabv3+ contains rich segmentation information from encoder, (ASPP) , while recovery the object boundaries from decoder既能通过encoder的ASPP结构得到丰富的上下文信息,又能通过decoder来恢复物体的边缘空间信息。
在这里插入图片描述
Related work
(1)Spatial pyramind pooling: capture rich contextual information
(2)Encoder-decoder: recover the spatial information,refine the segmentation result
(3)Depthwise seperation conv (or group conv): reduce the computation cost and number of params
Method
在这里插入图片描述
(1)Image pooling是GAP, output是16倍的,Low-Level-Feature是resnet的conv2的特征图。
(2)Xception调整 backbone 中max pool instead of conv with striding operations, bn+relu after each 3x3 depthwise conv(like mobilenet)
在这里插入图片描述
注:图片全部来源于Deeplabv3+论文原文。

  • 1
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值