strided convolution vs pooling

最新推荐文章于 2022-10-06 16:17:58 发布

爆米花好美啊

最新推荐文章于 2022-10-06 16:17:58 发布

阅读量5k

点赞数 2

分类专栏：深度学习论文学习笔记文章标签： DeepLearning

本文链接：https://blog.csdn.net/u013010889/article/details/85635926

版权

深度学习同时被 2 个专栏收录

72 篇文章 9 订阅

订阅专栏

论文学习笔记

52 篇文章 3 订阅

订阅专栏

Striving for Simplicity: The All Convolutional Net

We find that max-pooling can simply be replaced by a convolutional layer with increased stride without loss in accuracy on several image recognition benchmarks’

在这里插入图片描述

只看baseline的model-C, conv+conv+max pool变换以下3种情况：

Strided-CNN-C: conv+ strided conv
ConvPool-CNN-C: conv+conv+conv+max pool
All-CNN-C: conv + conv+ strided conv

Strided-CNN-C与Model C对比，直接用strided conv代替conv+max pool效果是变差的，All-CNN-C与Model C对比，用conv+ strided conv代替conv+max pool效果是变好了，但是这样比Model C引入了更多的参数，于是又做了一个和All-CNN-C参数量一样的模型ConvPool-CNN-C，发现不如All-CNN-C

在这里插入图片描述

Deep Learning with Python

keras作者的观点

So the most reasonable subsampling strategy is to first produce
dense maps of features (via unstrided convolutions) and then look at the maximal activation of the features over small patches, rather than looking at sparser windows of the inputs (via strided convolutions) or averaging input patches, which could cause you to miss or dilute feature-presence information

第一个观点是下采样最好是用dense conv(stride=1)加max pool，而不是直接用一个strided conv(stride>1，类似上文的Strided-CNN-C)或者用average pool.
第二个观点是gan的D一般不用max pool，因为提供的梯度是稀疏的不利于指导G的学习(好的D是能够更多信息给G，而不是分类能力强)。max pool和ReLU都会造成梯度稀疏，gan里面一般用strided convolutions和LeakyReLU代替
其实这个和上一篇论文观点基本一致，就是分类网络里不要直接用strided convolution代替conv+max pool，要用的话也像上一篇文章说的那样conv+strided convolution。

其它

A：
Convolutions with stride>1 (which I believe is what you mean by conv subsampling) may work better for generative models (segmentation, style transfer, etc) since they can be better reversed.
For classification, combining max and average pooling seems to be best (although maxout may be even better), although I haven’t seen a thorough comparison vs subsampling.
B：
Both, strided convolution and pooling, summarize the data.
Pros of doing strided convolution compared to pooling:

You can learn how to summarize
Every pooling type as a strict equivalent convolution (sometimes there have to be multiple convolutional layers). Hence convolution is more general.

Cons of doing strided convolution compared to pooling: