Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition
crop & warp
Spatial pyramid pooling
In this paper, we introduce a spatial pyramid pooling layer to remove the fixed-size constraint of the network.
Add a SPP layer on top of the last convolutional layer.
The SPP layer pools the features and generates fixed-length outputs, which are then fed into the fully-connected layers.
也就是,这是一种 信息聚集 的方法,避免来cropping 和 warping
- SPP 能够产生固定长度的输出
- SPP使用了多层次的特征
- SPP可以提取不同层次的特征