CNN如何支持位置偏移缩放

pooling层,将特征的精度模糊化,支持细微的位移和缩放。

大范围的位移和缩放,还是需要fully connected layer,通过学习大量不同位置和形状比例的物体,来支持物体位置和缩放的识别。


参考资料:

https://www.quora.com/How-is-a-convolutional-neural-network-able-to-learn-invariant-features

https://stats.stackexchange.com/questions/208936/what-is-translation-invariance-in-computer-vision-and-convolutional-netral-netwo


   1、The pooling regimes make convolution process invariant to translation, rotation and shifting. Most widely used one is max-pooling. You take the highest activation to propagate at the interest region so called receptive field. Even a images are relatively a little shifted, since we are looking for highest activation, we are able to capture commonalities between images.
   2、For scale invariance, up to my knowledge, no way other than providing different scales of images to network or learned network filters might be applied at different scales.   
   3、Other forms of invariances are built up artificially by rotating, mirroring and scaling up the training examples. This is because it is important to see training sets from different points of view in order to generalize better.

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值