Transformer系列：Shunted self-attention (CVPR2022 oral)

最新推荐文章于 2024-07-17 21:25:38 发布

CV小白升级中

最新推荐文章于 2024-07-17 21:25:38 发布

阅读量271

点赞数

分类专栏： Classification Object detection Transformer 文章标签： transformer 深度学习人工智能

版权声明：本文为博主原创文章，遵循 CC 4.0 BY-SA 版权协议，转载请附上原文出处链接和本声明。

本文链接：https://blog.csdn.net/weixin_34992700/article/details/125406246

版权

文章地址：https://arxiv.org/abs/2111.15193

1. Motivation

ViT的每层特征的感受野大小是相似的，导致无法处理多尺度目标大小的任务。

2. Contribution

提出SSA，将attention head分组，每组负责不同的attention granularity，来处理hybrid-scale attention

3. Methods

3.1 Shunted transformer block

Shunted self-attention: multi-head self-attention中不同head的key和value采用不同的下采样率

Data-specific f

最低0.47元/天解锁文章

CV小白升级中

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
Transformer系列：Shunted self-attention (CVPR2022 oral)

The key idea of SSA is to inject heterogeneous receptive field sizes into tokens: before computing the self-attention matrix, it selectively merges tokens to represent larger object features while keeping certain tokens to preserve fine-grained features.
复制链接

扫一扫

专栏目录

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。