Title: Learning Correlation Structures for Vision Transformers
Paper: Learning Correlation Structures for Vision Transformers
Code: Learning Correlation Structures for Vision Transformers (kimmanjin.github.io)
Overview
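This paper proposes a new attention mechanism called structural self-attention (StructSA), along with StructViT, the Structural Vision Transformer. StructViT effectively extracts structural information from images, and the paper reports state-of-the-art performance on image and video classification tasks.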