Sparse Spatial Transformers for Few-Shot Learning https://arxiv.org/abs/2109.12932v1 A paper that incorporates the transformer architecture into few-shot learning.