Transformer 的基础结构
NLP Structure
VIT
SWIN
DERT
Transformer 常用terms
https://www.pinecone.io/learn/batch-layer-normalization/
https://wandb.ai/wandb_fc/LayerNorm/reports/Layer-Normalization-in-Pytorch-With-Examples—VmlldzoxMjk5MTk1
https://towardsdatascience.com/different-normalization-layers-in-deep