Towards General Text Embeddings with Multi-stage Contrastive Learning
:https://arxiv.org/abs/2308.03281
mGTE: Generalized Long-Context Text Representation and Reranking Models for Multilingual Text Retrieval
:https://arxiv.org/abs/2407.19669
论文中,sentence-embedding取最后一层均值,
实际上代码中取的是最后一个token的embedding