20240729 大模型评测
参考:MMBench:基于ChatGPT的全方位多模能力评测体系_哔哩哔哩_bilibilihttps://en.wikipedia.org/wiki/Levenshtein_distancecider: https://zhuanlan.zhihu.com/p/698643372GitHub - open-compass/opencompass: OpenCompass is an LLM evaluation platform, supporting a wide range of models (
复制链接