Model Name | Win Rate | Length |
---|---|---|
GPT-4 Turbo | 50.00% | 2049 |
Contextual AI (KTO-Mistral-PairRM) | 33.23% | 2521 |
Yi 34B Chat | 29.66% | 2123 |
Claude 3 Opus (02/29) | 29.04% | 1388 |
Claude 3 Sonnet (02/29) | 25.56% | 1420 |
GPT-4 | 23.58% | 1365 |
GPT-4 0314 | 22.07% | 1371 |
Mistral Medium | 21.86% | 1500 |
Mistral Large (24/02) | 21.44% | 1362 |
Mixtral 8x7B v0.1 |