MMLU-Pro Leaderboard 🥇: More advanced and challenging multi-task evaluation
Open LLM Leaderboard 🏆: Track, rank and evaluate open LLMs and chatbots