Model Leaderboards - a davidberenstein1957 Collection

davidberenstein1957 's Collections

Smol but mighty

LLM evals and benchmark datasets

Dataset Viber annotators

Cool and fun Spaces

Model Leaderboards

Useful datasets

Model Leaderboards

updated Jan 22, 2025

Running on CPU Upgrade

7.32k

MTEB Leaderboard

🥇

7.32k

Embedding Leaderboard
Running

Agents

428

Reward Bench Leaderboard

📐

428

Explore RewardBench model rankings and scores
Runtime error

14k

Open LLM Leaderboard

🏆

14k

Track, rank and evaluate open LLMs and chatbots
Running

4.86k

Arena Leaderboard

🏆

4.86k

View the LMArena model leaderboard
Running

Agents

1.5k

Big Code Models Leaderboard

📈

1.5k

Explore and submit code model evaluations on a leaderboard
Running

Agents

232

AI2 WildBench Leaderboard (V2)

🦁

232

Display and explore a leaderboard of language models
Running on CPU Upgrade

Agents

1.01k

Open VLM Leaderboard

🌎

1.01k

VLMEvalKit Evaluation Results Collection
Running

Agents

230

BigCodeBench Leaderboard

🥇

230

Explore code-generation model leaderboards and task details
Running

Agents

Featured

586

LLM-Perf Leaderboard

🏆

586

Explore LLM performance across hardware configurations
Running

116

MTEB Arena

⚔

116

Display MTEB Arena interface