Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
davidberenstein1957 's Collections
guardrails
Smol but mighty
Useful Spaces
LLM evals and benchmark datasets
Dataset Viber annotators
Cool and fun Spaces
Model Leaderboards
Useful models
Useful datasets

Model Leaderboards

updated Jan 22, 2025
Upvote
1

  • Running on CPU Upgrade
    7.32k

    MTEB Leaderboard

    🥇
    7.32k

    Embedding Leaderboard


  • Running
    Agents
    428

    Reward Bench Leaderboard

    📐
    428

    Explore RewardBench model rankings and scores


  • Runtime error
    14k

    Open LLM Leaderboard

    🏆
    14k

    Track, rank and evaluate open LLMs and chatbots


  • Running
    4.86k

    Arena Leaderboard

    🏆
    4.86k

    View the LMArena model leaderboard


  • Running
    Agents
    1.5k

    Big Code Models Leaderboard

    📈
    1.5k

    Explore and submit code model evaluations on a leaderboard


  • Running
    Agents
    232

    AI2 WildBench Leaderboard (V2)

    🦁
    232

    Display and explore a leaderboard of language models


  • Running on CPU Upgrade
    Agents
    1.01k

    Open VLM Leaderboard

    🌎
    1.01k

    VLMEvalKit Evaluation Results Collection


  • Running
    Agents
    230

    BigCodeBench Leaderboard

    🥇
    230

    Explore code-generation model leaderboards and task details


  • Running
    Agents
    Featured
    586

    LLM-Perf Leaderboard

    🏆
    586

    Explore LLM performance across hardware configurations


  • Running
    116

    MTEB Arena

    ⚔
    116

    Display MTEB Arena interface

Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs