SCALE: Selective Resource Allocation for Overcoming Performance Bottlenecks in Mathematical Test-time Scaling Paper • 2512.00466 • Published Nov 29, 2025 • 10
Towards Dynamic Theory of Mind: Evaluating LLM Adaptation to Temporal Evolution of Human States Paper • 2505.17663 • Published May 23, 2025 • 15
LIMOPro: Reasoning Refinement for Efficient and Effective Test-time Scaling Paper • 2505.19187 • Published May 25, 2025 • 13
Cosmos Collection ⚠️ This collection is archived. https://huggingface.co/collections/nvidia/nvidia-cosmos-2 • 31 items • Updated 6 days ago • 299
Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale Paper • 2409.17115 • Published Sep 25, 2024 • 64
Open LLM Leaderboard: Track, rank and evaluate open LLMs and chatbots • Running on CPU Upgrade • 13.8k
CMMMU: A Chinese Massive Multi-discipline Multimodal Understanding Benchmark Paper • 2401.11944 • Published Jan 22, 2024 • 27