VibeThinker-3B: Exploring the Frontier of Verifiable Reasoning in Small Language Models Paper • 2606.16140 • Published 8 days ago • 108
Gemma Diffusion Website Builder 🌐: Watch a diffusion LLM write a website live, then tweak it. Running on Zero • Agents • Featured • 64
Soohak: A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs Paper • 2605.09063 • Published May 9 • 82