PRISM: A Multi-Dimensional Benchmark for Evaluating LLM Peer Reviewers Paper • 2605.26730 • Published 10 days ago • 16
Off-the-Shelf LLMs as Process Scorers: Training-Free Alternative to PRMs for Mathematical Reasoning Paper • 2606.01682 • Published 5 days ago • 7