Submitted by
Zhiyuan Hu
Massachusetts Institute of Technology
university
Verified
AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning
Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs