SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis Paper • 2505.16834 • Published May 22, 2025
SWE-World: Building Software Engineering Agents in Docker-Free Environments Paper • 2602.03419 • Published Feb 3 • 41
SWE-Master: Unleashing the Potential of Software Engineering Agents via Post-Training Paper • 2602.03411 • Published Feb 3 • 39
ClawGym: A Scalable Framework for Building Effective Claw Agents Paper • 2604.26904 • Published 2 days ago • 36
Low-rank Optimization Trajectories Modeling for LLM RLVR Acceleration Paper • 2604.11446 • Published 18 days ago • 4
GISA: A Benchmark for General Information-Seeking Assistant Paper • 2602.08543 • Published Feb 9 • 26
Adaptive Ability Decomposing for Unlocking Large Reasoning Model Effective Reinforcement Learning Paper • 2602.00759 • Published Jan 31 • 5
Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models Paper • 2508.10751 • Published Aug 14, 2025 • 29
YuLan-Mini Resources Collection Pre-Training & post-training resources for YuLan-Mini • 29 items • Updated May 7, 2025 • 3
WebThinker: Empowering Large Reasoning Models with Deep Research Capability Paper • 2504.21776 • Published Apr 30, 2025 • 59