yang's picture

11

yang

fengfan933

·

AI & ML interests

None yet

Organizations

None yet

upvoted 3 papers 8 months ago

ARM: Adaptive Reasoning Model

Paper • 2505.20258 • Published May 26, 2025 • 45

Seed1.5-Thinking: Advancing Superb Reasoning Models with Reinforcement Learning

Paper • 2504.13914 • Published Apr 10, 2025 • 4

Seed1.5-VL Technical Report

Paper • 2505.07062 • Published May 11, 2025 • 154

upvoted a paper 9 months ago

BookWorld: From Novels to Interactive Agent Societies for Creative Story Generation

Paper • 2504.14538 • Published Apr 20, 2025 • 30

upvoted 2 papers 10 months ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18, 2025 • 144

Implicit Reasoning in Transformers is Reasoning through Shortcuts

Paper • 2503.07604 • Published Mar 10, 2025 • 23

upvoted 2 papers 11 months ago

CoSER: Coordinating LLM-Based Persona Simulation of Established Roles

Paper • 2502.09082 • Published Feb 13, 2025 • 30

Distillation Scaling Laws

Paper • 2502.08606 • Published Feb 12, 2025 • 47

upvoted 3 papers 12 months ago

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published Jan 22, 2025 • 126

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22, 2025 • 434

Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

Paper • 2501.11425 • Published Jan 20, 2025 • 109