·
AI & ML interests
LLM post-training
Organizations
ydeng9/OpenVLThinker-grpo-hard
Viewer
• Updated • 6.25k • 39
• 1
ydeng9/OpenVLThinker-grpo-medium
Viewer
• Updated • 3.3k • 20
Viewer
• Updated • 960 • 4
Viewer
• Updated • 2.3k • 17
Viewer
• Updated • 82.8k • 12
Viewer
• Updated • 1.76k • 7
Viewer
• Updated • 1.32k • 10
Viewer
• Updated • 789 • 11
Viewer
• Updated • 6 • 17
ydeng9/swe-smith-rl-distill
Viewer
• Updated • 7.81k • 16
ydeng9/OpenVLThinker-sft-iter3
Viewer
• Updated • 3.28k • 28
ydeng9/OpenVLThinker_sft_iter2
Viewer
• Updated • 5.54k • 7
ydeng9/captioned-data-subsetv1
Viewer
• Updated • 59.3k • 19
Viewer
• Updated • 3.11k • 71
• 1
Viewer
• Updated • 5.87k • 297
• 1