RLFR Extending Reinforcement Learning for LLMs with Flow Environment JingHaoZ/RLFR-Qwen2.5-Math-7B Text Generation • 8B • Updated Oct 14, 2025 • 7 JingHaoZ/RLFR-Qwen2.5-VL-7B-Instruct Image-to-Text • 8B • Updated Oct 14, 2025 • 2 • 1 JingHaoZ/RLFR-Dataset-LM Viewer • Updated Nov 14, 2025 • 102k • 62 JingHaoZ/RLFR-Dataset-VLM Preview • Updated Oct 14, 2025 • 17
RLFR Extending Reinforcement Learning for LLMs with Flow Environment JingHaoZ/RLFR-Qwen2.5-Math-7B Text Generation • 8B • Updated Oct 14, 2025 • 7 JingHaoZ/RLFR-Qwen2.5-VL-7B-Instruct Image-to-Text • 8B • Updated Oct 14, 2025 • 2 • 1 JingHaoZ/RLFR-Dataset-LM Viewer • Updated Nov 14, 2025 • 102k • 62 JingHaoZ/RLFR-Dataset-VLM Preview • Updated Oct 14, 2025 • 17