RLLab

https://github.com

Recent Activity

JixuanLeng updated a model about 6 hours ago

RLLab/olmo-3-7b-it-sft-base-DPO-beta-5.0-nll-0.0-step-675

JixuanLeng published a model about 6 hours ago

RLLab/olmo-3-7b-it-sft-base-DPO-beta-5.0-nll-0.0-step-675

JixuanLeng updated a model about 6 hours ago

RLLab/olmo-3-7b-it-sft-base-DPO-beta-5.0-nll-0.0-step-300

RLLab's datasets (9)

RLLab/Dolci-Instruct-DPO

Viewer • Updated 10 days ago • 203k • 60

RLLab/Dolci-Instruct-DPO-Generations

Viewer • Updated 10 days ago • 349k • 33

RLLab/Dolci-DPO-Generations

Viewer • Updated 15 days ago • 1.09M • 97

RLLab/Dolci-Instruct-DPO-Delta-Generations

Viewer • Updated 18 days ago • 3.18M • 114

RLLab/Dolci-Instruct-SFT-NoFuncCalls

Viewer • Updated 26 days ago • 1.92M • 72

RLLab/cve-dpo-4b

Viewer • Updated Dec 10, 2025 • 32k • 12

RLLab/cve-all

Viewer • Updated Dec 10, 2025 • 16.7k • 6

RLLab/math-rl

Viewer • Updated Nov 25, 2025 • 57.5k • 73

RLLab/eval-set

Viewer • Updated Oct 27, 2025 • 12.4k • 78