RLLab

https://github.com

AI & ML interests

None defined yet.

Recent Activity

JixuanLeng updated a model about 6 hours ago
RLLab/olmo-3-7b-it-sft-base-DPO-beta-5.0-nll-0.0-step-675
JixuanLeng published a model about 6 hours ago
RLLab/olmo-3-7b-it-sft-base-DPO-beta-5.0-nll-0.0-step-675
JixuanLeng updated a model about 6 hours ago
RLLab/olmo-3-7b-it-sft-base-DPO-beta-5.0-nll-0.0-step-300

RLLab's datasets (9)

RLLab/Dolci-Instruct-DPO

Viewer • Updated 10 days ago • 203k • 60

RLLab/Dolci-Instruct-DPO-Generations

Viewer • Updated 10 days ago • 349k • 33

RLLab/Dolci-DPO-Generations

Viewer • Updated 15 days ago • 1.09M • 97

RLLab/Dolci-Instruct-DPO-Delta-Generations

Viewer • Updated 18 days ago • 3.18M • 114

RLLab/Dolci-Instruct-SFT-NoFuncCalls

Viewer • Updated 26 days ago • 1.92M • 72

RLLab/cve-dpo-4b

Viewer • Updated Dec 10, 2025 • 32k • 12

RLLab/cve-all

Viewer • Updated Dec 10, 2025 • 16.7k • 6

RLLab/math-rl

Viewer • Updated Nov 25, 2025 • 57.5k • 73

RLLab/eval-set

Viewer • Updated Oct 27, 2025 • 12.4k • 78