-
Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations
Paper • 2602.05885 • Published • 28 -
hkust-nlp/drkernel-14b
Text Generation • 15B • Updated • 55 • 6 -
hkust-nlp/drkernel-8b
Text Generation • 8B • Updated • 118 • 4 -
hkust-nlp/drkernel-14b-coldstart
Text Generation • 0.5B • Updated • 1.13k
HKUST NLP Group
university
AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
LOCA-bench: Benchmarking Language Agents Under Controllable and Extreme Context Growth
Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations
-
Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations
Paper • 2602.05885 • Published • 28 -
hkust-nlp/drkernel-14b
Text Generation • 15B • Updated • 55 • 6 -
hkust-nlp/drkernel-8b
Text Generation • 8B • Updated • 118 • 4 -
hkust-nlp/drkernel-14b-coldstart
Text Generation • 0.5B • Updated • 1.13k
models 66
hkust-nlp/drkernel-8b-coldstart
Text Generation • 0.3B • Updated
• 2.36k
hkust-nlp/drkernel-14b-coldstart
Text Generation • 0.5B • Updated
• 1.13k
hkust-nlp/drkernel-14b
Text Generation • 15B • Updated
• 55 • 6
hkust-nlp/drkernel-8b
Text Generation • 8B • Updated
• 118 • 4
hkust-nlp/WebExplorer-8B
Image-Text-to-Text • 8B • Updated
• 248 • 12
hkust-nlp/Qwen-2.5-7B-Verifier-general-verifier
Reinforcement Learning • 8B • Updated
• 1
hkust-nlp/Qwen-2.5-7B-Verifier-R1-Qwen-1.5B
Reinforcement Learning • 8B • Updated
• 3
hkust-nlp/Qwen-2.5-7B-Verifier-HF
Reinforcement Learning • 8B • Updated
• 2
hkust-nlp/R1-Distill-Verifier-1.5B
2B • Updated
• 5 • 1
hkust-nlp/Qwen-2.5-7B-Verifier-R1-Verifier-1.5B
Reinforcement Learning • 8B • Updated
• 3 • 1
datasets 32
hkust-nlp/drkernel-validation-data
Viewer
• Updated
• 100 • 73 • 1
hkust-nlp/drkernel-rl-data
Viewer
• Updated
• 72k • 49
hkust-nlp/drkernel-coldstart-8k
Viewer
• Updated
• 8.92k • 61 • 2
hkust-nlp/Toolathlon-Trajectories
Preview
• Updated
• 1.15k • 18
hkust-nlp/WebExplorer-QA
Viewer
• Updated
• 100 • 14 • 6
hkust-nlp/CodeIO-PyEdu-Reasoning-Raw
Updated
• 22 • 2
hkust-nlp/CodeIO-PyEdu-Reasoning
Preview
• Updated
• 59 • 56
hkust-nlp/rl-verifier-pitfalls_hacking_data
Viewer
• Updated
• 6.12k • 331 • 1
hkust-nlp/deepscaler_simplelr
Viewer
• Updated
• 40.3k • 9
hkust-nlp/Laser-Deepscaler-Dataset
Viewer
• Updated
• 40.8k • 86