·
AI & ML interests
None yet
Organizations
a-F1/Qwen2.5-1.5B-Instruct-SFT-OpenR1-LR_2e-5-Epoch_2
Text Generation
•
2B
•
Updated
•
1
a-F1/Qwen2.5-1.5B-Instruct-SFT-OpenR1-LR_1e-4-Epoch_2
Text Generation
•
2B
•
Updated
•
1
a-F1/Qwen2.5-1.5B-Instruct-SFT-OpenR1-LR_2e-5
Text Generation
•
2B
•
Updated
•
1
a-F1/Qwen2.5-1.5B-Instruct-SFT-OpenR1-LR_5e-5
Text Generation
•
2B
•
Updated
•
1
a-F1/Qwen2.5-1.5B-Instruct-SFT-OpenR1-LR_1e-5
Text Generation
•
2B
•
Updated
•
1
a-F1/Qwen2.5-1.5B-Instruct-SFT-OpenR1-LR_5e-6
Text Generation
•
2B
•
Updated
•
1
a-F1/Qwen-1.5B-SFT-OpenR1-LR_2e-5
Text Generation
•
2B
•
Updated
•
2
a-F1/Qwen-1.5B-SFT-OpenR1-LR_5e-5
Text Generation
•
2B
•
Updated
•
1
a-F1/Qwen-1.5B-SFT-OpenR1-LR_1e-5
Text Generation
•
2B
•
Updated
•
1
a-F1/Qwen-1.5B-SFT-OpenR1-LR_5e-6
Text Generation
•
2B
•
Updated
•
1
a-F1/Qwen-1.5B-SFT-OpenR1
2B
•
Updated
•
1
Text Generation
•
8B
•
Updated
•
2
•
1
8B
•
Updated
•
1
a-F1/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
2B
•
Updated
•
1
a-F1/Qwen2.5-Math-1.5B-Open-R1-Distill-bi
Text Generation
•
2B
•
Updated
•
2
a-F1/Qwen2.5-Math-1.5B-Open-R1-Distill-mixed
Text Generation
•
2B
•
Updated
•
1
a-F1/Qwen2.5-Math-1.5B-Open-R1-Distill
Text Generation
•
2B
•
Updated
•
1
a-F1/Qwen2.5-1.5B-Open-R1-Distill-bi
Text Generation
•
2B
•
Updated
•
1
a-F1/Qwen2.5-1.5B-Open-R1-Distill-mixed
Text Generation
•
2B
•
Updated
a-F1/Qwen2.5-7B-Open-R1-Distill-mixed
Updated
a-F1/Qwen2.5-7B-Open-R1-Distill-bi
Text Generation
•
8B
•
Updated
Text Generation
•
7B
•
Updated
Text Generation
•
7B
•
Updated
•
1
a-F1/SimNPO_TOFU_Forget10
Text Generation
•
7B
•
Updated
a-F1/SimNPO_TOFU_Forget05
Text Generation
•
7B
•
Updated
•
1