YuchenLi01/MATH_1.5Bsft_Score_DPO_Qwen2.5MathRM72B_hard0soft2_v2_chosenhigh_rejectedrand Viewer • Updated Sep 15, 2025 • 15k • 2
YuchenLi01/MATH_1.5Bsft_Score_DPO_Qwen2.5MathRM72B_hard0soft1_v2_chosenhigh_rejectedrand Viewer • Updated Sep 15, 2025 • 7.5k • 2
YuchenLi01/MATH_1.5Bsft_Score_DPO_Qwen2.5MathRM72B_hard0soft16_all_soft_random_unfiltered Viewer • Updated Sep 12, 2025 • 120k • 37
YuchenLi01/MATH_1.5Bsft_Score_DPO_Qwen2.5MathRM72B_hard0soft8_all_soft_random_unfiltered Viewer • Updated Sep 12, 2025 • 60k • 2
YuchenLi01/MATH_1.5Bsft_Score_DPO_Qwen2.5MathRM72B_hard0soft4_all_soft_random_unfiltered Viewer • Updated Sep 12, 2025 • 30k • 1
YuchenLi01/MATH_1.5Bsft_Score_DPO_Qwen2.5MathRM72B_hard0soft2_all_soft_random_unfiltered Viewer • Updated Sep 12, 2025 • 15k • 2
YuchenLi01/MATH_1.5Bsft_Score_DPO_Qwen2.5MathRM72B_hard0soft1_all_soft_random_unfiltered Viewer • Updated Sep 12, 2025 • 7.5k • 2
YuchenLi01/MATH_train_generations-mathgenpair2_1.5Bpretrained_sft_lm1_ebs32_lr1e-06_epoch1.0_42 Viewer • Updated Sep 12, 2025 • 7.5k • 6
YuchenLi01/GSM8K_1.5Bsft_Score_DPO_Qwen2.5MathRM72B_hard0soft1_v4_all_soft_random_unfiltered Viewer • Updated Sep 11, 2025 • 7.47k • 2
YuchenLi01/MATH_mathgenpair2_1.5Bpretrained_sft_lm1_ebs32_lr1e-06_epoch1.0_42_DPO_pair32 Viewer • Updated Sep 10, 2025 • 205k • 6
YuchenLi01/MATH_mathgenpair2_1.5Bpretrained_sft_lm1_ebs32_lr1e-06_epoch1.0_42_DPO_pair16 Viewer • Updated Sep 10, 2025 • 103k • 7
YuchenLi01/MATH_mathgenpair2_1.5Bpretrained_sft_lm1_ebs32_lr1e-06_epoch1.0_42_DPO_pair8 Viewer • Updated Sep 10, 2025 • 51.4k • 4
YuchenLi01/MATH_mathgenpair2_1.5Bpretrained_sft_lm1_ebs32_lr1e-06_epoch1.0_42_DPO_pair4 Viewer • Updated Sep 10, 2025 • 25.7k • 4
YuchenLi01/MATH_mathgenpair2_1.5Bpretrained_sft_lm1_ebs32_lr1e-06_epoch1.0_42_DPO_pair2 Viewer • Updated Sep 10, 2025 • 12.8k • 4
YuchenLi01/MATH_mathgenpair2_1.5Bpretrained_sft_lm1_ebs32_lr1e-06_epoch1.0_42_DPO_pair1 Viewer • Updated Sep 10, 2025 • 6.42k • 4
YuchenLi01/MATH_1.5Binstruct_Score_DPO_Qwen2.5MathRM72B_hard0soft16_all_soft_random_unfiltered Viewer • Updated Sep 7, 2025 • 120k • 2
YuchenLi01/MATH_1.5Binstruct_Score_DPO_Qwen2.5MathRM72B_hard0soft8_all_soft_random_unfiltered Viewer • Updated Sep 7, 2025 • 60k • 2
YuchenLi01/MATH_1.5Binstruct_Score_DPO_Qwen2.5MathRM72B_hard0soft4_all_soft_random_unfiltered Viewer • Updated Sep 7, 2025 • 30k • 4
YuchenLi01/MATH_1.5Binstruct_Score_DPO_Qwen2.5MathRM72B_hard0soft2_all_soft_random_unfiltered Viewer • Updated Sep 7, 2025 • 15k • 2
YuchenLi01/MATH_1.5Binstruct_Score_DPO_Qwen2.5MathRM72B_hard0soft1_all_soft_random_unfiltered Viewer • Updated Sep 6, 2025 • 7.5k • 2