Active filters: prm, trl
qgallouedec/Qwen2-0.5B-Reward • Token Classification • 0.5B • 1
plaguss/Qwen2.5-Math-7B-PRM-0.1 • Token Classification • 7B
plaguss/Qwen2.5-Math-7B-Instruct-PRM-0.1 • Token Classification • 7B • 3
plaguss/Qwen2.5-Math-1.5B-Instruct-PRM-0.1 • Token Classification • 2B • 1
HuggingFaceH4/Qwen2.5-Math-1.5B-Instruct-PRM-0.2 • Token Classification • 2B • 35
HuggingFaceH4/Qwen2.5-Math-7B-Instruct-PRM-0.2 • Token Classification • 7B • 23
Token Classification • 66.4M • 2
MikeMpapa/TraseSystem-orm-codeblob-verifier • Token Classification • 0.5B • 2
smohammadi/Qwen2.5-3B-MathShepherd • Token Classification • 3B • 1
axolotl-ai-co/Qwen2.5-Math-PRM-7B • Token Classification • 7B • 2 • 1
alothomas/Qwen2.5-0.5B-PRM-RAD-balanced-V3 • Token Classification • 0.5B • 1
alothomas/Qwen2.5-3B-PRM-RAD-balanced-V3 • Token Classification • 3B
alothomas/Qwen2.5-0.5B-PRM-RAD-balanced-V4 • Token Classification • 0.5B • 1
alothomas/Qwen2.5-0.5B-PRM-RAD-balanced-150k • Token Classification • 0.5B • 1
alothomas/Qwen2.5-3B-PRM-RAD-balanced-150k • Token Classification • 3B • 1
hzy/Qwen2.5-Math-7B-Instruct-PRM-Modified-math_shepherd • Token Classification • 7B • 1
jacopo-minniti/uats-value-model • Token Classification • 2B • 2
jacopo-minniti/Qwen2.5-Math-7B-PUM • Token Classification • 7B
jacopo-minniti/Qwen2.5-Math-7B-PUM-half_entropy • Token Classification • 7B
jacopo-minniti/Qwen2.5-Math-7B-PUM-soft-classification • 2B • 1
alothomas/Qwen2.5-0.5B-PRM-RAD-balanced-150k-LastStepOnly • Token Classification • 0.5B • 3
jacopo-minniti/Qwen2.5-Math-1.5B-PUM-variance • 2B
jacopo-minniti/Qwen2.5-Math-1.5B-PUM-binary-variance • Token Classification • 2B
jacopo-minniti/Qwen2.5-Math-1.5B-PUM-entropy_binary • Token Classification • 2B • 2
yungshun317/qwen2.5-0.5B-prm-mathshepherd • Token Classification • 0.5B • 12
jacopo-minniti/R1-Qwen-MMLU-1.5B-PUM-Variance • 2B • 3
jacopo-minniti/R1-Qwen-MMLU-1.5B-PRM • 2B • 23
jacopo-minniti/R1-Qwen-MMLU-1.5B-PRM-Regression • 2B • 4
ZaandaTeika/Qwen2.5-Math-7B-Instruct-SHARP-Math-PRM • Token Classification • 7B • 1