Inference Providers
Active filters: draft
0xSero/NVIDIA-Nemotron-3-Super-120B-A12B-BF16-REAP-50pct-draft
Text Generation
• 64B • Updated • 58
• 5
0xSero/NVIDIA-Nemotron-3-Super-120B-A12B-BF16-REAP-50pct-AutoRound-W4A16-draft
Text Generation
• 6B • Updated • 63
• 3
0xSero/NVIDIA-Nemotron-3-Super-120B-A12B-BF16-REAP-25pct-AutoRound-W4A16-draft
Text Generation
• 6B • Updated • 50
• 2
0xSero/NVIDIA-Nemotron-3-Super-120B-A12B-BF16-REAP-25pct-draft
Text Generation
• 92B • Updated • 44
• 1
0xSero/NVIDIA-Nemotron-3-Super-120B-A12B-BF16-AutoRound-W4A16-draft
Text Generation
• Updated • 1
mradermacher/DeepSeek-R1-DRAFT-0.5B-v1.0-GGUF
0.5B • Updated • 75
mradermacher/DeepSeek-V3-0324-DRAFT-0.5B-v1.0-GGUF
0.5B • Updated • 52
Gapeleon/DeepSeek-R1-0528-CODER-DRAFT-0.6B-v1.0-Q4_K_M-GGUF
0.6B • Updated • 7
mradermacher/DeepSeek-V3-0324-CODER-DRAFT-0.6B-v1.0-GGUF
0.6B • Updated • 32
mradermacher/DeepSeek-R1-0528-CODER-DRAFT-0.6B-v1.0-GGUF
0.6B • Updated • 82
0.8B • Updated • 3
mradermacher/DeepSeek-R1-0528-CODER-DRAFT-0.6B-v1.1-GGUF
0.6B • Updated • 169
mradermacher/DeepSeek-V3-0324-CODER-DRAFT-0.6B-v1.1-GGUF
0.6B • Updated • 127
mradermacher/DeepSeek-R1-DRAFT-0.6B-v2.0-GGUF
0.6B • Updated • 18
mradermacher/DeepSeek-V3-DRAFT-0.6B-v2.0-GGUF
0.6B • Updated • 69
• 1
jukofyork/GLM-4.5-DRAFT-0.6B-v3.0
0.6B • Updated • 9
• 5
jukofyork/GLM-4.5-DRAFT-0.6B-v3.0-GGUF
0.6B • Updated • 268
• 19
mradermacher/GLM-4.5-DRAFT-0.6B-v3.0-GGUF
0.6B • Updated • 72
mradermacher/GLM-4.5-DRAFT-0.6B-v3.0-i1-GGUF
0.6B • Updated • 110
• 1
jukofyork/DeepSeek-R1-DRAFT-0.6B-v3.0
0.6B • Updated • 4
• 1
jukofyork/DeepSeek-R1-DRAFT-0.6B-v3.0-GGUF
0.6B • Updated • 28
mradermacher/DeepSeek-R1-DRAFT-0.6B-v3.0-GGUF
0.6B • Updated • 20
mradermacher/DeepSeek-R1-DRAFT-0.6B-v3.0-i1-GGUF
0.6B • Updated • 119
jukofyork/DeepSeek-V3-DRAFT-0.6B-v3.0
0.6B • Updated • 5
• 2
jukofyork/DeepSeek-V3-DRAFT-0.6B-v3.0-GGUF
0.6B • Updated • 57
jukofyork/Qwen3-0.6B-YaRN-GGUF
0.8B • Updated • 436
• 4
jukofyork/Kimi-K2-Instruct-DRAFT-0.6B-v3.0
0.7B • Updated • 3
• 1
jukofyork/Kimi-K2-Instruct-DRAFT-0.6B-v3.0-GGUF
0.7B • Updated • 53
jukofyork/Qwen3-Coder-Instruct-DRAFT-0.75B-GGUF
0.8B • Updated • 628
• 7
mradermacher/DeepSeek-V3-DRAFT-0.6B-v3.0-GGUF
0.6B • Updated • 55