RedHatAI/NVIDIA-Nemotron-3-Ultra-550B-A55B-FP8-dynamic Text Generation • 561B • Updated 2 days ago • 458 • 1
RedHatAI/NVIDIA-Nemotron-3-Ultra-550B-A55B-FP8-block Text Generation • 561B • Updated 3 days ago • 33
RedHatAI/NVIDIA-Nemotron-3-Ultra-550B-A55B-quantized.w4a16 Text Generation • 565B • Updated 3 days ago • 746 • 3
RedHatAI/NVIDIA-Nemotron-3-Ultra-550B-A55B-quantized.w4a16 Text Generation • 565B • Updated 3 days ago • 746 • 3
RedHatAI/NVIDIA-Nemotron-3-Ultra-550B-A55B-FP8-block Text Generation • 561B • Updated 3 days ago • 33
RedHatAI/NVIDIA-Nemotron-3-Ultra-550B-A55B-FP8-dynamic Text Generation • 561B • Updated 2 days ago • 458 • 1
inference-optimization/Qwen3.6-35B-A3B-7.0-bits-mode-noise Image-Text-to-Text • 32B • Updated 22 days ago • 65
inference-optimization/Qwen3.6-35B-A3B-7.0-bits-mode-hybrid Image-Text-to-Text • 32B • Updated 22 days ago • 65
inference-optimization/Qwen3.6-35B-A3B-7.0-bits-mode-heuristic Image-Text-to-Text • 32B • Updated 22 days ago • 117
inference-optimization/Qwen3.6-35B-A3B-6.5-bits-mode-noise Image-Text-to-Text • 30B • Updated 22 days ago • 79
inference-optimization/Qwen3.6-35B-A3B-6.5-bits-mode-hybrid Image-Text-to-Text • 30B • Updated 22 days ago • 63
inference-optimization/Qwen3.6-35B-A3B-6.5-bits-mode-heuristic Image-Text-to-Text • 30B • Updated 22 days ago • 84
inference-optimization/Qwen3.6-35B-A3B-6.0-bits-mode-noise Image-Text-to-Text • 28B • Updated 22 days ago • 48
inference-optimization/Qwen3.6-35B-A3B-6.0-bits-mode-hybrid Image-Text-to-Text • 28B • Updated 22 days ago • 131
inference-optimization/Qwen3.6-35B-A3B-6.0-bits-mode-heuristic Image-Text-to-Text • 28B • Updated 22 days ago • 70
inference-optimization/Qwen3.6-35B-A3B-5.5-bits-mode-noise Image-Text-to-Text • 26B • Updated 22 days ago • 46
inference-optimization/Qwen3.6-35B-A3B-5.5-bits-mode-hybrid Image-Text-to-Text • 26B • Updated 22 days ago • 74