nvidia/Qwen3-VL-235B-A22B-Instruct-NVFP4-MLPerf-Inference-Closed-V6.0 133B • Updated 14 days ago • 25.9k • 3
EliasOenal/MiniMax-M2.5-Hybrid-AWQ-W4A16G128-Attn-fp8_e4m3-KV-fp8_e4m3 Text Generation • 34B • Updated about 20 hours ago • 9 • 2
meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 Image-Text-to-Text • 402B • Updated May 22, 2025 • 61.7k • 152
dolfsai/Qwen3-Reranker-4B-seq-cls-vllm-W4A16_ASYM Text Ranking • 0.9B • Updated Aug 23, 2025 • 381 • 1
RedHatAI/Voxtral-Mini-3B-2507-FP8-dynamic Automatic Speech Recognition • 5B • Updated 13 days ago • 200 • 10