Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

2,575

Full-text search

Active filters: multimodal

allenai/Molmo2-8B

Video-Text-to-Text • 9B • Updated 8 days ago • 4.55k • 100

Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • 8B • Updated Apr 6 • 2.59M • • 1.41k

stepfun-ai/GELab-Zero-4B-preview

Image-Text-to-Text • 4B • Updated 12 days ago • 1.97k • 139

Qwen/Qwen3-Omni-30B-A3B-Instruct

Any-to-Any • 35B • Updated Sep 22 • 208k • 783

jinaai/jina-vlm

Image-Text-to-Text • 2B • Updated 25 days ago • 3.28k • 90

allenai/Molmo2-4B

Video-Text-to-Text • 5B • Updated 8 days ago • 6.33k • 31

Dream-org/Dream-VLA-7B

Image-Text-to-Text • 8B • Updated 7 days ago • 153 • 6

internlm/CapRL-Qwen3VL-2B

Image-Text-to-Text • 2B • Updated 5 days ago • 80 • 6

internlm/CapRL-Qwen3VL-4B

Image-Text-to-Text • 4B • Updated 5 days ago • 91 • 6

Qwen/Qwen3-Omni-30B-A3B-Captioner

Any-to-Any • 32B • Updated Sep 22 • 21.7k • 188

microsoft/Fara-7B

Image-Text-to-Text • 8B • Updated 19 days ago • 298k • 452

Thunderbolt215215/UniPercept

Feature Extraction • 8B • Updated 2 days ago • 18 • 5

Dream-org/Dream-VL-7B

Image-Text-to-Text • 8B • Updated 7 days ago • 30 • 5

unsloth/Qwen2.5-VL-7B-Instruct-GGUF

Image-Text-to-Text • 8B • Updated May 12 • 50k • 117

OctoMed/OctoMed-7B

Image-Text-to-Text • 8B • Updated 24 days ago • 1.51k • 21

Qwen/Qwen2.5-VL-72B-Instruct

Image-Text-to-Text • 73B • Updated Jun 6 • 71.6k • • 576

Qwen/Qwen2.5-VL-32B-Instruct

Image-Text-to-Text • 33B • Updated Apr 14 • 261k • • 473

ByteDance-Seed/UI-TARS-1.5-7B

Image-Text-to-Text • 8B • Updated Apr 18 • 266k • 465

unsloth/Qwen2.5-Omni-3B-GGUF

Any-to-Any • 3B • Updated May 28 • 3.36k • 32

openvla/openvla-7b

Image-Text-to-Text • 8B • Updated Sep 16, 2024 • 752k • 159

Qwen/Qwen2-VL-7B-Instruct

Image-Text-to-Text • 8B • Updated Feb 6 • 1.2M • • 1.25k

Qwen/Qwen2.5-VL-3B-Instruct

Image-Text-to-Text • 4B • Updated Apr 6 • 2.94M • 580

imageomics/bioclip-2

Zero-Shot Image Classification • Updated Oct 16 • 17k • 25

Qwen/Qwen3-Omni-30B-A3B-Thinking

Any-to-Any • 32B • Updated Sep 22 • 40.8k • 241

internlm/CapRL-InternVL3.5-8B

Image-Text-to-Text • 9B • Updated 5 days ago • 485 • 5

QuixiAI/Prisma-VL-8B

Image-Text-to-Text • 770k • Updated about 17 hours ago • 42 • 22

Vikhrmodels/Borealis-5b-it

Audio-Text-to-Text • Updated 12 days ago • 789 • 9

amewebstudio/livia-multimodal-v1

10B • Updated 1 day ago • 387 • 2

internlm/CapRL-Qwen3VL-2B-GGUF

Image-Text-to-Text • 2B • Updated 2 days ago • 216 • 2

internlm/CapRL-Qwen3VL-4B-GGUF

Image-Text-to-Text • 4B • Updated 2 days ago • 197 • 2