Hugging Face
Models: 6,545
Sort: Trending
Active filters: image-text-to-text
google/t5gemma-2-270m-270m • Image-Text-to-Text • 0.8B • Updated 14 days ago • 8.96k downloads • 147 likes
browser-use/bu-30b-a3b-preview • Image-Text-to-Text • 31B • Updated 7 days ago • 5.4k downloads • 225 likes
google/t5gemma-2-4b-4b • Image-Text-to-Text • 9B • Updated 12 days ago • 4.15k downloads • 128 likes
Qwen/Qwen3-VL-8B-Instruct • Image-Text-to-Text • 9B • Updated Oct 15 • 2.82M downloads • 610 likes
deepseek-ai/DeepSeek-OCR • Image-Text-to-Text • 3B • Updated Nov 4 • 3.91M downloads • 3.02k likes
zai-org/GLM-4.6V-Flash • Image-Text-to-Text • 10B • Updated 21 days ago • 240k downloads • 520 likes
PaddlePaddle/PaddleOCR-VL • Image-Text-to-Text • 1.0B • Updated 19 days ago • 17.1k downloads • 1.44k likes
Qwen/Qwen3-VL-30B-A3B-Instruct • Image-Text-to-Text • 31B • Updated Nov 26 • 1.4M downloads • 477 likes
zai-org/AutoGLM-Phone-9B • Image-Text-to-Text • 934k • Updated 21 days ago • 86.4k downloads • 397 likes
google/gemma-3-27b-it • Image-Text-to-Text • 27B • Updated Mar 21 • 1.56M downloads • 1.78k likes
google/t5gemma-2-1b-1b • Image-Text-to-Text • 2B • Updated 12 days ago • 4.17k downloads • 59 likes
tencent/HunyuanOCR • Image-Text-to-Text • 1.0B • Updated 7 days ago • 880k downloads • 695 likes
google/gemma-3-4b-it • Image-Text-to-Text • 4B • Updated Mar 21 • 852k downloads • 1.07k likes
zai-org/GLM-4.6V • Image-Text-to-Text • 108B • Updated 22 days ago • 162k downloads • 351 likes
janhq/Jan-v2-VL-max-FP8 • Image-Text-to-Text • 31B • Updated 9 days ago • 395 downloads • 25 likes
Qwen/Qwen3-VL-2B-Instruct • Image-Text-to-Text • 2B • Updated Oct 23 • 1.24M downloads • 249 likes
Qwen/Qwen2.5-VL-7B-Instruct • Image-Text-to-Text • 8B • Updated Apr 6 • 2.59M downloads • 1.41k likes
fancyfeast/llama-joycaption-beta-one-hf-llava • Image-Text-to-Text • 8B • Updated May 16 • 101k downloads • 287 likes
Qwen/Qwen3-VL-4B-Instruct • Image-Text-to-Text • 4B • Updated Oct 15 • 723k downloads • 282 likes
unsloth/GLM-4.6V-Flash-GGUF • Image-Text-to-Text • 9B • Updated 3 days ago • 70k downloads • 62 likes
google/gemma-3-12b-it • Image-Text-to-Text • 12B • Updated Mar 21 • 1.42M downloads • 598 likes
moondream/moondream3-preview • Image-Text-to-Text • 9B • Updated Oct 9 • 5.9k downloads • 531 likes
Qwen/Qwen3-VL-235B-A22B-Instruct • Image-Text-to-Text • 236B • Updated Nov 26 • 232k downloads • 347 likes
Qwen/Qwen3-VL-235B-A22B-Thinking • Image-Text-to-Text • 236B • Updated Nov 26 • 32.6k downloads • 357 likes
stepfun-ai/GELab-Zero-4B-preview • Image-Text-to-Text • 4B • Updated 12 days ago • 1.97k downloads • 139 likes
microsoft/Florence-2-large • Image-Text-to-Text • 0.8B • Updated Aug 4 • 821k downloads • 1.73k likes
google/medgemma-4b-it • Image-Text-to-Text • 4B • Updated Oct 28 • 375k downloads • 807 likes
google/medgemma-27b-it • Image-Text-to-Text • 29B • Updated Jul 10 • 12.1k downloads • 255 likes
rednote-hilab/dots.ocr • Image-Text-to-Text • 3B • Updated Oct 31 • 847k downloads • 1.17k likes
jinaai/jina-vlm • Image-Text-to-Text • 2B • Updated 25 days ago • 3.28k downloads • 90 likes