Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
1
Libraries
1
Languages
Licenses
Other
Tasks
Reset Tasks
Text Generation
Any-to-Any
Image-Text-to-Text
Image-to-Text
Image-to-Image
Text-to-Image
Text-to-Video
Text-to-Speech
+ 44
Parameters
Reset Parameters
< 1B
6B
12B
32B
128B
> 500B
< 1B
> 500B
Libraries
Reset Libraries
PyTorch
google-tensorflow
TensorFlow
JAX
Transformers
Diffusers
sentence-transformers
Safetensors
ONNX
GGUF
Transformers.js
MLX
+ 41
Apps
vLLM
TGI
llama.cpp
MLX LM
LM Studio
Ollama
Jan
+ 7
Inference Providers
Groq
Novita
Nebius AI
Cerebras
SambaNova
Nscale
fal
Hyperbolic
+ 11
Apply filters
Models
4,425
Full-text search
Inference Available
Edit filters
Sort: Trending
Active filters:
image-text-to-text, transformers
Clear all
LiquidAI/LFM2.5-VL-1.6B
Image-Text-to-Text
•
2B
•
Updated
about 16 hours ago
•
332
•
33
deepseek-ai/DeepSeek-OCR
Image-Text-to-Text
•
3B
•
Updated
Nov 4, 2025
•
3.37M
•
3.04k
Qwen/Qwen3-VL-2B-Instruct
Image-Text-to-Text
•
2B
•
Updated
Oct 23, 2025
•
1.19M
•
264
Qwen/Qwen3-VL-8B-Instruct
Image-Text-to-Text
•
9B
•
Updated
Oct 15, 2025
•
2.53M
•
•
625
LiquidAI/LFM2-VL-3B
Image-Text-to-Text
•
3B
•
Updated
Dec 5, 2025
•
4.46k
•
123
tencent/HunyuanOCR
Image-Text-to-Text
•
1.0B
•
Updated
13 days ago
•
859k
•
669
google/gemma-3-4b-it
Image-Text-to-Text
•
4B
•
Updated
Mar 21, 2025
•
766k
•
1.08k
google/gemma-3-27b-it
Image-Text-to-Text
•
27B
•
Updated
Mar 21, 2025
•
1.51M
•
•
1.79k
ibm-granite/granite-docling-258M
Image-Text-to-Text
•
0.3B
•
Updated
Sep 23, 2025
•
202k
•
1.08k
google/medgemma-4b-it
Image-Text-to-Text
•
4B
•
Updated
Oct 28, 2025
•
367k
•
817
Qwen/Qwen3-VL-30B-A3B-Instruct
Image-Text-to-Text
•
31B
•
Updated
Nov 26, 2025
•
1.27M
•
•
488
google/t5gemma-2-270m-270m
Image-Text-to-Text
•
0.8B
•
Updated
21 days ago
•
14.1k
•
156
google/gemma-3n-E4B-it
Image-Text-to-Text
•
8B
•
Updated
Jul 14, 2025
•
132k
•
847
Qwen/Qwen3-VL-4B-Instruct
Image-Text-to-Text
•
4B
•
Updated
Oct 15, 2025
•
613k
•
290
google/gemma-3-12b-it
Image-Text-to-Text
•
12B
•
Updated
Mar 21, 2025
•
1.31M
•
•
605
moondream/moondream3-preview
Image-Text-to-Text
•
9B
•
Updated
Oct 9, 2025
•
5.83k
•
•
536
zai-org/GLM-4.6V-Flash
Image-Text-to-Text
•
10B
•
Updated
28 days ago
•
309k
•
•
527
browser-use/bu-30b-a3b-preview
Image-Text-to-Text
•
31B
•
Updated
13 days ago
•
6.41k
•
231
janhq/Jan-v2-VL-max-Instruct-FP8
Image-Text-to-Text
•
31B
•
Updated
6 days ago
•
46
•
8
kacperwikiel/RysOCR
Image-Text-to-Text
•
Updated
7 days ago
•
130
•
7
google/medgemma-27b-it
Image-Text-to-Text
•
29B
•
Updated
Jul 10, 2025
•
11.7k
•
260
Qwen/Qwen3-VL-8B-Thinking
Image-Text-to-Text
•
9B
•
Updated
Nov 26, 2025
•
131k
•
171
Qwen/Qwen3-VL-4B-Instruct-GGUF
Image-Text-to-Text
•
4B
•
Updated
Nov 1, 2025
•
16.4k
•
28
zai-org/AutoGLM-Phone-9B
Image-Text-to-Text
•
934k
•
Updated
28 days ago
•
93.8k
•
402
microsoft/Florence-2-large
Image-Text-to-Text
•
0.8B
•
Updated
Aug 4, 2025
•
739k
•
1.73k
HuggingFaceTB/SmolVLM-256M-Instruct
Image-Text-to-Text
•
0.3B
•
Updated
Apr 8, 2025
•
126k
•
322
rednote-hilab/dots.ocr
Image-Text-to-Text
•
3B
•
Updated
Oct 31, 2025
•
605k
•
1.17k
LiquidAI/LFM2-VL-450M
Image-Text-to-Text
•
0.5B
•
Updated
about 17 hours ago
•
6.4k
•
145
openbmb/MiniCPM-V-4_5
Image-Text-to-Text
•
9B
•
Updated
19 days ago
•
41.7k
•
1.04k
Qwen/Qwen3-VL-8B-Instruct-GGUF
Image-Text-to-Text
•
8B
•
Updated
Nov 1, 2025
•
28.4k
•
42
Previous
1
2
3
...
100
Next