Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
1
Libraries
Languages
Licenses
Other
Tasks
Reset Tasks
Text Generation
Any-to-Any
Image-Text-to-Text
Image-to-Text
Image-to-Image
Text-to-Image
Text-to-Video
Text-to-Speech
+ 44
Parameters
Reset Parameters
< 1B
6B
12B
32B
128B
> 500B
< 1B
> 500B
Libraries
PyTorch
google-tensorflow
TensorFlow
JAX
Transformers
Diffusers
sentence-transformers
Safetensors
ONNX
GGUF
Transformers.js
MLX
+ 41
Apps
vLLM
llama.cpp
MLX LM
LM Studio
Ollama
Jan
Draw Things
+ 7
Inference Providers
Groq
Novita
Cerebras
SambaNova
Nscale
fal
Hyperbolic
Together AI
+ 10
Apply filters
Models
7,196
Full-text search
Inference Available
Edit filters
Sort: Trending
Active filters:
image-text-to-text
Clear all
moonshotai/Kimi-K2.5
Image-Text-to-Text
•
171B
•
Updated
4 days ago
•
405k
•
•
1.88k
internlm/Intern-S1-Pro
Image-Text-to-Text
•
Updated
4 days ago
•
8.06k
•
208
deepseek-ai/DeepSeek-OCR-2
Image-Text-to-Text
•
3B
•
Updated
6 days ago
•
592k
•
712
PaddlePaddle/PaddleOCR-VL-1.5
Image-Text-to-Text
•
1.0B
•
Updated
10 days ago
•
8.5k
•
364
lightonai/LightOnOCR-2-1B
Image-Text-to-Text
•
1B
•
Updated
6 days ago
•
188k
•
504
trillionlabs/gWorld-8B
Image-Text-to-Text
•
9B
•
Updated
5 days ago
•
258
•
34
tencent/Youtu-VL-4B-Instruct
Image-Text-to-Text
•
5B
•
Updated
4 days ago
•
3.89k
•
144
Qwen/Qwen3-VL-8B-Instruct
Image-Text-to-Text
•
9B
•
Updated
Oct 15, 2025
•
2.63M
•
•
734
google/medgemma-1.5-4b-it
Image-Text-to-Text
•
Updated
16 days ago
•
156k
•
420
Hcompany/Holo2-235B-A22B
Image-Text-to-Text
•
236B
•
Updated
6 days ago
•
162
•
21
trillionlabs/gWorld-32B
Image-Text-to-Text
•
33B
•
Updated
5 days ago
•
258
•
23
google/gemma-3-4b-it
Image-Text-to-Text
•
Updated
Mar 21, 2025
•
1.05M
•
1.16k
google/translategemma-4b-it
Image-Text-to-Text
•
Updated
11 days ago
•
109k
•
602
google/gemma-3-27b-it
Image-Text-to-Text
•
Updated
Mar 21, 2025
•
1.65M
•
•
1.86k
deepseek-ai/DeepSeek-OCR
Image-Text-to-Text
•
3B
•
Updated
Nov 4, 2025
•
3.04M
•
3.14k
ByteDance-Seed/UI-TARS-1.5-7B
Image-Text-to-Text
•
8B
•
Updated
Apr 18, 2025
•
67.2k
•
511
bakrianoo/arabic-legal-documents-ocr-1.0
Image-Text-to-Text
•
4B
•
Updated
5 days ago
•
341
•
14
google/medgemma-4b-it
Image-Text-to-Text
•
Updated
Oct 28, 2025
•
361k
•
882
stepfun-ai/Step3-VL-10B
Image-Text-to-Text
•
10B
•
Updated
5 days ago
•
81.7k
•
387
nvidia/Cosmos-Reason2-8B
Image-Text-to-Text
•
9B
•
Updated
9 days ago
•
160k
•
109
Qwen/Qwen3-VL-4B-Instruct
Image-Text-to-Text
•
4B
•
Updated
Oct 15, 2025
•
827k
•
327
PaddlePaddle/PaddleOCR-VL
Image-Text-to-Text
•
1.0B
•
Updated
4 days ago
•
16.4k
•
1.54k
Qwen/Qwen3-VL-2B-Instruct
Image-Text-to-Text
•
2B
•
Updated
Oct 23, 2025
•
1.49M
•
311
tencent/Youtu-VL-4B-Instruct-GGUF
Image-Text-to-Text
•
5B
•
Updated
4 days ago
•
4.08k
•
57
ibm-granite/granite-docling-258M
Image-Text-to-Text
•
0.3B
•
Updated
Sep 23, 2025
•
214k
•
1.12k
openbmb/MiniCPM-V-4_5
Image-Text-to-Text
•
9B
•
Updated
Dec 18, 2025
•
71.8k
•
1.06k
google/translategemma-27b-it
Image-Text-to-Text
•
Updated
11 days ago
•
37.7k
•
295
Qwen/Qwen2.5-VL-3B-Instruct
Image-Text-to-Text
•
4B
•
Updated
Apr 6, 2025
•
21.5M
•
605
Qwen/Qwen2.5-VL-7B-Instruct
Image-Text-to-Text
•
8B
•
Updated
Apr 6, 2025
•
3.37M
•
•
1.45k
google/medgemma-27b-it
Image-Text-to-Text
•
Updated
Jul 10, 2025
•
19.6k
•
297
Previous
1
2
3
...
100
Next