Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
1
Libraries
Languages
Licenses
Other
Tasks
Reset Tasks
Text Generation
Any-to-Any
Image-Text-to-Text
Image-to-Text
Image-to-Image
Text-to-Image
Text-to-Video
Text-to-Speech
+ 42
Parameters
Reset Parameters
< 1B
6B
12B
32B
128B
> 500B
< 1B
> 500B
Libraries
PyTorch
google-tensorflow
TensorFlow
JAX
Transformers
Diffusers
sentence-transformers
Safetensors
ONNX
GGUF
Transformers.js
MLX
+ 41
Apps
vLLM
TGI
llama.cpp
MLX LM
LM Studio
Ollama
Jan
+ 12
Inference Providers
Groq
Novita
Nebius AI
Cerebras
SambaNova
Nscale
fal
Hyperbolic
+ 11
Apply filters
Models
9,521
Full-text search
Inference Available
Edit filters
Sort: Trending
Active filters:
image-to-text
Clear all
stepfun-ai/GELab-Zero-4B-preview
Image-to-Text
•
4B
•
Updated
5 days ago
•
589
•
88
datalab-to/chandra
Image-to-Text
•
9B
•
Updated
Oct 21
•
94.2k
•
405
lightonai/LightOnOCR-1B-1025
Image-to-Text
•
Updated
12 days ago
•
16.2k
•
179
nvidia/nemotron-ocr-v1
Image-to-Text
•
Updated
25 days ago
•
400
•
41
allenai/olmOCR-2-7B-1025-FP8
Image-to-Text
•
8B
•
Updated
Oct 22
•
420k
•
153
monkt/paddleocr-onnx
Image-to-Text
•
Updated
Oct 7
•
23
Salesforce/blip-image-captioning-base
Image-to-Text
•
Updated
Feb 3
•
2.38M
•
818
allenai/olmOCR-2-7B-1025
Image-to-Text
•
8B
•
Updated
Oct 22
•
31.3k
•
88
XiaomiMiMo/MiMo-Embodied-7B
Image-to-Text
•
8B
•
Updated
15 days ago
•
896
•
45
thesby/Qwen3-VL-8B-NSFW-Caption-V4.5
Image-to-Text
•
9B
•
Updated
28 days ago
•
14.5k
•
39
xtuner/llava-llama-3-8b-v1_1-gguf
Image-to-Text
•
8B
•
Updated
Apr 30, 2024
•
3.14k
•
220
VLM2Vec/VLM2Vec-V2.0
Image-to-Text
•
Updated
Jul 13
•
10.6k
•
19
scb10x/typhoon-ocr1.5-2b
Image-to-Text
•
2B
•
Updated
20 days ago
•
2.57k
•
6
shkb/MemeLeak
Image-to-Text
•
9B
•
Updated
3 days ago
•
87
•
2
microsoft/trocr-base-handwritten
Image-to-Text
•
0.3B
•
Updated
Feb 11
•
105k
•
466
microsoft/trocr-base-printed
Image-to-Text
•
0.3B
•
Updated
May 27, 2024
•
269k
•
200
microsoft/trocr-large-printed
Image-to-Text
•
0.6B
•
Updated
May 27, 2024
•
220k
•
174
nlpconnect/vit-gpt2-image-captioning
Image-to-Text
•
Updated
Feb 27, 2023
•
1.4M
•
920
naver-clova-ix/donut-base
Image-to-Text
•
Updated
Aug 13, 2022
•
43.4k
•
238
facebook/nougat-base
Image-to-Text
•
0.3B
•
Updated
Nov 20, 2023
•
19.5k
•
180
SawanStack/gpt2-image-captioning-onnx
Image-to-Text
•
Updated
Nov 13, 2023
•
4
•
1
OleehyO/TexTeller
Image-to-Text
•
0.3B
•
Updated
Jun 22, 2024
•
7.39k
•
38
breezedeus/pix2text-mfr
Image-to-Text
•
Updated
May 5, 2024
•
162k
•
46
GnanaPrasath/ocr_tamil
Image-to-Text
•
Updated
Feb 14, 2024
•
19
breezedeus/pix2text-mfd
Image-to-Text
•
Updated
Jul 10, 2024
•
90
•
7
DGurgurov/im2latex
Image-to-Text
•
0.2B
•
Updated
Oct 23, 2024
•
571
•
16
ifmain/blip-image2promt-stable-diffusion-base
Image-to-Text
•
0.2B
•
Updated
Aug 4, 2024
•
36
•
3
unsloth/Llama-3.2-11B-Vision-Instruct-bnb-4bit
Image-to-Text
•
6B
•
Updated
Dec 10, 2024
•
454k
•
80
unsloth/Llama-3.2-11B-Vision-Instruct
Image-to-Text
•
11B
•
Updated
Dec 10, 2024
•
24.2k
•
86
HuggingFaceTB/SmolVLM-256M-Base
Image-to-Text
•
0.3B
•
Updated
Jan 20
•
8.56k
•
18
Previous
1
2
3
...
100
Next