Multimodal OCR
🍍
385
nanonets ocr2 / olmocr / qwen2vl ocr / aya vision / rolmocr
Generate text from an image and question
Detect and label segments in book images
Extract text from images in multiple languages
Recognize text in images