-
-
-
-
-
-
Inference Providers
Active filters: Sa2VA
Image-Text-to-Text
• 4B • Updated
Dense-World/Sa2VA_InternVL2.5_4b
Image-Text-to-Text
• 4B • Updated
• 8
• 1
Dense-World/Sa2VA_InternVL2.5_8b
Image-Text-to-Text
• 8B • Updated
• 3
Dense-World/Sa2VA_InternVL2.5_26b
Image-Text-to-Text
• 26B • Updated
• 6
Image-Text-to-Text
• Updated
• 110k
• 96
Image-Text-to-Text
• Updated
• 1.32k
• 65
Image-Text-to-Text
• 1B • Updated
• 1.05k
• 29
Image-Text-to-Text
• 26B • Updated
• 109
• 31
Image Segmentation
• 4B • Updated
Image Segmentation
• 1B • Updated
• 9
Image Segmentation
• 8B • Updated
• 7
Image Segmentation
• 26B • Updated
• 3
ByteDance/Sa2VA-InternVL3-2B
Image-Text-to-Text
• 2B • Updated
• 262
• 1
ByteDance/Sa2VA-InternVL3-8B
Image-Text-to-Text
• 8B • Updated
• 140
• 4
ByteDance/Sa2VA-InternVL3-14B
Image-Text-to-Text
• 15B • Updated
• 288
• 9
ByteDance/Sa2VA-Qwen2_5-VL-3B
Image-Text-to-Text
• 4B • Updated
• 131
• 2
ByteDance/Sa2VA-Qwen2_5-VL-7B
Image-Text-to-Text
• 9B • Updated
• 145
• 4
ByteDance/Sa2VA-Qwen3-VL-4B
Image-Text-to-Text
• 5B • Updated
• 1.95k
• 14
ByteDance/Sa2VA-Qwen3-VL-2B
Image-Text-to-Text
• 3B • Updated
• 317
• 16