moonshotai/Kimi-VL-A3B-Thinking-2506 Image-Text-to-Text • 16B • Updated Aug 18, 2025 • 164k • 335
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 153k • 1.56k