An open-source audio understanding model supporting speech recognition, environmental sound analysis, music understanding, time-aware QA, and complex
-
MOSS Audio 8B Thinking
16 • Generate answers to audio or video prompts
-
OpenMOSS-Team/MOSS-Audio-4B-Instruct
Audio-Text-to-Text • 5B • Updated • 2.95k • 51
-
OpenMOSS-Team/MOSS-Audio-4B-Thinking
Audio-Text-to-Text • 5B • Updated • 891 • 28
-
OpenMOSS-Team/MOSS-Audio-8B-Instruct
Audio-Text-to-Text • 9B • Updated • 1.78k • 37