stepfun-ai/Step-Audio-R1
Audio-Text-to-Text
•
33B
•
Updated
•
480
•
127
Open source models with audio understanding. Tracking mostly vendor releases in the audio and text to text subclassification of multimodal.
Analyze audio to recognize speech, translate, and more