Whisper Models Dutch Language Collection This repo contains Dutch Whisper models finetuned on CV and other synthetic data, with different filtering options • 11 items • Updated Sep 16 • 1
Whisper Models Portuguese Language Collection This Repo contains Whisper models trained on subsets of data like Common Voice 17(CV_17), Synthetic(Generated by OpenAI) + CV17 and Synthetic Only. • 15 items • Updated 11 days ago • 1
Parallel Sentences Datasets Collection These datasets all have "english" and "non_english" columns for numerous datasets. They can be used to make embedding models multilingual. • 14 items • Updated Feb 25 • 20
Seamless: Multilingual Expressive and Streaming Speech Translation Paper • 2312.05187 • Published Dec 8, 2023 • 14