SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages
Paper: arXiv 2406.10118
SEACrowd is a community movement project aimed at centralizing and standardizing AI resources for Southeast Asian languages, cultures, and/or regions.
Note: Our paper.
Note: Our fine-tuned SEA translationese classifier, based on Microsoft's mDeBERTa model.
Note: Our train/test data for distinguishing translationese from natural text.