Non-English Embeddings and Models
updated
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
Paper
• 2211.05100
• Published
• 37
Contrastive Language-Image Pre-training for the Italian Language
Paper
• 2108.08688
• Published
• 2
IT5: Large-scale Text-to-text Pretraining for Italian Language
Understanding and Generation
Paper
• 2203.03759
• Published
• 5
Spanish Pre-trained BERT Model and Evaluation Data
Paper
• 2308.02976
• Published
• 3
German FinBERT: A German Pre-trained Language Model
Paper
• 2311.08793
• Published
• 3
German Text Embedding Clustering Benchmark
Paper
• 2401.02709
• Published
• 6
AfroDigits: A Community-Driven Spoken Digit Dataset for African
Languages
Paper
• 2303.12582
• Published
• 21
Text Generation
• 7B • Updated
• 8.88k
• 68
Updated
• 64
• 24
Aya Model: An Instruction Finetuned Open-Access Multilingual Language
Model
Paper
• 2402.07827
• Published
• 48
Viewer
• Updated
• 206k • 3.26k
• 343
CohereLabs/c4ai-command-r-v01
Text Generation
• Updated
• 12.7k
• 1.1k