AlbMoRe: A Corpus of Movie Reviews for Sentiment Analysis in Albanian
Paper
•
2306.08526
•
Published
Create README.md
Albanian ALBERT model pretrained on around 16GB of text (I used uonlp/CulturaX's sq configuration) and 1.1 million training steps, using only the masked language modelling task. Trained on a TPU v4-32 pod, made possible through the Google TPU Research Cloud.
Hyperparameters:
Going to post the model's performance evaluated on different Albanian downstream tasks once I'm done evaluating the model.
| Task | Learning Rate | Number of epochs | Accuracy | Precision | Recall | F1 score |
|---|---|---|---|---|---|---|
| AlbMoRe[1]. | 1e-05 | 10 | 0.98 | 0.97 | 0.99 | 0.98 |
TODO
[1] Çano, E. (2023). Albmore: A corpus of movie reviews for sentiment analysis in albanian. arXiv preprint arXiv:2306.08526.