view article Article EMO: Pretraining mixture of experts for emergent modularity allenai • 19 days ago • 38
view article Article Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers tomaarsen • Apr 16 • 71
EXAONE 4.5 Collection LG's First Open-Weight Vision-Language Model for Industrial Intelligence • 5 items • Updated Apr 22 • 43
Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing Paper • 2603.12254 • Published Mar 12 • 22
view article Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries +7 aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego • Mar 10 • 157