XTR Replicability Collection All the models used in experiments from "A Replicability Study of XTR" • 16 items • Updated 13 days ago • 6
Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift • Apr 2 • 895
Article TRL v1.0: Post-Training Library Built to Move with the Field +2 qgallouedec, stevhliu, pcuenq, sergiopaniego • Mar 31 • 51
Article Training and Finetuning Reranker Models with Sentence Transformers tomaarsen • Mar 26, 2025 • 194
Transformers.js V4 demos Collection A collection of demos built with Transformers.js V4 • 24 items • Updated Apr 16 • 58
The Y-Combinator for LLMs: Solving Long-Context Rot with λ-Calculus Paper • 2603.20105 • Published Mar 20 • 37
CodeScout Collection RL-trained code search agents (1.7B, 4B, 14B) that outperform 2–18× larger models using only a Unix terminal. 📄 arxiv.org/abs/2603.17829 • 12 items • Updated Mar 19 • 8
PyLate 🐕 Collection State-of-the-art late interaction models trained using PyLate • 7 items • Updated 6 days ago • 6
ColBERT-Zero 🐶 Collection First large-scale fully pre-trained ColBERT model using only public data, outperforming GTE-ModernColBERT and GTE-ModernBERT • 10 items • Updated Apr 7 • 22
💧 LFM2.5 Collection Collection of post-trained and base LFM2.5 models. • 30 items • Updated Apr 8 • 137
LateOn-Code 💻 Collection State-of-the-art late interaction code retrieval models • 6 items • Updated Apr 7 • 20
Article LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search tooling lightonai • Feb 12 • 56
Article ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models? lightonai • Feb 19 • 21