SWE-EVO: Benchmarking Coding Agents in Long-Horizon Software Evolution Scenarios Paper • 2512.18470 • Published Dec 20, 2025 • 12
Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning Paper • 2510.19338 • Published Oct 22, 2025 • 117
view article Article SmolLM3: smol, multilingual, long-context reasoner +21 eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf • Jul 8, 2025 • 775
Data Mixing Laws: Optimizing Data Mixtures by Predicting Language Modeling Performance Paper • 2403.16952 • Published Mar 25, 2024 • 1
Large Language Models Struggle to Learn Long-Tail Knowledge Paper • 2211.08411 • Published Nov 15, 2022 • 3
Large Language Model based Multi-Agents: A Survey of Progress and Challenges Paper • 2402.01680 • Published Jan 21, 2024 • 2
Improving Text Embeddings with Large Language Models Paper • 2401.00368 • Published Dec 31, 2023 • 83
How Does Generative Retrieval Scale to Millions of Passages? Paper • 2305.11841 • Published May 19, 2023 • 4
Enhancing Zero-Shot Chain-of-Thought Reasoning in Large Language Models through Logic Paper • 2309.13339 • Published Sep 23, 2023 • 3