Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B Paper • 2511.06221 • Published Nov 9, 2025 • 132
Running on CPU Upgrade Featured 2.88k The Smol Training Playbook 📚 2.88k The secrets to building world-class LLMs
Running 77 Unlocking On-Policy Distillation for Any Model Family 📝 77 Apply on-policy distillation to any model family
intfloat/multilingual-e5-large-instruct Feature Extraction • 0.6B • Updated Jul 10, 2025 • 1.21M • • 600
100 Days After DeepSeek-R1: A Survey on Replication Studies and More Directions for Reasoning Language Models Paper • 2505.00551 • Published May 1, 2025 • 36