Running Featured 67 Distilling 100B+ Models 40x Faster with TRL 📝 67 TRL distillation for 100B+ teachers, 40x faster
Pushing on Multilingual Reasoning Models with Language-Mixed Chain-of-Thought Paper • 2510.04230 • Published Oct 5, 2025 • 27