KaVa: Latent Reasoning via Compressed KV-Cache Distillation Paper • 2510.02312 • Published Oct 2, 2025 • 4
Memory-Efficient Looped Transformer: Decoupling Compute from Memory in Looped Language Models Paper • 2605.07721 • Published 19 days ago • 29