Attention Drift: What Autoregressive Speculative Decoding Models Learn Paper • 2605.09992 • Published 7 days ago • 1
SpecBundle Collection A collection of production-grade draft models for speculative decoding • 18 items • Updated Apr 15 • 17