DFlash Collection Block Diffusion for Flash Speculative Decoding • 23 items • Updated 6 days ago • 142
DFlash Collection Block Diffusion for Flash Speculative Decoding • 23 items • Updated 6 days ago • 142
ParoQuant Collection Pairwise Rotation Quantization for Efficient Reasoning LLM Inference • 24 items • Updated 27 days ago • 27