X-Stream: Exploring MLLMs as Multiplexers for Multi-Stream Understanding Paper • 2606.02482 • Published 2 days ago • 24
Harness-1: Reinforcement Learning for Search Agents with State-Externalizing Harnesses Paper • 2606.02373 • Published 2 days ago • 34
Draft-OPD: On-Policy Distillation for Speculative Draft Models Paper • 2605.29343 • Published 6 days ago • 27
NITP: Next Implicit Token Prediction for LLM Pre-training Paper • 2605.24956 • Published 10 days ago • 27
Masking Stale Observations Helps Search Agents -- Until It Doesn't: A Regime Map and Its Mechanism Paper • 2606.00408 • Published 5 days ago • 45
Domino: Decoupling Causal Modeling from Autoregressive Drafting in Speculative Decoding Paper • 2605.29707 • Published 6 days ago • 52
On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters Paper • 2606.02437 • Published 2 days ago • 88
Decentralized Instruction Tuning: Conflict-Aware Splitting and Weight Merging Paper • 2606.01717 • Published 2 days ago • 7
Benchmarking Visual State Tracking in Multimodal Video Understanding Paper • 2606.03920 • Published 1 day ago • 3
Humanoid-GPT: Scaling Data and Structure for Zero-Shot Motion Tracking Paper • 2606.03985 • Published 1 day ago • 10
Language Models Need Sleep: Learning to Self-Modify and Consolidate Memories Paper • 2606.03979 • Published 1 day ago • 5
Image-Free Timestep Distillation via Continuous-Time Consistency with Trajectory-Sampled Pairs Paper • 2511.20410 • Published Nov 25, 2025 • 4
SNOOPI: Supercharged One-step Diffusion Distillation with Proper Guidance Paper • 2412.02687 • Published Dec 3, 2024 • 114
V_kD: Improving Knowledge Distillation using Orthogonal Projections Paper • 2403.06213 • Published Mar 10, 2024 • 3
ORPO-Distill: Mixed-Policy Preference Optimization for Cross-Architecture LLM Distillation Paper • 2509.25100 • Published Sep 29, 2025 • 2
Linear Projections of Teacher Embeddings for Few-Class Distillation Paper • 2409.20449 • Published Sep 30, 2024 • 1
ERNIE-Tiny : A Progressive Distillation Framework for Pretrained Transformer Compression Paper • 2106.02241 • Published Jun 4, 2021 • 1
Automatic Prompt Optimization with Prompt Distillation Paper • 2508.18992 • Published Aug 26, 2025 • 4