This collection includes the models used in the paper "Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recip
liyaxuan
lllyx
·
AI & ML interests
None yet
Recent Activity
upvoted a paper about 1 hour ago
Co-Evolving Policy Distillation upvoted a paper 9 days ago
Near-Future Policy Optimization updated a model 16 days ago
lllyx/Qwen3-1.7B-SFTOrganizations
None yet