arxiv:2603.10178
Huanxin Sheng
HuanxinSheng
AI & ML interests
None yet
Recent Activity
upvoted a collection 1 day ago
Nemotron-Post-Training-v3
commented on a paper 5 days ago
Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe
commented on a paper 5 days ago
Lightning OPD: Efficient Post-Training for Large Reasoning Models with Offline On-Policy Distillation