rl - a C-Tianyu Collection

C-Tianyu 's Collections

rl

agent

rl

updated Sep 30

No Prompt Left Behind: Exploiting Zero-Variance Prompts in LLM Reinforcement Learning via Entropy-Guided Advantage Shaping

Paper • 2509.21880 • Published Sep 26 • 52