Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges Paper • 2604.13602 • Published 20 days ago • 31
Self-Evolving LLM Memory Extraction Across Heterogeneous Tasks Paper • 2604.11610 • Published 22 days ago • 7