Parametric Social Identity Injection and Diversification in Public Opinion Simulation Paper • 2603.16142 • Published 11 days ago • 1
Reinforcement Learning from Rich Feedback with Distributional DAgger Paper • 2606.05152 • Published 9 days ago • 3
SubtleMemory: A Benchmark for Fine-Grained Relational Memory Discrimination in Long-Horizon AI Agents Paper • 2606.05761 • Published 8 days ago • 19
AdaPlanBench: Evaluating Adaptive Planning in Large Language Model Agents under World and User Constraints Paper • 2606.05622 • Published 8 days ago • 40
Ψ-Bench: Evaluating Persona-Sensitive Influencing in Persuasive Dialogues Paper • 2606.02754 • Published 11 days ago • 13
GPU Forecasters: Language Models as Selective Surrogates for Kernel Runtime Optimization Paper • 2605.31464 • Published 14 days ago • 2