TRACE: A Unified Rollout Budget Allocation Framework for Efficient Agentic Reinforcement Learning Paper • 2606.11119 • Published 20 days ago • 18
sentence-transformers/all-MiniLM-L6-v2 Sentence Similarity • 22.7M • Updated 28 days ago • 244M • • 5.02k
FastKernels: Benchmarking GPU Kernel Generation in Production Paper • 2605.23215 • Published May 22 • 8
LiveBrowseComp: Are Search Agents Searching, or Just Verifying What They Already Know? Paper • 2605.28721 • Published May 27 • 18
Skill0.5: Joint Skill Internalization and Utilization for Out-of-Distribution Generalization in Agentic Reinforcement Learning Paper • 2605.28424 • Published May 27 • 32
DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards Paper • 2605.21467 • Published May 20 • 207