The Meta-Agent Challenge: Are Current Agents Capable of Autonomous Agent Development? Paper • 2606.04455 • Published 9 days ago • 3
The Meta-Agent Challenge: Are Current Agents Capable of Autonomous Agent Development? Paper • 2606.04455 • Published 9 days ago • 3
SoFA: Shielded On-the-fly Alignment via Priority Rule Following Paper • 2402.17358 • Published Feb 27, 2024 • 1
Scalable Oversight for Superhuman AI via Recursive Self-Critiquing Paper • 2502.04675 • Published Feb 7, 2025 • 1
On-Policy Self-Alignment with Fine-grained Knowledge Feedback for Hallucination Mitigation Paper • 2406.12221 • Published Jun 18, 2024
Rethinking Reward Model Evaluation: Are We Barking up the Wrong Tree? Paper • 2410.05584 • Published Oct 8, 2024
LiteCoder-Terminal: Scaling Long-Horizon Terminal Environments for Learning Language Agents Paper • 2605.29559 • Published 15 days ago • 17
LiteCoder-Terminal: Scaling Long-Horizon Terminal Environments for Learning Language Agents Paper • 2605.29559 • Published 15 days ago • 17
Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering Paper • 2411.11504 • Published Nov 18, 2024 • 24
Towards Scalable Automated Alignment of LLMs: A Survey Paper • 2406.01252 • Published Jun 3, 2024 • 3