Stabilizing Reinforcement Learning with LLMs: Formulation and Practices Paper • 2512.01374 • Published about 1 month ago • 93
The Path Not Taken: RLVR Provably Learns Off the Principals Paper • 2511.08567 • Published Nov 11, 2025 • 33
Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective Paper • 2506.14965 • Published Jun 17, 2025 • 49
Article 🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It? Mar 17, 2025 • 348
SoftQE: Learned Representations of Queries Expanded by LLMs Paper • 2402.12663 • Published Feb 20, 2024