-
ReportBench: Evaluating Deep Research Agents via Academic Survey Tasks
Paper • 2508.15804 • Published • 15 -
StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?
Paper • 2510.02209 • Published • 52 -
Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning
Paper • 2511.16043 • Published • 104
Tobias Völzing
wumingshi
·
AI & ML interests
None yet
Recent Activity
updated
a collection
4 days ago
Reasoning
updated
a collection
14 days ago
Agents
updated
a collection
22 days ago
REL
Organizations
None yet