- START: Self-taught Reasoner with Tools
  Paper • 2503.04625 • Published • 113
- Towards an AI co-scientist
  Paper • 2502.18864 • Published • 51
- SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution
  Paper • 2502.18449 • Published • 75
- MLGym: A New Framework and Benchmark for Advancing AI Research Agents
  Paper • 2502.14499 • Published • 192
yue
tianchi007
AI & ML interests
None yet
Recent Activity
new activity about 1 month ago
neulab/agent-data-collection:whats the differences between std and sft
upvoted a paper about 2 months ago
WithAnyone: Towards Controllable and ID Consistent Image Generation
upvoted a paper 2 months ago
Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing
Organizations
None yet