DSGym: A Holistic Framework for Evaluating and Training Data Science Agents Paper • 2601.16344 • Published Jan 22 • 12
Build error Agents 25 FutureBench Leaderboard 🔮 25 Display and analyze prediction leaderboard data