3 10 23

peng

jackphone

AI & ML interests

None yet

Recent Activity

liked a model about 2 months ago

chromadb/context-1

liked a model about 2 months ago

AQ-MedAI/Kimi-K25-eagle3

upvoted an article 3 months ago

DABStep: Data Agent Benchmark for Multi-step Reasoning

View all activity

Organizations

None yet

upvoted an article 3 months ago

Article

DABStep: Data Agent Benchmark for Multi-step Reasoning

eggie5, martinigoyanes, frisokingma, andreumora, lvwerra, thomwolf, m-ric

•

Feb 4, 2025

• 130

upvoted a paper 4 months ago

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Paper • 2509.22186 • Published Sep 26, 2025 • 162

upvoted a paper 10 months ago

Group-in-Group Policy Optimization for LLM Agent Training

Paper • 2505.10978 • Published May 16, 2025 • 21

upvoted a paper about 1 year ago

Demystifying Long Chain-of-Thought Reasoning in LLMs

Paper • 2502.03373 • Published Feb 5, 2025 • 58

upvoted an article over 1 year ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

eliebak, lvwerra, lewtun

•

Jan 28, 2025

• 889

upvoted 2 papers over 1 year ago

Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments

Paper • 2501.10893 • Published Jan 18, 2025 • 26

DynaSaur: Large Language Agents Beyond Predefined Actions

Paper • 2411.01747 • Published Nov 4, 2024 • 37

upvoted 2 articles over 1 year ago

Article

Let's talk about LLM evaluation

clefourrier

•

May 23, 2024

• 209

Article

Everything About Long Context Fine-tuning

wenbopan

•

May 10, 2024

• 56

peng

AI & ML interests

Recent Activity

Organizations

jackphone's activity

DABStep: Data Agent Benchmark for Multi-step Reasoning

Open-R1: a fully open reproduction of DeepSeek-R1

Let's talk about LLM evaluation

Everything About Long Context Fine-tuning