haodi lei's picture

🔄 In a Training Loop

haodi lei

bingyang-lei

·

https://www.haodilei.top/

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

Qwen-AgentWorld: Language World Models for General Agents

upvoted a paper 5 days ago

NatureBench: Can Coding Agents Match the Published SOTA of Nature-Family Papers?

upvoted a paper 6 days ago

EnterpriseClawBench: Benchmarking Agents from Real Workplace Sessions

View all activity

Organizations

upvoted 2 papers 5 days ago

Qwen-AgentWorld: Language World Models for General Agents

Paper • 2606.24597 • Published 6 days ago • 139

NatureBench: Can Coding Agents Match the Published SOTA of Nature-Family Papers?

Paper • 2606.24530 • Published 6 days ago • 61

upvoted a paper 6 days ago

EnterpriseClawBench: Benchmarking Agents from Real Workplace Sessions

Paper • 2606.23654 • Published 7 days ago • 79

upvoted 3 papers 17 days ago

MaxProof: Scaling Mathematical Proof with Generative-Verifier RL and Population-Level Test-Time Scaling

Paper • 2606.13473 • Published 18 days ago • 92

FORT-Searcher: Synthesizing Shortcut-Resistant Search Tasks for Training Deep Search Agents

Paper • 2606.12087 • Published 19 days ago • 77

MiniMax Sparse Attention

Paper • 2606.13392 • Published 18 days ago • 148

upvoted a paper 18 days ago

ComBench: A Benchmark for Rigorous Proof Reasoning and Constructive Realization in Olympiad-Level Combinatorics

Paper • 2606.10479 • Published 20 days ago • 19

upvoted a collection 20 days ago

Draft-OPD

6 items • Updated 25 days ago • 2

upvoted a paper 21 days ago

SceneCode: Executable World Programs for Editable Indoor Scenes with Articulated Objects

Paper • 2605.19587 • Published May 19 • 10

upvoted a paper 27 days ago

Draft-OPD: On-Policy Distillation for Speculative Draft Models

Paper • 2605.29343 • Published May 28 • 36

upvoted a paper 28 days ago

COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation

Paper • 2605.31264 • Published May 29 • 120

upvoted a collection about 1 month ago

Qwen3

84 items • Updated Dec 31, 2025 • 1.82k

upvoted a paper about 1 month ago

π-Bench: Evaluating Proactive Personal Assistant Agents in Long-Horizon Workflows

Paper • 2605.14678 • Published May 19 • 108

upvoted a collection about 1 month ago

DeepSeek-V4

6 items • Updated 2 days ago • 702

upvoted 2 papers about 1 month ago

EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL

Paper • 2605.18703 • Published May 18 • 50

Post-Trained MoE Can Skip Half Experts via Self-Distillation

Paper • 2605.18643 • Published May 18 • 30

upvoted a paper about 2 months ago

Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling

Paper • 2605.13301 • Published May 13 • 165

upvoted a paper 3 months ago

Qwen3-Omni Technical Report

Paper • 2509.17765 • Published Sep 22, 2025 • 154

upvoted a collection 4 months ago

DFlash

Block Diffusion for Flash Speculative Decoding • 23 items • Updated about 15 hours ago • 139

upvoted a paper 4 months ago

DFlash: Block Diffusion for Flash Speculative Decoding

Paper • 2602.06036 • Published Feb 5 • 88