10 16 37

Zhouliang Yu

zhouliang

https://zhouliang-yu.github.io

zhouliang-yu

AI & ML interests

Model-Based AI, Reinforcement Learning, Autoformalization

Recent Activity

authored a paper 2 days ago

Stabilizing Rubric Integration Training via Decoupled Advantage Normalization

liked a dataset 5 days ago

nohurry/Opus-4.6-Reasoning-3000x-filtered

liked a model 7 days ago

Jackrong/Qwopus3.5-4B-v3

View all activity

Organizations

authored a paper 2 days ago

Stabilizing Rubric Integration Training via Decoupled Advantage Normalization

Paper • 2603.26535 • Published 16 days ago • 3

liked a dataset 5 days ago

nohurry/Opus-4.6-Reasoning-3000x-filtered

Viewer • Updated 12 days ago • 2.33k • 9.99k • 533

liked a model 7 days ago

Jackrong/Qwopus3.5-4B-v3

Image-Text-to-Text • 5B • Updated 6 days ago • 2.07k • 9

upvoted a paper about 1 month ago

CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

Paper • 2602.24286 • Published Feb 27 • 97

liked 2 datasets about 1 month ago

BytedTsinghua-SIA/CUDA-Agent-Ops-6K

Viewer • Updated Feb 27 • 6k • 299 • 59

Goedel-LM/SFT_dataset_v2

Viewer • Updated Mar 2 • 1.75M • 721 • 29

liked 4 datasets about 2 months ago

upvoted a paper about 2 months ago

Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL

Paper • 2602.03773 • Published Feb 3 • 13

liked a dataset about 2 months ago

FrenzyMath/Herald_proofs

Viewer • Updated May 13, 2025 • 44.6k • 183 • 3

liked 2 datasets 2 months ago

INSAIT-Institute/OPC

Viewer • Updated Jul 15, 2025 • 4.93k • 134 • 14

wenjiema02/ProofBench

Viewer • Updated Oct 14, 2025 • 899 • 86 • 7

upvoted a paper 2 months ago

Steering LLMs via Scalable Interactive Oversight

Paper • 2602.04210 • Published Feb 4 • 18

upvoted an article 2 months ago

Article

What's Automatic Differentiation?

Mar 19, 2024

•

liked 2 datasets 3 months ago

ulamai/UnsolvedMath

Updated Feb 4 • 53 • 23

phanerozoic/Lean4-Mathlib

Viewer • Updated Jan 10 • 193k • 94 • 2

liked a dataset 4 months ago

nvidia/Nemotron-Math-Proofs-v1

Viewer • Updated Jan 5 • 925k • 1.75k • 117

published a dataset 5 months ago

zhouliang/DEMIMathAnalysis

Viewer • Updated Feb 27, 2025 • 88 • 3

Zhouliang Yu

AI & ML interests

Recent Activity

Organizations

zhouliang's activity

What's Automatic Differentiation?