Young-Jun Lee's picture

Young-Jun Lee PRO

passing2961

·

https://sites.google.com/view/passing2961/home

AI & ML interests

Social Dialogue System, Multi-Modal Dialogue

Recent Activity

upvoted a paper 4 days ago

Reasoning over mathematical objects: on-policy reward modeling and test time aggregation

upvoted a paper 4 days ago

ProRL Agent: Rollout-as-a-Service for RL Training of Multi-Turn LLM Agents

upvoted a paper 4 days ago

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

View all activity

Organizations

upvoted 3 papers 4 days ago

Reasoning over mathematical objects: on-policy reward modeling and test time aggregation

Paper • 2603.18886 • Published 5 days ago • 3

ProRL Agent: Rollout-as-a-Service for RL Training of Multi-Turn LLM Agents

Paper • 2603.18815 • Published 5 days ago • 10

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Paper • 2603.19220 • Published 4 days ago • 53

upvoted a paper 7 days ago

Grounding World Simulation Models in a Real-World Metropolis

Paper • 2603.15583 • Published 7 days ago • 145

upvoted 6 papers 8 days ago

MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data

Paper • 2603.09206 • Published 14 days ago • 52

Can Large Language Models Keep Up? Benchmarking Online Adaptation to Continual Knowledge Streams

Paper • 2603.07392 • Published 16 days ago • 17

OpenClaw-RL: Train Any Agent Simply by Talking

Paper • 2603.10165 • Published 13 days ago • 137

XSkill: Continual Learning from Experience and Skills in Multimodal Agents

Paper • 2603.12056 • Published 12 days ago • 30

EvoScientist: Towards Multi-Agent Evolving AI Scientists for End-to-End Scientific Discovery

Paper • 2603.08127 • Published 15 days ago • 15

daVinci-Env: Open SWE Environment Synthesis at Scale

Paper • 2603.13023 • Published 11 days ago • 29

upvoted 5 papers 14 days ago

OfficeQA Pro: An Enterprise Benchmark for End-to-End Grounded Reasoning

Paper • 2603.08655 • Published 14 days ago • 3

How Far Can Unsupervised RLVR Scale LLM Training?

Paper • 2603.08660 • Published 14 days ago • 57

Mozi: Governed Autonomy for Drug Discovery LLM Agents

Paper • 2603.03655 • Published 20 days ago • 4

SkillNet: Create, Evaluate, and Connect AI Skills

Paper • 2603.04448 • Published 26 days ago • 90

AutoResearch-RL: Perpetual Self-Evaluating Reinforcement Learning Agents for Autonomous Neural Architecture Discovery

Paper • 2603.07300 • Published 16 days ago • 17

upvoted 3 papers 19 days ago

Beyond Language Modeling: An Exploration of Multimodal Pretraining

Paper • 2603.03276 • Published 20 days ago • 100

Qwen3-Coder-Next Technical Report

Paper • 2603.00729 • Published 24 days ago • 60

BeyondSWE: Can Current Code Agent Survive Beyond Single-Repo Bug Fixing?

Paper • 2603.03194 • Published 20 days ago • 56

upvoted 2 papers 21 days ago

CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

Paper • 2602.24286 • Published 24 days ago • 97

Agentic Code Reasoning

Paper • 2603.01896 • Published 22 days ago • 9