Young-Jun Lee PRO

passing2961

https://sites.google.com/view/passing2961/home

AI & ML interests

Social Dialogue System, Multi-Modal Dialogue

Recent Activity

upvoted 3 papers about 16 hours ago

Reasoning over mathematical objects: on-policy reward modeling and test time aggregation

Paper • 2603.18886 • Published 1 day ago • 2

ProRL Agent: Rollout-as-a-Service for RL Training of Multi-Turn LLM Agents

Paper • 2603.18815 • Published 1 day ago • 5

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Paper • 2603.19220 • Published 1 day ago • 28

upvoted a paper 4 days ago

Grounding World Simulation Models in a Real-World Metropolis

Paper • 2603.15583 • Published 4 days ago • 137

upvoted 6 papers 5 days ago

MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data

Paper • 2603.09206 • Published 11 days ago • 51

Can Large Language Models Keep Up? Benchmarking Online Adaptation to Continual Knowledge Streams

Paper • 2603.07392 • Published 13 days ago • 17

OpenClaw-RL: Train Any Agent Simply by Talking

Paper • 2603.10165 • Published 10 days ago • 132

XSkill: Continual Learning from Experience and Skills in Multimodal Agents

Paper • 2603.12056 • Published 9 days ago • 29

EvoScientist: Towards Multi-Agent Evolving AI Scientists for End-to-End Scientific Discovery

Paper • 2603.08127 • Published 12 days ago • 14

daVinci-Env: Open SWE Environment Synthesis at Scale

Paper • 2603.13023 • Published 8 days ago • 29

upvoted 5 papers 11 days ago

OfficeQA Pro: An Enterprise Benchmark for End-to-End Grounded Reasoning

Paper • 2603.08655 • Published 11 days ago • 3

How Far Can Unsupervised RLVR Scale LLM Training?

Paper • 2603.08660 • Published 11 days ago • 56

Mozi: Governed Autonomy for Drug Discovery LLM Agents

Paper • 2603.03655 • Published 17 days ago • 4

SkillNet: Create, Evaluate, and Connect AI Skills

Paper • 2603.04448 • Published 23 days ago • 88

AutoResearch-RL: Perpetual Self-Evaluating Reinforcement Learning Agents for Autonomous Neural Architecture Discovery

Paper • 2603.07300 • Published 13 days ago • 16

upvoted 3 papers 16 days ago

Beyond Language Modeling: An Exploration of Multimodal Pretraining

Paper • 2603.03276 • Published 17 days ago • 97

Qwen3-Coder-Next Technical Report

Paper • 2603.00729 • Published 21 days ago • 57

BeyondSWE: Can Current Code Agent Survive Beyond Single-Repo Bug Fixing?

Paper • 2603.03194 • Published 17 days ago • 56

upvoted 2 papers 18 days ago

CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

Paper • 2602.24286 • Published 21 days ago • 96

Agentic Code Reasoning

Paper • 2603.01896 • Published 19 days ago • 9