DONGRYEOLLEE

drlee1

2 188 124

DONGRYEOLLEE1

AI & ML interests

None yet

Recent Activity

upvoted a paper about 3 hours ago

Multi-Block Diffusion Language Models

upvoted a paper 5 days ago

Improved Large Language Diffusion Models

upvoted a paper 7 days ago

Grouped Query Experts: Mixture-of-Experts on GQA Self-Attention

View all activity

Organizations

None yet

upvoted a paper about 3 hours ago

Multi-Block Diffusion Language Models

Paper • 2606.29215 • Published 1 day ago • 16

upvoted a paper 5 days ago

Improved Large Language Diffusion Models

Paper • 2606.25331 • Published 7 days ago • 42

upvoted a paper 7 days ago

Grouped Query Experts: Mixture-of-Experts on GQA Self-Attention

Paper • 2606.20945 • Published 13 days ago • 76

liked a model 9 days ago

LiquidAI/LFM2.5-Embedding-350M

liked a dataset 12 days ago

lordx64/agentic-distill-fable-5-sft

Viewer • Updated 16 days ago • 4.66k • 1.41k • 52

liked a model 12 days ago

WeiboAI/VibeThinker-3B

Text Generation • 3B • Updated about 23 hours ago • 72.7k • • 757

upvoted a paper 13 days ago

Learning from the Self-future: On-policy Self-distillation for dLLMs

Paper • 2606.18195 • Published 15 days ago • 76

upvoted a paper 15 days ago

FastContext: Training Efficient Repository Explorer for Coding Agents

Paper • 2606.14066 • Published 19 days ago • 93

upvoted 2 papers 16 days ago

MaxProof: Scaling Mathematical Proof with Generative-Verifier RL and Population-Level Test-Time Scaling

Paper • 2606.13473 • Published 20 days ago • 92

MiniMax Sparse Attention

Paper • 2606.13392 • Published 20 days ago • 148

liked a model 16 days ago

nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16

Text Generation • 32B • Updated Mar 15 • 1.04M • • 785

liked a model 19 days ago

jinaai/jina-embeddings-v5-text-small

Feature Extraction • 0.6B • Updated Apr 15 • 363k • 184

upvoted 2 papers 20 days ago

MoBA: Mixture of Block Attention for Long-Context LLMs

Paper • 2502.13189 • Published Feb 18, 2025 • 19

SearchSwarm: Towards Delegation Intelligence in Agentic LLMs for Long-Horizon Deep Research

Paper • 2606.09730 • Published 23 days ago • 54

liked a dataset 21 days ago

m-a-p/CodeFeedback-Filtered-Instruction

Viewer • Updated Feb 26, 2024 • 157k • 18.6k • 204

liked a model 22 days ago

ny1031/Qwen3-1.7B-SFT-RLVR-IF

Text Generation • 2B • Updated May 6 • 5 • 1

liked a dataset 22 days ago

allenai/tulu-3-sft-mixture

Viewer • Updated Dec 2, 2024 • 939k • 18k • 251

upvoted 2 papers 26 days ago

Domain-Specific Data Synthesis for LLMs via Minimal Sufficient Representation Learning

Paper • 2605.30039 • Published May 29 • 20

On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters

Paper • 2606.02437 • Published about 1 month ago • 236

upvoted a paper 29 days ago

K-BrowseComp: A Web Browsing Agent Benchmark Grounded in Korean Contexts

Paper • 2606.02404 • Published about 1 month ago • 59

DONGRYEOLLEE

AI & ML interests

Recent Activity

Organizations

drlee1's activity