Diffusion Large Language Models with a SOTA Accuracy–Parallelism Trade-off
AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
Think-Then-Generate: Reasoning-Aware Text-to-Image Diffusion with LLM Encoders
LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding
models 13
SJTU-DENG-Lab/LightningRL-8B-b32-MBPP
Text Generation • 8B • Updated
SJTU-DENG-Lab/LightningRL-8B-b32-MATH500
Text Generation • 8B • Updated
SJTU-DENG-Lab/LightningRL-8B-b32-GSM8K
Text Generation • 8B • Updated
SJTU-DENG-Lab/LightningRL-8B-b32-HumanEval
Text Generation • 8B • Updated
SJTU-DENG-Lab/Think-Then-Generate-T2I
Text-to-Image • Updated
• 22 • 2
SJTU-DENG-Lab/D2F_DiffuCoder_Instruct_7B_Lora
Text Generation • Updated
SJTU-DENG-Lab/D2F_Dream_Instruct_7B_Lora
Text Generation • Updated
SJTU-DENG-Lab/D2F_Dream_Base_7B_Lora
Text Generation • Updated
• 4
SJTU-DENG-Lab/D2F_LLaDA_Instruct_8B_Lora
Text Generation • Updated
• 5
SJTU-DENG-Lab/UniCMs-512
Updated
• 1
datasets 7
SJTU-DENG-Lab/PrimeIntellect
Viewer
• Updated
• 5.95k • 3
SJTU-DENG-Lab/GSM8K_train
Viewer
• Updated
• 7.47k • 4
SJTU-DENG-Lab/MATH_train
Viewer
• Updated
• 8.52k • 4
SJTU-DENG-Lab/HumanEval
Viewer
• Updated
• 164 • 4
SJTU-DENG-Lab/MBPP
Viewer
• Updated
• 500 • 5
SJTU-DENG-Lab/MATH500
Viewer
• Updated
• 500 • 4
SJTU-DENG-Lab/GSM8K
Viewer
• Updated
• 1.32k • 3