19 18 15

Wu Chengyue

WuChengyue

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

upvoted a paper about 1 month ago

INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats

updated a model about 2 months ago

Efficient-Large-Model/Fast_dLLM_v2_7B

View all activity

Organizations

upvoted a paper 3 days ago

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

Paper • 2511.21689 • Published 11 days ago • 95

upvoted a paper about 1 month ago

INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats

Paper • 2510.25602 • Published Oct 29 • 76

upvoted a collection 2 months ago

Fast-dLLM

Collection

Efficient Diffusion LLM • 4 items • Updated Oct 8 • 7

upvoted a paper 3 months ago

Discrete Diffusion VLA: Bringing Discrete Diffusion to Action Decoding in Vision-Language-Action Policies

Paper • 2508.20072 • Published Aug 27 • 31

upvoted 2 papers 5 months ago

Scaling RL to Long Videos

Paper • 2507.07966 • Published Jul 10 • 159

Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation

Paper • 2507.01957 • Published Jul 2 • 21

upvoted a paper 6 months ago

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published May 30 • 142

upvoted a paper 7 months ago

Scaling Law for Quantization-Aware Training

Paper • 2505.14302 • Published May 20 • 76

upvoted a paper 9 months ago

BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing

Paper • 2503.13434 • Published Mar 17 • 27

upvoted a paper 12 months ago

BrushEdit: All-In-One Image Inpainting and Editing

Paper • 2412.10316 • Published Dec 13, 2024 • 35

upvoted a paper about 1 year ago

Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation

Paper • 2410.13848 • Published Oct 17, 2024 • 35

upvoted 4 papers over 1 year ago

What matters when building vision-language models?

Paper • 2405.02246 • Published May 3, 2024 • 103

Plot2Code: A Comprehensive Benchmark for Evaluating Multi-modal Large Language Models in Code Generation from Scientific Plots

Paper • 2405.07990 • Published May 13, 2024 • 20

Adapting LLaMA Decoder to Vision Transformer

Paper • 2404.06773 • Published Apr 10, 2024 • 18

Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset

Paper • 2403.09029 • Published Mar 14, 2024 • 55

upvoted a collection almost 2 years ago

AnyLLM-Pro

Collection

6 items • Updated Feb 27, 2024 • 4

upvoted 2 papers almost 2 years ago

FiT: Flexible Vision Transformer for Diffusion Model

Paper • 2402.12376 • Published Feb 19, 2024 • 48

Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models

Paper • 2402.13064 • Published Feb 20, 2024 • 50

Wu Chengyue

AI & ML interests

Recent Activity

Organizations

WuChengyue's activity