16 18 16

Alex Jinpeng Wang

Awiny

https://fingerrec.github.io

FingerRec

AI & ML interests

Multi-Modality Pre-training, Data-Centric AI, Video Self-supervised Learning

Recent Activity

liked a model 7 days ago

CSU-JPG/Glance

upvoted a paper 7 days ago

Glance: Accelerating Diffusion Models with 1 Sample

upvoted a paper 23 days ago

WEAVE: Unleashing and Benchmarking the In-context Interleaved Comprehension and Generation

View all activity

Organizations

liked a model 7 days ago

CSU-JPG/Glance

Text-to-Image • Updated 6 days ago • 345 • • 14

upvoted a paper 7 days ago

Glance: Accelerating Diffusion Models with 1 Sample

Paper • 2512.02899 • Published 8 days ago • 25

upvoted a paper 23 days ago

WEAVE: Unleashing and Benchmarking the In-context Interleaved Comprehension and Generation

Paper • 2511.11434 • Published 26 days ago • 44

liked a Space about 1 month ago

VCode

🐨

Convert images to SVG code

updated a Space about 1 month ago

README

📈

liked a dataset about 1 month ago

CSU-JPG/Chart2Code

Updated 2 days ago • 254 • 4

updated a collection about 1 month ago

🔱 Sailor2 Language Models

Collection

Sailing in South-East Asia with Inclusive Multilingual LLMs • 34 items • Updated 21 days ago • 30

upvoted 2 papers about 1 month ago

VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation

Paper • 2511.02778 • Published Nov 4 • 101

UniLumos: Fast and Unified Image and Video Relighting with Physics-Plausible Feedback

Paper • 2511.01678 • Published Nov 3 • 34

upvoted a paper about 2 months ago

From Charts to Code: A Hierarchical Benchmark for Multimodal Models

Paper • 2510.17932 • Published Oct 20 • 7

New activity in deepseek-ai/DeepSeek-OCR about 2 months ago

Clarifying Prior Research on Visual Compression of Textual Contexts

❤️ 👍 14

#18 opened about 2 months ago by

Awiny

upvoted a paper 2 months ago

Paper2Video: Automatic Video Generation from Scientific Papers

Paper • 2510.05096 • Published Oct 6 • 117

liked a dataset 5 months ago

CSU-JPG/MVPBench

Viewer • Updated May 15 • 4.7k • 51 • 1

liked a model 6 months ago

showlab/show-o2-1.5B-HQ

Any-to-Any • Updated Sep 5 • 70 • 3

authored a paper 8 months ago

V-MAGE: A Game Evaluation Framework for Assessing Visual-Centric Capabilities in Multimodal Large Language Models

Paper • 2504.06148 • Published Apr 8 • 13

upvoted a paper 8 months ago

V-MAGE: A Game Evaluation Framework for Assessing Visual-Centric Capabilities in Multimodal Large Language Models

Paper • 2504.06148 • Published Apr 8 • 13

commented a paper 8 months ago

V-MAGE: A Game Evaluation Framework for Assessing Visual-Centric Capabilities in Multimodal Large Language Models

Paper • 2504.06148 • Published Apr 8 • 13 •

authored a paper 9 months ago

Beyond Words: Advancing Long-Text Image Generation via Multimodal Autoregressive Models

Paper • 2503.20198 • Published Mar 26 • 4

upvoted a paper 9 months ago

Beyond Words: Advancing Long-Text Image Generation via Multimodal Autoregressive Models

Paper • 2503.20198 • Published Mar 26 • 4

commented a paper 9 months ago

Beyond Words: Advancing Long-Text Image Generation via Multimodal Autoregressive Models

Paper • 2503.20198 • Published Mar 26 • 4 •

Alex Jinpeng Wang

AI & ML interests

Recent Activity

Organizations

Awiny's activity

VCode

README

Clarifying Prior Research on Visual Compression of Textual Contexts