JKKJJJ

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 22 days ago

MVI-Bench: A Comprehensive Benchmark for Evaluating Robustness to Misleading Visual Inputs in LVLMs

Paper • 2511.14159 • Published 23 days ago • 24

upvoted 3 papers 2 months ago

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9 • 266

UniVideo: Unified Understanding, Generation, and Editing for Videos

Paper • 2510.08377 • Published Oct 9 • 70

Tree-based Dialogue Reinforced Policy Optimization for Red-Teaming Attacks

Paper • 2510.02286 • Published Oct 2 • 28

upvoted 2 papers 3 months ago

Reconstruction Alignment Improves Unified Multimodal Models

Paper • 2509.07295 • Published Sep 8 • 40

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

Paper • 2509.00676 • Published Aug 31 • 84

upvoted 4 papers 6 months ago

AMO Sampler: Enhancing Text Rendering with Overshooting

Paper • 2411.19415 • Published Nov 28, 2024 • 5

LaTtE-Flow: Layerwise Timestep-Expert Flow-based Transformer

Paper • 2506.06952 • Published Jun 8 • 9

On Path to Multimodal Generalist: General-Level and General-Bench

Paper • 2505.04620 • Published May 7 • 82

Muddit: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model

Paper • 2505.23606 • Published May 29 • 14

upvoted 2 papers 7 months ago

KRIS-Bench: Benchmarking Next-Level Intelligent Image Editing Models

Paper • 2505.16707 • Published May 22 • 45

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

Paper • 2505.09568 • Published May 14 • 97