Vchitect

non-profit

https://vchitect.intern-ai.org.cn/

AI & ML interests

generative models, video generation

Recent Activity

ynhe updated a dataset about 14 hours ago

Vchitect/VBench_human_annotation

Preview • Updated about 14 hours ago • 21 • 1

zhengli1013 authored a paper 1 day ago

EditThinker: Unlocking Iterative Reasoning for Any Image Editor

Paper • 2512.05965 • Published 4 days ago • 33

syingxi updated a dataset 3 days ago

Vchitect/VBench_sampled_video

Viewer • Updated 3 days ago • 200 • 214 • 1

ChenyangSi authored a paper 5 days ago

PosterCopilot: Toward Layout Reasoning and Controllable Editing for Professional Graphic Design

Paper • 2512.04082 • Published 6 days ago • 12

zhengli1013 authored 3 papers 5 days ago

Estimator Meets Equilibrium Perspective: A Rectified Straight Through Estimator for Binary Neural Networks Training

Paper • 2308.06689 • Published Aug 13, 2023

SpatialDreamer: Self-supervised Stereo Video Synthesis from Monocular Input

Paper • 2411.11934 • Published Nov 18, 2024

OneThinker: All-in-one Reasoning Model for Image and Video

Paper • 2512.03043 • Published 7 days ago • 29

zhengli1013 authored 3 papers 6 days ago

ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models

Paper • 2506.21356 • Published Jun 26 • 22

Uni-MMMU: A Massive Multi-discipline Multimodal Unified Benchmark

Paper • 2510.13759 • Published Oct 15 • 9

Panorama Generation From NFoV Image Done Right

Paper • 2503.18420 • Published Mar 24 • 1

zhengli1013 authored 2 papers 8 days ago

VBench-2.0: Advancing Video Generation Benchmark Suite for Intrinsic Faithfulness

Paper • 2503.21755 • Published Mar 27 • 33

Architecture Decoupling Is Not All You Need For Unified Multimodal Model

Paper • 2511.22663 • Published 12 days ago • 28

ynhe updated a dataset 12 days ago

Vchitect/VBench-2.0_human_annotation

Preview • Updated 12 days ago • 66 • 1

awojustin authored 3 papers 13 days ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21 • 256

ExpVid: A Benchmark for Experiment Video Understanding & Reasoning

Paper • 2510.11606 • Published Oct 13 • 3

VEU-Bench: Towards Comprehensive Understanding of Video Editing

Paper • 2504.17828 • Published Apr 24

yumingj authored a paper 15 days ago

RynnVLA-002: A Unified Vision-Language-Action and World Model

Paper • 2511.17502 • Published 18 days ago • 24

Ziqi authored 3 papers 21 days ago

Uni-MMMU: A Massive Multi-discipline Multimodal Unified Benchmark

Paper • 2510.13759 • Published Oct 15 • 9

RealDPO: Real or Not Real, that is the Preference

Paper • 2510.14955 • Published Oct 16 • 6

Simulating the Visual World with Artificial Intelligence: A Roadmap

Paper • 2511.08585 • Published 28 days ago • 29