arxiv:2507.16815
Fu-En Yang
FuEnYang
AI & ML interests
Computer Vision, Deep Learning, Vision-Language Models (VLMs), Vision-Language-Action Models (VLAs), Reasoning Models, Embodied AI
Recent Activity
upvoted
a
paper
about 24 hours ago
EMMA: Efficient Multimodal Understanding, Generation, and Editing with a Unified Architecture
upvoted
a
paper
about 24 hours ago
Unified Video Editing with Temporal Reasoner
upvoted
a
paper
about 24 hours ago
Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance