arxiv:2603.02872
Hao Wu
HarrisonWu
AI & ML interests
None yet
Recent Activity
authored a paper 12 days ago
PathMMU: A Massive Multimodal Expert-Level Benchmark for Understanding and Reasoning in Pathology
authored a paper 12 days ago
Speak While Watching: Unleashing TRUE Real-Time Video Understanding Capability of Multimodal Large Language Models
authored a paper 12 days ago
HiDrop: Hierarchical Vision Token Reduction in MLLMs via Late Injection, Concave Pyramid Pruning, and Early Exit
Organizations
None yet