HAODONG DUAN

KennyUTC

https://kennymckormick.github.io

AI & ML interests

Video Understanding; Multi-Modal Learning

Recent Activity

liked a dataset 3 days ago

internlm/WildClawBench

Updated 1 day ago • 3.64k • 29

upvoted a paper about 2 months ago

RISE-Video: Can Video Generators Decode Implicit World Rules?

Paper • 2602.05986 • Published Feb 5 • 26

authored a paper 3 months ago

SciEvalKit: An Open-source Evaluation Toolkit for Scientific General Intelligence

Paper • 2512.22334 • Published Dec 26, 2025 • 36

upvoted a paper 3 months ago

Both Semantics and Reconstruction Matter: Making Representation Encoders Ready for Text-to-Image Generation and Editing

Paper • 2512.17909 • Published Dec 19, 2025 • 37

upvoted a paper 4 months ago

ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning

Paper • 2512.05111 • Published Dec 4, 2025 • 50

upvoted 3 papers 5 months ago

When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought

Paper • 2511.02779 • Published Nov 4, 2025 • 60

JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence

Paper • 2510.23538 • Published Oct 27, 2025 • 98

Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence

Paper • 2510.20579 • Published Oct 23, 2025 • 56

authored 3 papers 5 months ago

A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers

Paper • 2508.21148 • Published Aug 28, 2025 • 142

SPARK: Synergistic Policy And Reward Co-Evolving Framework

Paper • 2509.22624 • Published Sep 26, 2025 • 19

MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Policy Optimization

Paper • 2510.08540 • Published Oct 9, 2025 • 110

upvoted a paper 6 months ago

MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Policy Optimization

Paper • 2510.08540 • Published Oct 9, 2025 • 110

upvoted 4 papers 7 months ago

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

Paper • 2509.02544 • Published Sep 2, 2025 • 127

A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers

Paper • 2508.21148 • Published Aug 28, 2025 • 142

ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows

Paper • 2505.19897 • Published May 26, 2025 • 104

CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning

Paper • 2508.20096 • Published Aug 27, 2025 • 37

authored 4 papers 7 months ago

MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence

Paper • 2505.23764 • Published May 29, 2025 • 3

Towards Storage-Efficient Visual Document Retrieval: An Empirical Study on Reducing Patch-Level Embeddings

Paper • 2506.04997 • Published Jun 5, 2025

MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents

Paper • 2507.19478 • Published Jul 25, 2025 • 33

GOBench: Benchmarking Geometric Optics Generation and Understanding of MLLMs

Paper • 2506.00991 • Published Jun 1, 2025
