Cosmos Policy: Fine-Tuning Video Models for Visuomotor Control and Planning Paper • 2601.16163 • Published about 16 hours ago • 5
Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces Paper • 2601.11868 • Published 6 days ago • 8
Rethinking Video Generation Model for the Embodied World Paper • 2601.15282 • Published 1 day ago • 38
Facilitating Proactive and Reactive Guidance for Decision Making on the Web: A Design Probe with WebSeek Paper • 2601.15100 • Published 2 days ago • 1
Render-of-Thought: Rendering Textual Chain-of-Thought as Images for Visual Latent Reasoning Paper • 2601.14750 • Published 2 days ago • 14
The Assistant Axis: Situating and Stabilizing the Default Persona of Language Models Paper • 2601.10387 • Published 8 days ago • 10
FrankenMotion: Part-level Human Motion Generation and Composition Paper • 2601.10909 • Published 7 days ago • 18
AstroReason-Bench: Evaluating Unified Agentic Planning across Heterogeneous Space Planning Problems Paper • 2601.11354 • Published 7 days ago • 4
BAPO: Boundary-Aware Policy Optimization for Reliable Agentic Search Paper • 2601.11037 • Published 7 days ago • 16
Transition Matching Distillation for Fast Video Generation Paper • 2601.09881 • Published 9 days ago • 31
FlowAct-R1: Towards Interactive Humanoid Video Generation Paper • 2601.10103 • Published 8 days ago • 30
Inference-time Physics Alignment of Video Generative Models with Latent World Models Paper • 2601.10553 • Published 8 days ago • 12
Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding Paper • 2601.10611 • Published 8 days ago • 26