A2RBench: An Automatic Paradigm for Formally Verifiable Abstract Reasoning Benchmark Generation Paper • 2605.17278 • Published 11 days ago • 4
SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning Paper • 2603.23483 • Published Mar 24 • 63