LYNX: Learning Dynamic Exits for Confidence-Controlled Reasoning Paper • 2512.05325 • Published Dec 5, 2025 • 2
Textual Steering Vectors Can Improve Visual Understanding in Multimodal Large Language Models Paper • 2505.14071 • Published May 20, 2025 • 1
Textual Steering Vectors Can Improve Visual Understanding in Multimodal Large Language Models Paper • 2505.14071 • Published May 20, 2025 • 1
Zebra-CoT: A Dataset for Interleaved Vision Language Reasoning Paper • 2507.16746 • Published Jul 22, 2025 • 35
AION-1: Omnimodal Foundation Model for Astronomical Sciences Paper • 2510.17960 • Published Oct 20, 2025 • 29
Localized Gaussian Splatting Editing with Contextual Awareness Paper • 2408.00083 • Published Jul 31, 2024
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems Paper • 2504.01990 • Published Mar 31, 2025 • 301
DiffPortrait3D: Controllable Diffusion for Zero-Shot Portrait View Synthesis Paper • 2312.13016 • Published Dec 20, 2023 • 6
DiffPortrait360: Consistent Portrait Diffusion for 360 View Synthesis Paper • 2503.15667 • Published Mar 19, 2025 • 8
METAGENE-1: Metagenomic Foundation Model for Pandemic Monitoring Paper • 2501.02045 • Published Jan 3, 2025 • 22
Game-theoretic LLM: Agent Workflow for Negotiation Games Paper • 2411.05990 • Published Nov 8, 2024 • 8
Euclid: Supercharging Multimodal LLMs with Synthetic High-Fidelity Visual Descriptions Paper • 2412.08737 • Published Dec 11, 2024 • 54
D3RoMa: Disparity Diffusion-based Depth Sensing for Material-Agnostic Robotic Manipulation Paper • 2409.14365 • Published Sep 22, 2024
SAGE: Bridging Semantic and Actionable Parts for GEneralizable Manipulation of Articulated Objects Paper • 2312.01307 • Published Dec 3, 2023
On Retrieval Augmentation and the Limitations of Language Model Training Paper • 2311.09615 • Published Nov 16, 2023 • 1
DeLLMa: A Framework for Decision Making Under Uncertainty with Large Language Models Paper • 2402.02392 • Published Feb 4, 2024 • 6