Spatial Reasoning with Vision-Language Models in Ego-Centric Multi-View Scenes Paper • 2509.06266 • Published Sep 8, 2025 • 11
From Segments to Scenes: Temporal Understanding in Autonomous Driving via Vision-Language Model Paper • 2512.05277 • Published about 1 month ago • 4