ReMix: Reinforcement routing for mixtures of LoRAs in LLM finetuning Paper • 2603.10160 • Published Mar 10 • 26
Toward Cognitive Supersensing in Multimodal Large Language Model Paper • 2602.01541 • Published Feb 2 • 16
PhysRig: Differentiable Physics-Based Skinning and Rigging Framework for Realistic Articulated Object Modeling Paper • 2506.20936 • Published Jun 26, 2025 • 12
Stable Part Diffusion 4D: Multi-View RGB and Kinematic Parts Video Generation Paper • 2509.10687 • Published Sep 12, 2025 • 7
RGB-Only Supervised Camera Parameter Optimization in Dynamic Scenes Paper • 2509.15123 • Published Sep 18, 2025 • 5
RigMo: Unifying Rig and Motion Learning for Generative Animation Paper • 2601.06378 • Published Jan 10 • 12
RigMo: Unifying Rig and Motion Learning for Generative Animation Paper • 2601.06378 • Published Jan 10 • 12
Drive as You Speak: Enabling Human-Like Interaction with Large Language Models in Autonomous Vehicles Paper • 2309.10228 • Published Sep 19, 2023
On-Board Vision-Language Models for Personalized Autonomous Vehicle Motion Control: System Design and Real-World Validation Paper • 2411.11913 • Published Nov 17, 2024
MedSAM3: Delving into Segment Anything with Medical Concepts Paper • 2511.19046 • Published Nov 24, 2025 • 55
Empowering Multi-Turn Tool-Integrated Reasoning with Group Turn Policy Optimization Paper • 2511.14846 • Published Nov 18, 2025
NTK-approximating MLP Fusion for Efficient Language Model Fine-tuning Paper • 2307.08941 • Published Jul 18, 2023 • 1
Towards Unified Multi-Modal Personalization: Large Vision-Language Models for Generative Recommendation and Beyond Paper • 2403.10667 • Published Mar 15, 2024 • 1
SelfElicit: Your Language Model Secretly Knows Where is the Relevant Evidence Paper • 2502.08767 • Published Feb 12, 2025
SocialGesture: Delving into Multi-person Gesture Understanding Paper • 2504.02244 • Published Apr 3, 2025
Fine-Grained Preference Optimization Improves Spatial Reasoning in VLMs Paper • 2506.21656 • Published Jun 26, 2025 • 16