Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction Paper • 2412.04454 • Published Dec 5, 2024 • 71
GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents Paper • 2506.03143 • Published Jun 3, 2025 • 53
Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning Paper • 2505.12370 • Published May 18, 2025
UIShift: Enhancing VLM-based GUI Agents through Self-supervised Reinforcement Learning Paper • 2505.12493 • Published May 18, 2025
GUI Odyssey: A Comprehensive Dataset for Cross-App GUI Navigation on Mobile Devices Paper • 2406.08451 • Published Jun 12, 2024 • 25
GUI-Reflection: Empowering Multimodal GUI Models with Self-Reflection Behavior Paper • 2506.08012 • Published Jun 9, 2025 • 7