Saved in:
| Main Authors: | Dai, Gaole, Jiang, Shiqi, Cao, Ting, Yang, Yuqing, Li, Yuanchun, Tan, Rui, Li, Mo, Qiu, Lili |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.21823 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Advancing Mobile GUI Agents: A Verifier-Driven Approach to Practical Deployment
by: Dai, Gaole, et al.
Published: (2025)
by: Dai, Gaole, et al.
Published: (2025)
Expand Heterogeneous Learning Systems with Selective Multi-Source Knowledge Fusion
by: Dai, Gaole, et al.
Published: (2024)
by: Dai, Gaole, et al.
Published: (2024)
Babel: A Scalable Pre-trained Model for Multi-Modal Sensing via Expandable Modality Alignment
by: Dai, Shenghong, et al.
Published: (2024)
by: Dai, Shenghong, et al.
Published: (2024)
GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents
by: Wu, Qianhui, et al.
Published: (2025)
by: Wu, Qianhui, et al.
Published: (2025)
AppAgent-Pro: A Proactive GUI Agent System for Multidomain Information Integration and User Assistance
by: Zhao, Yuyang, et al.
Published: (2025)
by: Zhao, Yuyang, et al.
Published: (2025)
PIRA-Bench: A Transition from Reactive GUI Agents to GUI-based Proactive Intent Recommendation Agents
by: Chai, Yuxiang, et al.
Published: (2026)
by: Chai, Yuxiang, et al.
Published: (2026)
ProActor: Timing-Aware Reinforcement Learning for Proactive Task Scheduling Agents
by: Ding, Lei, et al.
Published: (2026)
by: Ding, Lei, et al.
Published: (2026)
InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners
by: Liu, Yuhang, et al.
Published: (2025)
by: Liu, Yuhang, et al.
Published: (2025)
Agentic Reward Modeling: Verifying GUI Agent via Online Proactive Interaction
by: Cui, Chaoqun, et al.
Published: (2026)
by: Cui, Chaoqun, et al.
Published: (2026)
GUI-Xplore: Empowering Generalizable GUI Agents with One Exploration
by: Sun, Yuchen, et al.
Published: (2025)
by: Sun, Yuchen, et al.
Published: (2025)
ProAgent: Harnessing On-Demand Sensory Contexts for Proactive LLM Agent Systems in the Wild
by: Yang, Bufang, et al.
Published: (2025)
by: Yang, Bufang, et al.
Published: (2025)
Adaptive Milestone Reward for GUI Agents
by: Zheng, Congmin, et al.
Published: (2026)
by: Zheng, Congmin, et al.
Published: (2026)
Video-in-the-Loop: Span-Grounded Long Video QA with Interleaved Reasoning
by: Wang, Chendong, et al.
Published: (2025)
by: Wang, Chendong, et al.
Published: (2025)
GUI-GENESIS: Automated Synthesis of Efficient Environments with Verifiable Rewards for GUI Agent Post-Training
by: Cao, Yuan, et al.
Published: (2026)
by: Cao, Yuan, et al.
Published: (2026)
AVA: Towards Agentic Video Analytics with Vision Language Models
by: Yan, Yuxuan, et al.
Published: (2025)
by: Yan, Yuxuan, et al.
Published: (2025)
ProBench: Benchmarking GUI Agents with Accurate Process Information
by: Yang, Leyang, et al.
Published: (2025)
by: Yang, Leyang, et al.
Published: (2025)
ProReason: Multi-Modal Proactive Reasoning with Decoupled Eyesight and Wisdom
by: Zhou, Jingqi, et al.
Published: (2024)
by: Zhou, Jingqi, et al.
Published: (2024)
From Control to Foresight: Simulation as a New Paradigm for Human-Agent Collaboration
by: He, Gaole, et al.
Published: (2026)
by: He, Gaole, et al.
Published: (2026)
Proactive Detection of GUI Defects in Multi-Window Scenarios via Multimodal Reasoning
by: Zhang, Xinyao, et al.
Published: (2026)
by: Zhang, Xinyao, et al.
Published: (2026)
ReMe: Scaffolding Personalized Cognitive Training via Controllable LLM-Mediated Conversations
by: Wang, Zilong, et al.
Published: (2024)
by: Wang, Zilong, et al.
Published: (2024)
ProAgentBench: Evaluating LLM Agents for Proactive Assistance with Real-World Data
by: Tang, Yuanbo, et al.
Published: (2026)
by: Tang, Yuanbo, et al.
Published: (2026)
MagicGUI-RMS: A Multi-Agent Reward Model System for Self-Evolving GUI Agents via Automated Feedback Reflux
by: Li, Zecheng, et al.
Published: (2026)
by: Li, Zecheng, et al.
Published: (2026)
Empowering In-Browser Deep Learning Inference on Edge Devices with Just-in-Time Kernel Optimizations
by: Jia, Fucheng, et al.
Published: (2023)
by: Jia, Fucheng, et al.
Published: (2023)
History-Aware Reasoning for GUI Agents
by: Wang, Ziwei, et al.
Published: (2025)
by: Wang, Ziwei, et al.
Published: (2025)
GUI-PRA: Process Reward Agent for GUI Tasks
by: Xiong, Tao, et al.
Published: (2025)
by: Xiong, Tao, et al.
Published: (2025)
ProAct: A Dual-System Framework for Proactive Embodied Social Agents
by: Zhang, Zeyi, et al.
Published: (2026)
by: Zhang, Zeyi, et al.
Published: (2026)
GUI-ReWalk: Massive Data Generation for GUI Agent via Stochastic Exploration and Intent-Aware Reasoning
by: Lin, Musen, et al.
Published: (2025)
by: Lin, Musen, et al.
Published: (2025)
OmniActor: A Generalist GUI and Embodied Agent for 2D&3D Worlds
by: Yang, Longrong, et al.
Published: (2025)
by: Yang, Longrong, et al.
Published: (2025)
Proactive Gradient Conflict Mitigation in Multi-Task Learning: A Sparse Training Perspective
by: Zhang, Zhi, et al.
Published: (2024)
by: Zhang, Zhi, et al.
Published: (2024)
PACT: Proactive Asking for Continual Task Assistance in Human-Robot Collaboration
by: He, Chengbo, et al.
Published: (2026)
by: He, Chengbo, et al.
Published: (2026)
Anatomizing Deep Learning Inference in Web Browsers
by: Wang, Qipeng, et al.
Published: (2024)
by: Wang, Qipeng, et al.
Published: (2024)
ProgRM: Build Better GUI Agents with Progress Rewards
by: Zhang, Danyang, et al.
Published: (2025)
by: Zhang, Danyang, et al.
Published: (2025)
ProFocus: Proactive Perception and Focused Reasoning in Vision-and-Language Navigation
by: Xue, Wei, et al.
Published: (2026)
by: Xue, Wei, et al.
Published: (2026)
Mobile GUI Agents under Real-world Threats: Are We There Yet?
by: Liu, Guohong, et al.
Published: (2025)
by: Liu, Guohong, et al.
Published: (2025)
ProAgent: Building Proactive Cooperative Agents with Large Language Models
by: Zhang, Ceyao, et al.
Published: (2023)
by: Zhang, Ceyao, et al.
Published: (2023)
AgentProg: Empowering Long-Horizon GUI Agents with Program-Guided Context Management
by: Tian, Shizuo, et al.
Published: (2025)
by: Tian, Shizuo, et al.
Published: (2025)
GUI-KV: Efficient GUI Agents via KV Cache with Spatio-Temporal Awareness
by: Huang, Kung-Hsiang, et al.
Published: (2025)
by: Huang, Kung-Hsiang, et al.
Published: (2025)
DocOS: Towards Proactive Document-Guided Actions in GUI Agents
by: Liu, Jingjing, et al.
Published: (2026)
by: Liu, Jingjing, et al.
Published: (2026)
PRISM: Festina Lente Proactivity -- Risk-Sensitive, Uncertainty-Aware Deliberation for Proactive Agents
by: Fu, Yuxuan, et al.
Published: (2026)
by: Fu, Yuxuan, et al.
Published: (2026)
BTL-UI: Blink-Think-Link Reasoning Model for GUI Agent
by: Zhang, Shaojie, et al.
Published: (2025)
by: Zhang, Shaojie, et al.
Published: (2025)
Similar Items
-
Advancing Mobile GUI Agents: A Verifier-Driven Approach to Practical Deployment
by: Dai, Gaole, et al.
Published: (2025) -
Expand Heterogeneous Learning Systems with Selective Multi-Source Knowledge Fusion
by: Dai, Gaole, et al.
Published: (2024) -
Babel: A Scalable Pre-trained Model for Multi-Modal Sensing via Expandable Modality Alignment
by: Dai, Shenghong, et al.
Published: (2024) -
GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents
by: Wu, Qianhui, et al.
Published: (2025) -
AppAgent-Pro: A Proactive GUI Agent System for Multidomain Information Integration and User Assistance
by: Zhao, Yuyang, et al.
Published: (2025)