:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Dai, Gaole, Jiang, Shiqi, Cao, Ting, Yang, Yuqing, Li, Yuanchun, Tan, Rui, Li, Mo, Qiu, Lili
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2509.21823
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Advancing Mobile GUI Agents: A Verifier-Driven Approach to Practical Deployment
by: Dai, Gaole, et al.
Published: (2025)

Expand Heterogeneous Learning Systems with Selective Multi-Source Knowledge Fusion
by: Dai, Gaole, et al.
Published: (2024)

Babel: A Scalable Pre-trained Model for Multi-Modal Sensing via Expandable Modality Alignment
by: Dai, Shenghong, et al.
Published: (2024)

GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents
by: Wu, Qianhui, et al.
Published: (2025)

AppAgent-Pro: A Proactive GUI Agent System for Multidomain Information Integration and User Assistance
by: Zhao, Yuyang, et al.
Published: (2025)

PIRA-Bench: A Transition from Reactive GUI Agents to GUI-based Proactive Intent Recommendation Agents
by: Chai, Yuxiang, et al.
Published: (2026)

ProActor: Timing-Aware Reinforcement Learning for Proactive Task Scheduling Agents
by: Ding, Lei, et al.
Published: (2026)

InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners
by: Liu, Yuhang, et al.
Published: (2025)

Agentic Reward Modeling: Verifying GUI Agent via Online Proactive Interaction
by: Cui, Chaoqun, et al.
Published: (2026)

GUI-Xplore: Empowering Generalizable GUI Agents with One Exploration
by: Sun, Yuchen, et al.
Published: (2025)

ProAgent: Harnessing On-Demand Sensory Contexts for Proactive LLM Agent Systems in the Wild
by: Yang, Bufang, et al.
Published: (2025)

Adaptive Milestone Reward for GUI Agents
by: Zheng, Congmin, et al.
Published: (2026)

Video-in-the-Loop: Span-Grounded Long Video QA with Interleaved Reasoning
by: Wang, Chendong, et al.
Published: (2025)

GUI-GENESIS: Automated Synthesis of Efficient Environments with Verifiable Rewards for GUI Agent Post-Training
by: Cao, Yuan, et al.
Published: (2026)

AVA: Towards Agentic Video Analytics with Vision Language Models
by: Yan, Yuxuan, et al.
Published: (2025)

ProBench: Benchmarking GUI Agents with Accurate Process Information
by: Yang, Leyang, et al.
Published: (2025)

ProReason: Multi-Modal Proactive Reasoning with Decoupled Eyesight and Wisdom
by: Zhou, Jingqi, et al.
Published: (2024)

From Control to Foresight: Simulation as a New Paradigm for Human-Agent Collaboration
by: He, Gaole, et al.
Published: (2026)

Proactive Detection of GUI Defects in Multi-Window Scenarios via Multimodal Reasoning
by: Zhang, Xinyao, et al.
Published: (2026)

ReMe: Scaffolding Personalized Cognitive Training via Controllable LLM-Mediated Conversations
by: Wang, Zilong, et al.
Published: (2024)

ProAgentBench: Evaluating LLM Agents for Proactive Assistance with Real-World Data
by: Tang, Yuanbo, et al.
Published: (2026)

MagicGUI-RMS: A Multi-Agent Reward Model System for Self-Evolving GUI Agents via Automated Feedback Reflux
by: Li, Zecheng, et al.
Published: (2026)

Empowering In-Browser Deep Learning Inference on Edge Devices with Just-in-Time Kernel Optimizations
by: Jia, Fucheng, et al.
Published: (2023)

History-Aware Reasoning for GUI Agents
by: Wang, Ziwei, et al.
Published: (2025)

GUI-PRA: Process Reward Agent for GUI Tasks
by: Xiong, Tao, et al.
Published: (2025)

ProAct: A Dual-System Framework for Proactive Embodied Social Agents
by: Zhang, Zeyi, et al.
Published: (2026)

GUI-ReWalk: Massive Data Generation for GUI Agent via Stochastic Exploration and Intent-Aware Reasoning
by: Lin, Musen, et al.
Published: (2025)

OmniActor: A Generalist GUI and Embodied Agent for 2D&3D Worlds
by: Yang, Longrong, et al.
Published: (2025)

Proactive Gradient Conflict Mitigation in Multi-Task Learning: A Sparse Training Perspective
by: Zhang, Zhi, et al.
Published: (2024)

PACT: Proactive Asking for Continual Task Assistance in Human-Robot Collaboration
by: He, Chengbo, et al.
Published: (2026)

Anatomizing Deep Learning Inference in Web Browsers
by: Wang, Qipeng, et al.
Published: (2024)

ProgRM: Build Better GUI Agents with Progress Rewards
by: Zhang, Danyang, et al.
Published: (2025)

ProFocus: Proactive Perception and Focused Reasoning in Vision-and-Language Navigation
by: Xue, Wei, et al.
Published: (2026)

Mobile GUI Agents under Real-world Threats: Are We There Yet?
by: Liu, Guohong, et al.
Published: (2025)

ProAgent: Building Proactive Cooperative Agents with Large Language Models
by: Zhang, Ceyao, et al.
Published: (2023)

AgentProg: Empowering Long-Horizon GUI Agents with Program-Guided Context Management
by: Tian, Shizuo, et al.
Published: (2025)

GUI-KV: Efficient GUI Agents via KV Cache with Spatio-Temporal Awareness
by: Huang, Kung-Hsiang, et al.
Published: (2025)

DocOS: Towards Proactive Document-Guided Actions in GUI Agents
by: Liu, Jingjing, et al.
Published: (2026)

PRISM: Festina Lente Proactivity -- Risk-Sensitive, Uncertainty-Aware Deliberation for Proactive Agents
by: Fu, Yuxuan, et al.
Published: (2026)

BTL-UI: Blink-Think-Link Reasoning Model for GUI Agent
by: Zhang, Shaojie, et al.
Published: (2025)