| Main Authors: | Sun, Yuchen, Fu, Pei, Zhang, Shaojie, Du, Anan, Xi, Xiuwen, Zhang, Ruoceng, Luo, Zhenbo, Luan, Jian, Zhang, Chongyang |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.14311 |
Similar Items
Enhancing Trustworthy GUI Grounding via Self-Critiqued Reinforcement Learning
by: Zhang, Shaojie, et al.
Published: (2025)
GAIA: A Data Flywheel System for Training GUI Test-Time Scaling Critic Models
by: Wang, Shaokang, et al.
Published: (2026)
MobileViews: A Million-scale and Diverse Mobile GUI Dataset
by: Gao, Longxi, et al.
Published: (2024)
BTL-UI: Blink-Think-Link Reasoning Model for GUI Agent
by: Zhang, Shaojie, et al.
Published: (2025)
MagicGUI: A Foundational Mobile GUI Agent with Scalable Data Pipeline and Reinforcement Fine-tuning
by: Tang, Liujian, et al.
Published: (2025)
Automatic Semantic Alignment of Flow Pattern Representations for Exploration with Large Language Models
by: Zhang, Weihan, et al.
Published: (2025)
OmniGUI: Benchmarking GUI Agents in Omni-Modal Smartphone Environments
by: Henry, Felix, et al.
Published: (2026)
SeeClick: Harnessing GUI Grounding for Advanced Visual GUI Agents
by: Cheng, Kanzhi, et al.
Published: (2024)
CritiqueCrew: Orchestrating Multi-Perspective Conversational Design Critique
by: Chen, Xiaojiao, et al.
Published: (2026)
Beyond Chat and Clicks: GUI Agents for In-Situ Assistance via Live Interface Transformation
by: Hao, Pan, et al.
Published: (2026)
Beyond the Single Turn: Reframing Refusals as Dynamic Experiences Embedded in the Context of Mental Health Support Interactions with LLMs
by: Tang, Ningjing, et al.
Published: (2026)
Characterizing Unintended Consequences in Human-GUI Agent Collaboration for Web Browsing
by: Zhang, Shuning, et al.
Published: (2025)
Creation, Critique, and Consumption: Exploring Generative AI Descriptions for Supporting Blind and Low Vision Professionals with Visual Tasks
by: Jiang, Lucy, et al.
Published: (2025)
OS-Kairos: Adaptive Interaction for MLLM-Powered GUI Agents
by: Cheng, Pengzhou, et al.
Published: (2025)
API Agents vs. GUI Agents: Divergence and Convergence
by: Zhang, Chaoyun, et al.
Published: (2025)
No Evidence for LLMs Being Useful in Problem Reframing
by: Shin, Joongi, et al.
Published: (2025)
AmbiBench: Benchmarking Mobile GUI Agents Beyond One-Shot Instructions in the Wild
by: Sun, Jiazheng, et al.
Published: (2026)
GraphPilot: GUI Task Automation with One-Step LLM Reasoning Powered by Knowledge Graph
by: Yu, Mingxian, et al.
Published: (2026)
GUI Agents: A Survey
by: Nguyen, Dang, et al.
Published: (2024)
Reframing Human-Robot Interaction Through Extended Reality: Unlocking Safer, Smarter, and More Empathic Interactions with Virtual Robots and Foundation Models
by: Zhang, Yuchong, et al.
Published: (2025)
Between Puppet and Actor: Reframing Authorship in this Age of AI Agents
by: Sun, Yuqian, et al.
Published: (2025)
CRAFT-GUI: Curriculum-Reinforced Agent For GUI Tasks
by: Nong, Songqin, et al.
Published: (2025)
Reframe Anything: LLM Agent for Open World Video Reframing
by: Cao, Jiawang, et al.
Published: (2024)
Aria-UI: Visual Grounding for GUI Instructions
by: Yang, Yuhao, et al.
Published: (2024)
Beyond Likes: How Normative Feedback Complements Engagement Signals on Social Media
by: Wu, Yuchen, et al.
Published: (2025)
Beyond Clicking: A Step Towards Generalist GUI Grounding via Text Dragging
by: Liao, Zeyi, et al.
Published: (2025)
A Survey on (M)LLM-Based GUI Agents
by: Tang, Fei, et al.
Published: (2025)
Reframing Pattern: A Comprehensive Approach to a Composite Visual Variable
by: He, Tingying, et al.
Published: (2025)
A Unified, Cross-Platform Framework for Automatic GUI and Plugin Generation in Structural Bioinformatics and Beyond
by: Guo, Sikao, et al.
Published: (2026)
Reframing Conversational Design in HRI: Deliberate Design with AI Scaffolds
by: Cao, Shiye, et al.
Published: (2026)
Arm Robot: AR-Enhanced Embodied Control and Visualization for Intuitive Robot Arm Manipulation
by: Pei, Siyou, et al.
Published: (2024)
Large Language Model-Brained GUI Agents: A Survey
by: Zhang, Chaoyun, et al.
Published: (2024)
Detecting the Use of Generative AI in Crowdsourced Surveys: Implications for Data Integrity
by: Zhang, Dapeng, et al.
Published: (2025)
UIPro: Unleashing Superior Interaction Capability For GUI Agents
by: Li, Hongxin, et al.
Published: (2025)
LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects
by: Liu, Guangyi, et al.
Published: (2025)
An Audio-Visual Fusion Emotion Generation Model Based on Neuroanatomical Alignment
by: Wang, Haidong, et al.
Published: (2025)
RemixTape: Enriching Narratives about Metrics with Semantic Alignment and Contextual Recommendation
by: Brehmer, Matthew, et al.
Published: (2024)
AMEX: Android Multi-annotation Expo Dataset for Mobile GUI Agents
by: Chai, Yuxiang, et al.
Published: (2024)
DECAN: A Denoising Encoder via Contrastive Alignment Network for Dry Electrode EEG Emotion Recognition
by: Zhang, Meihong, et al.
Published: (2024)
The Behavioral Fabric of LLM-Powered GUI Agents: Human Values and Interaction Outcomes
by: Gebreegziabher, Simret Araya, et al.
Published: (2026)