:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Wen, Hao, Tian, Shizuo, Pavlov, Borislav, Du, Wenjie, Li, Yixuan, Chang, Ge, Zhao, Shanhui, Liu, Jiacheng, Liu, Yunxin, Zhang, Ya-Qin, Li, Yuanchun
Format:	Preprint
Published:	2024
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2412.18116
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

AutoDroid: LLM-powered Task Automation in Android
by: Wen, Hao, et al.
Published: (2023)

AgentProg: Empowering Long-Horizon GUI Agents with Program-Guided Context Management
by: Tian, Shizuo, et al.
Published: (2025)

Mobile GUI Agents under Real-world Threats: Are We There Yet?
by: Liu, Guohong, et al.
Published: (2025)

Joint Agent Memory and Exploration Learning via Novelty Signals
by: Tian, Shizuo, et al.
Published: (2026)

LLM-Explorer: Towards Efficient and Affordable LLM-based Exploration for Mobile Apps
by: Zhao, Shanhui, et al.
Published: (2025)

GUI-Xplore: Empowering Generalizable GUI Agents with One Exploration
by: Sun, Yuchen, et al.
Published: (2025)

SimuWoB: Simulating Real-World Mobile Apps for Fast and Faithful GUI Agent Benchmarking
by: Liu, Guohong, et al.
Published: (2026)

Routine Computing: A Systematic Review of Sensing Daily Life Dimensions Towards Human-Centered Goals
by: Pavlov, Borislav, et al.
Published: (2026)

DroidBot-GPT: GPT-powered UI Automation for Android
by: Wen, Hao, et al.
Published: (2023)

ParaThinker: Native Parallel Thinking as a New Paradigm to Scale LLM Test-time Compute
by: Wen, Hao, et al.
Published: (2025)

ChainStream: An LLM-based Framework for Unified Synthetic Sensing
by: Liu, Jiacheng, et al.
Published: (2024)

UI-TARS: Pioneering Automated GUI Interaction with Native Agents
by: Qin, Yujia, et al.
Published: (2025)

GRAIL:Learning to Interact with Large Knowledge Graphs for Retrieval Augmented Reasoning
by: Chang, Ge, et al.
Published: (2025)

Enhancing Agentic Textual Graph Retrieval with Synthetic Stepwise Supervision
by: Chang, Ge, et al.
Published: (2025)

A First Look At Efficient And Secure On-Device LLM Inference Against KV Leakage
by: Yang, Huan, et al.
Published: (2024)

ReuseDroid: A VLM-empowered Android UI Test Migrator Boosted by Active Feedback
by: Li, Xiaolei, et al.
Published: (2025)

WindowsWorld: A Process-Centric Benchmark of Autonomous GUI Agents in Professional Cross-Application Environments
by: Li, Jinchao, et al.
Published: (2026)

FuncDroid: Towards Inter-Functional Flows for Comprehensive Mobile App GUI Testing
by: He, Jinlong, et al.
Published: (2026)

EpiDroid: Dependency-Guided Recomposition for Deep State Discovery in Mobile GUI Testing
by: Song, Jiahui, et al.
Published: (2026)

BudgetThinker: Empowering Budget-aware LLM Reasoning with Control Tokens
by: Wen, Hao, et al.
Published: (2025)

Advancing Mobile GUI Agents: A Verifier-Driven Approach to Practical Deployment
by: Dai, Gaole, et al.
Published: (2025)

LoRA-Switch: Boosting the Efficiency of Dynamic LLM Adapters via System-Algorithm Co-design
by: Kong, Rui, et al.
Published: (2024)

Next-Gen CAPTCHAs: Leveraging the Cognitive Gap for Scalable and Diverse GUI-Agent Defense
by: Liu, Jiacheng, et al.
Published: (2026)

Rational Decision-Making Agent with Internalized Utility Judgment
by: Ye, Yining, et al.
Published: (2023)

Auto-scaling Continuous Memory for GUI Agent
by: Wu, Wenyi, et al.
Published: (2025)

Threshold Neuron: A Brain-inspired Artificial Neuron for Efficient On-device Inference
by: Zheng, Zihao, et al.
Published: (2024)

An Empirical Study of LLM Reasoning Ability Under Strict Output Length Constraint
by: Sun, Yi, et al.
Published: (2025)

Mobile-Bench-v2: A More Realistic and Comprehensive Benchmark for VLM-based Mobile Agents
by: Xu, Weikai, et al.
Published: (2025)

SHARE: An SLM-based Hierarchical Action CorREction Assistant for Text-to-SQL
by: Qu, Ge, et al.
Published: (2025)

From Automated to Autonomous: Hierarchical Agent-native Network Architecture (HANA)
by: Wu, Binghan, et al.
Published: (2026)

ProRe: A Proactive Reward System for GUI Agents via Reasoner-Actor Collaboration
by: Dai, Gaole, et al.
Published: (2025)

AutoGUI: Scaling GUI Grounding with Automatic Functionality Annotations from LLMs
by: Li, Hongxin, et al.
Published: (2025)

Leveraging AI Agents for Autonomous Networks: A Reference Architecture and Empirical Studies
by: Wu, Binghan, et al.
Published: (2025)

Continual GUI Agents
by: Liu, Ziwei, et al.
Published: (2026)

MobileViews: A Million-scale and Diverse Mobile GUI Dataset
by: Gao, Longxi, et al.
Published: (2024)

Personal LLM Agents: Insights and Survey about the Capability, Efficiency and Security
by: Li, Yuanchun, et al.
Published: (2024)

Region-based Content Enhancement for Efficient Video Analytics at the Edge
by: Wang, Weijun, et al.
Published: (2024)

Annotated record of the detailed examination of Mn deposits in a core from R/V Hakuho Maru Cruise KH-74-4 station
by: Tsunogai, Shizuo, et al.
Published: (1980)

AutoGUI-v2: A Comprehensive Multi-Modal GUI Functionality Understanding Benchmark
by: Li, Hongxin, et al.
Published: (2026)

VALORES DE REFERÊNCIA DO DRIS PARA A SOJA, CULTIVARES EMBRAPA 59 E BR 37, EM CARAMBEÍ - PARANÁ
by: Shizuo Maeda
Published: (2004)