Saved in:
| Main Authors: | Wen, Hao, Tian, Shizuo, Pavlov, Borislav, Du, Wenjie, Li, Yixuan, Chang, Ge, Zhao, Shanhui, Liu, Jiacheng, Liu, Yunxin, Zhang, Ya-Qin, Li, Yuanchun |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2412.18116 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
AutoDroid: LLM-powered Task Automation in Android
by: Wen, Hao, et al.
Published: (2023)
by: Wen, Hao, et al.
Published: (2023)
AgentProg: Empowering Long-Horizon GUI Agents with Program-Guided Context Management
by: Tian, Shizuo, et al.
Published: (2025)
by: Tian, Shizuo, et al.
Published: (2025)
Mobile GUI Agents under Real-world Threats: Are We There Yet?
by: Liu, Guohong, et al.
Published: (2025)
by: Liu, Guohong, et al.
Published: (2025)
Joint Agent Memory and Exploration Learning via Novelty Signals
by: Tian, Shizuo, et al.
Published: (2026)
by: Tian, Shizuo, et al.
Published: (2026)
LLM-Explorer: Towards Efficient and Affordable LLM-based Exploration for Mobile Apps
by: Zhao, Shanhui, et al.
Published: (2025)
by: Zhao, Shanhui, et al.
Published: (2025)
GUI-Xplore: Empowering Generalizable GUI Agents with One Exploration
by: Sun, Yuchen, et al.
Published: (2025)
by: Sun, Yuchen, et al.
Published: (2025)
SimuWoB: Simulating Real-World Mobile Apps for Fast and Faithful GUI Agent Benchmarking
by: Liu, Guohong, et al.
Published: (2026)
by: Liu, Guohong, et al.
Published: (2026)
Routine Computing: A Systematic Review of Sensing Daily Life Dimensions Towards Human-Centered Goals
by: Pavlov, Borislav, et al.
Published: (2026)
by: Pavlov, Borislav, et al.
Published: (2026)
DroidBot-GPT: GPT-powered UI Automation for Android
by: Wen, Hao, et al.
Published: (2023)
by: Wen, Hao, et al.
Published: (2023)
ParaThinker: Native Parallel Thinking as a New Paradigm to Scale LLM Test-time Compute
by: Wen, Hao, et al.
Published: (2025)
by: Wen, Hao, et al.
Published: (2025)
ChainStream: An LLM-based Framework for Unified Synthetic Sensing
by: Liu, Jiacheng, et al.
Published: (2024)
by: Liu, Jiacheng, et al.
Published: (2024)
UI-TARS: Pioneering Automated GUI Interaction with Native Agents
by: Qin, Yujia, et al.
Published: (2025)
by: Qin, Yujia, et al.
Published: (2025)
GRAIL:Learning to Interact with Large Knowledge Graphs for Retrieval Augmented Reasoning
by: Chang, Ge, et al.
Published: (2025)
by: Chang, Ge, et al.
Published: (2025)
Enhancing Agentic Textual Graph Retrieval with Synthetic Stepwise Supervision
by: Chang, Ge, et al.
Published: (2025)
by: Chang, Ge, et al.
Published: (2025)
A First Look At Efficient And Secure On-Device LLM Inference Against KV Leakage
by: Yang, Huan, et al.
Published: (2024)
by: Yang, Huan, et al.
Published: (2024)
ReuseDroid: A VLM-empowered Android UI Test Migrator Boosted by Active Feedback
by: Li, Xiaolei, et al.
Published: (2025)
by: Li, Xiaolei, et al.
Published: (2025)
WindowsWorld: A Process-Centric Benchmark of Autonomous GUI Agents in Professional Cross-Application Environments
by: Li, Jinchao, et al.
Published: (2026)
by: Li, Jinchao, et al.
Published: (2026)
FuncDroid: Towards Inter-Functional Flows for Comprehensive Mobile App GUI Testing
by: He, Jinlong, et al.
Published: (2026)
by: He, Jinlong, et al.
Published: (2026)
EpiDroid: Dependency-Guided Recomposition for Deep State Discovery in Mobile GUI Testing
by: Song, Jiahui, et al.
Published: (2026)
by: Song, Jiahui, et al.
Published: (2026)
BudgetThinker: Empowering Budget-aware LLM Reasoning with Control Tokens
by: Wen, Hao, et al.
Published: (2025)
by: Wen, Hao, et al.
Published: (2025)
Advancing Mobile GUI Agents: A Verifier-Driven Approach to Practical Deployment
by: Dai, Gaole, et al.
Published: (2025)
by: Dai, Gaole, et al.
Published: (2025)
LoRA-Switch: Boosting the Efficiency of Dynamic LLM Adapters via System-Algorithm Co-design
by: Kong, Rui, et al.
Published: (2024)
by: Kong, Rui, et al.
Published: (2024)
Next-Gen CAPTCHAs: Leveraging the Cognitive Gap for Scalable and Diverse GUI-Agent Defense
by: Liu, Jiacheng, et al.
Published: (2026)
by: Liu, Jiacheng, et al.
Published: (2026)
Rational Decision-Making Agent with Internalized Utility Judgment
by: Ye, Yining, et al.
Published: (2023)
by: Ye, Yining, et al.
Published: (2023)
Auto-scaling Continuous Memory for GUI Agent
by: Wu, Wenyi, et al.
Published: (2025)
by: Wu, Wenyi, et al.
Published: (2025)
Threshold Neuron: A Brain-inspired Artificial Neuron for Efficient On-device Inference
by: Zheng, Zihao, et al.
Published: (2024)
by: Zheng, Zihao, et al.
Published: (2024)
An Empirical Study of LLM Reasoning Ability Under Strict Output Length Constraint
by: Sun, Yi, et al.
Published: (2025)
by: Sun, Yi, et al.
Published: (2025)
Mobile-Bench-v2: A More Realistic and Comprehensive Benchmark for VLM-based Mobile Agents
by: Xu, Weikai, et al.
Published: (2025)
by: Xu, Weikai, et al.
Published: (2025)
SHARE: An SLM-based Hierarchical Action CorREction Assistant for Text-to-SQL
by: Qu, Ge, et al.
Published: (2025)
by: Qu, Ge, et al.
Published: (2025)
From Automated to Autonomous: Hierarchical Agent-native Network Architecture (HANA)
by: Wu, Binghan, et al.
Published: (2026)
by: Wu, Binghan, et al.
Published: (2026)
ProRe: A Proactive Reward System for GUI Agents via Reasoner-Actor Collaboration
by: Dai, Gaole, et al.
Published: (2025)
by: Dai, Gaole, et al.
Published: (2025)
AutoGUI: Scaling GUI Grounding with Automatic Functionality Annotations from LLMs
by: Li, Hongxin, et al.
Published: (2025)
by: Li, Hongxin, et al.
Published: (2025)
Leveraging AI Agents for Autonomous Networks: A Reference Architecture and Empirical Studies
by: Wu, Binghan, et al.
Published: (2025)
by: Wu, Binghan, et al.
Published: (2025)
Continual GUI Agents
by: Liu, Ziwei, et al.
Published: (2026)
by: Liu, Ziwei, et al.
Published: (2026)
MobileViews: A Million-scale and Diverse Mobile GUI Dataset
by: Gao, Longxi, et al.
Published: (2024)
by: Gao, Longxi, et al.
Published: (2024)
Personal LLM Agents: Insights and Survey about the Capability, Efficiency and Security
by: Li, Yuanchun, et al.
Published: (2024)
by: Li, Yuanchun, et al.
Published: (2024)
Region-based Content Enhancement for Efficient Video Analytics at the Edge
by: Wang, Weijun, et al.
Published: (2024)
by: Wang, Weijun, et al.
Published: (2024)
Annotated record of the detailed examination of Mn deposits in a core from R/V Hakuho Maru Cruise KH-74-4 station
by: Tsunogai, Shizuo, et al.
Published: (1980)
by: Tsunogai, Shizuo, et al.
Published: (1980)
AutoGUI-v2: A Comprehensive Multi-Modal GUI Functionality Understanding Benchmark
by: Li, Hongxin, et al.
Published: (2026)
by: Li, Hongxin, et al.
Published: (2026)
VALORES DE REFERÊNCIA DO DRIS PARA A SOJA, CULTIVARES EMBRAPA 59 E BR 37, EM CARAMBEÍ - PARANÁ
by: Shizuo Maeda
Published: (2004)
by: Shizuo Maeda
Published: (2004)
Similar Items
-
AutoDroid: LLM-powered Task Automation in Android
by: Wen, Hao, et al.
Published: (2023) -
AgentProg: Empowering Long-Horizon GUI Agents with Program-Guided Context Management
by: Tian, Shizuo, et al.
Published: (2025) -
Mobile GUI Agents under Real-world Threats: Are We There Yet?
by: Liu, Guohong, et al.
Published: (2025) -
Joint Agent Memory and Exploration Learning via Novelty Signals
by: Tian, Shizuo, et al.
Published: (2026) -
LLM-Explorer: Towards Efficient and Affordable LLM-based Exploration for Mobile Apps
by: Zhao, Shanhui, et al.
Published: (2025)