Saved in:
| Main Authors: | Gonzalez-Pumariega, Gonzalo, Agashe, Saaket, Yang, Jiachen, Li, Ang, Wang, Xin Eric |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.17849 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Agent S2: A Compositional Generalist-Specialist Framework for Computer Use Agents
by: Agashe, Saaket, et al.
Published: (2025)
by: Agashe, Saaket, et al.
Published: (2025)
Agent S: An Open Agentic Framework that Uses Computers Like a Human
by: Agashe, Saaket, et al.
Published: (2024)
by: Agashe, Saaket, et al.
Published: (2024)
Scaling Agents for Computer Use
by: Gonzalez-Pumariega, Gonzalo, et al.
Published: (2025)
by: Gonzalez-Pumariega, Gonzalo, et al.
Published: (2025)
Self-Resource Allocation in Multi-Agent LLM Systems
by: Amayuelas, Alfonso, et al.
Published: (2025)
by: Amayuelas, Alfonso, et al.
Published: (2025)
EnactToM: An Evolving Benchmark for Functional Theory of Mind in Embodied Agents
by: Juneja, Gurusha, et al.
Published: (2026)
by: Juneja, Gurusha, et al.
Published: (2026)
Robotouille: An Asynchronous Planning Benchmark for LLM Agents
by: Gonzalez-Pumariega, Gonzalo, et al.
Published: (2025)
by: Gonzalez-Pumariega, Gonzalo, et al.
Published: (2025)
Query-Efficient Planning with Language Models
by: Gonzalez-Pumariega, Gonzalo, et al.
Published: (2024)
by: Gonzalez-Pumariega, Gonzalo, et al.
Published: (2024)
Stateful Reasoning via Insight Replay
by: Lei, Bin, et al.
Published: (2026)
by: Lei, Bin, et al.
Published: (2026)
LLM-Coordination: Evaluating and Analyzing Multi-agent Coordination Abilities in Large Language Models
by: Agashe, Saaket, et al.
Published: (2023)
by: Agashe, Saaket, et al.
Published: (2023)
Skill-Aligned Fairness in Multi-Agent Learning for Collaboration in Healthcare
by: Ekpo, Promise Osaine, et al.
Published: (2025)
by: Ekpo, Promise Osaine, et al.
Published: (2025)
Multi-Turn Code Generation Through Single-Step Rewards
by: Jain, Arnav Kumar, et al.
Published: (2025)
by: Jain, Arnav Kumar, et al.
Published: (2025)
Context Bootstrapped Reinforcement Learning
by: Agashe, Saaket, et al.
Published: (2026)
by: Agashe, Saaket, et al.
Published: (2026)
MCPWorld: A Unified Benchmarking Testbed for API, GUI, and Hybrid Computer Use Agents
by: Yan, Yunhe, et al.
Published: (2025)
by: Yan, Yunhe, et al.
Published: (2025)
A Large Language Model-Empowered Agent for Reliable and Robust Structural Analysis
by: Liu, Jiachen, et al.
Published: (2025)
by: Liu, Jiachen, et al.
Published: (2025)
Human-Guided Harm Recovery for Computer Use Agents
by: Li, Christy, et al.
Published: (2026)
by: Li, Christy, et al.
Published: (2026)
RiOSWorld: Benchmarking the Risk of Multimodal Computer-Use Agents
by: Yang, Jingyi, et al.
Published: (2025)
by: Yang, Jingyi, et al.
Published: (2025)
The Art of Building Verifiers for Computer Use Agents
by: Rosset, Corby, et al.
Published: (2026)
by: Rosset, Corby, et al.
Published: (2026)
Learning to Rewrite Tool Descriptions for Reliable LLM-Agent Tool Use
by: Guo, Ruocheng, et al.
Published: (2026)
by: Guo, Ruocheng, et al.
Published: (2026)
OSExpert: Computer-Use Agents Learning Professional Skills via Exploration
by: Liu, Jiateng, et al.
Published: (2026)
by: Liu, Jiateng, et al.
Published: (2026)
Identification of Probabilities of Causation: from Recursive to Closed-Form Bounds
by: Shu, Xin, et al.
Published: (2025)
by: Shu, Xin, et al.
Published: (2025)
PRO-CUA: Process-Reward Optimization for Computer Use Agents
by: He, Yifei, et al.
Published: (2026)
by: He, Yifei, et al.
Published: (2026)
OpenCUA: Open Foundations for Computer-Use Agents
by: Wang, Xinyuan, et al.
Published: (2025)
by: Wang, Xinyuan, et al.
Published: (2025)
AgentHijack: Benchmarking Computer Use Agent Robustness to Common Environment Corruptions
by: Sun, Jingwei, et al.
Published: (2026)
by: Sun, Jingwei, et al.
Published: (2026)
AgentHazard: A Benchmark for Evaluating Harmful Behavior in Computer-Use Agents
by: Feng, Yunhao, et al.
Published: (2026)
by: Feng, Yunhao, et al.
Published: (2026)
CaMeLs Can Use Computers Too: System-level Security for Computer Use Agents
by: Foerster, Hanna, et al.
Published: (2026)
by: Foerster, Hanna, et al.
Published: (2026)
S-Agents: Self-organizing Agents in Open-ended Environments
by: Chen, Jiaqi, et al.
Published: (2024)
by: Chen, Jiaqi, et al.
Published: (2024)
Grounding Computer Use Agents on Human Demonstrations
by: Feizi, Aarash, et al.
Published: (2025)
by: Feizi, Aarash, et al.
Published: (2025)
IntentScore: Intent-Conditioned Action Evaluation for Computer-Use Agents
by: Chen, Rongqian, et al.
Published: (2026)
by: Chen, Rongqian, et al.
Published: (2026)
Tool-to-Tool Matching Analysis Based Difference Score Computation Methods for Semiconductor Manufacturing
by: H., Sameera Bharadwaja, et al.
Published: (2025)
by: H., Sameera Bharadwaja, et al.
Published: (2025)
The Agent Use of Agent Beings: Agent Cybernetics Is the Missing Science of Foundation Agents
by: Wang, Xinrun, et al.
Published: (2026)
by: Wang, Xinrun, et al.
Published: (2026)
DPO Learning with LLMs-Judge Signal for Computer Use Agents
by: Luo, Man, et al.
Published: (2025)
by: Luo, Man, et al.
Published: (2025)
OpenComputer: Verifiable Software Worlds for Computer-Use Agents
by: Wei, Jinbiao, et al.
Published: (2026)
by: Wei, Jinbiao, et al.
Published: (2026)
P-Guide: Parameter-Efficient Prior Steering for Single-Pass CFG Inference
by: Peng, Xin, et al.
Published: (2026)
by: Peng, Xin, et al.
Published: (2026)
Curie: Toward Rigorous and Automated Scientific Experimentation with AI Agents
by: Kon, Patrick Tser Jern, et al.
Published: (2025)
by: Kon, Patrick Tser Jern, et al.
Published: (2025)
Reducing Cognitive Overhead in Tool Use via Multi-Small-Agent Reinforcement Learning
by: Wang, Dayu, et al.
Published: (2025)
by: Wang, Dayu, et al.
Published: (2025)
C-World: A Computer Use Agent Environment Creator
by: Xi, Ziqiao, et al.
Published: (2026)
by: Xi, Ziqiao, et al.
Published: (2026)
ComputerRL: Scaling End-to-End Online Reinforcement Learning for Computer Use Agents
by: Lai, Hanyu, et al.
Published: (2025)
by: Lai, Hanyu, et al.
Published: (2025)
Agent Alpha: Tree Search Unifying Generation, Exploration and Evaluation for Computer-Use Agents
by: Tang, Sizhe, et al.
Published: (2026)
by: Tang, Sizhe, et al.
Published: (2026)
DORAEMON: Decentralized Ontology-aware Reliable Agent with Enhanced Memory Oriented Navigation
by: Gu, Tianjun, et al.
Published: (2025)
by: Gu, Tianjun, et al.
Published: (2025)
Efficient Agent Training for Computer Use
by: He, Yanheng, et al.
Published: (2025)
by: He, Yanheng, et al.
Published: (2025)
Similar Items
-
Agent S2: A Compositional Generalist-Specialist Framework for Computer Use Agents
by: Agashe, Saaket, et al.
Published: (2025) -
Agent S: An Open Agentic Framework that Uses Computers Like a Human
by: Agashe, Saaket, et al.
Published: (2024) -
Scaling Agents for Computer Use
by: Gonzalez-Pumariega, Gonzalo, et al.
Published: (2025) -
Self-Resource Allocation in Multi-Agent LLM Systems
by: Amayuelas, Alfonso, et al.
Published: (2025) -
EnactToM: An Evolving Benchmark for Functional Theory of Mind in Embodied Agents
by: Juneja, Gurusha, et al.
Published: (2026)