:: Library Catalog

Bibliographic Details
Main Authors:	Gonzalez-Pumariega, Gonzalo; Agashe, Saaket; Yang, Jiachen; Li, Ang; Wang, Xin Eric
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2604.17849

Similar Items

Agent S2: A Compositional Generalist-Specialist Framework for Computer Use Agents
by: Agashe, Saaket, et al.
Published: (2025)

Agent S: An Open Agentic Framework that Uses Computers Like a Human
by: Agashe, Saaket, et al.
Published: (2024)

Scaling Agents for Computer Use
by: Gonzalez-Pumariega, Gonzalo, et al.
Published: (2025)

Self-Resource Allocation in Multi-Agent LLM Systems
by: Amayuelas, Alfonso, et al.
Published: (2025)

EnactToM: An Evolving Benchmark for Functional Theory of Mind in Embodied Agents
by: Juneja, Gurusha, et al.
Published: (2026)

Robotouille: An Asynchronous Planning Benchmark for LLM Agents
by: Gonzalez-Pumariega, Gonzalo, et al.
Published: (2025)

Query-Efficient Planning with Language Models
by: Gonzalez-Pumariega, Gonzalo, et al.
Published: (2024)

Stateful Reasoning via Insight Replay
by: Lei, Bin, et al.
Published: (2026)

LLM-Coordination: Evaluating and Analyzing Multi-agent Coordination Abilities in Large Language Models
by: Agashe, Saaket, et al.
Published: (2023)

Skill-Aligned Fairness in Multi-Agent Learning for Collaboration in Healthcare
by: Ekpo, Promise Osaine, et al.
Published: (2025)

Multi-Turn Code Generation Through Single-Step Rewards
by: Jain, Arnav Kumar, et al.
Published: (2025)

Context Bootstrapped Reinforcement Learning
by: Agashe, Saaket, et al.
Published: (2026)

MCPWorld: A Unified Benchmarking Testbed for API, GUI, and Hybrid Computer Use Agents
by: Yan, Yunhe, et al.
Published: (2025)

A Large Language Model-Empowered Agent for Reliable and Robust Structural Analysis
by: Liu, Jiachen, et al.
Published: (2025)

Human-Guided Harm Recovery for Computer Use Agents
by: Li, Christy, et al.
Published: (2026)

RiOSWorld: Benchmarking the Risk of Multimodal Computer-Use Agents
by: Yang, Jingyi, et al.
Published: (2025)

The Art of Building Verifiers for Computer Use Agents
by: Rosset, Corby, et al.
Published: (2026)

Learning to Rewrite Tool Descriptions for Reliable LLM-Agent Tool Use
by: Guo, Ruocheng, et al.
Published: (2026)

OSExpert: Computer-Use Agents Learning Professional Skills via Exploration
by: Liu, Jiateng, et al.
Published: (2026)

Identification of Probabilities of Causation: from Recursive to Closed-Form Bounds
by: Shu, Xin, et al.
Published: (2025)

PRO-CUA: Process-Reward Optimization for Computer Use Agents
by: He, Yifei, et al.
Published: (2026)

OpenCUA: Open Foundations for Computer-Use Agents
by: Wang, Xinyuan, et al.
Published: (2025)

AgentHijack: Benchmarking Computer Use Agent Robustness to Common Environment Corruptions
by: Sun, Jingwei, et al.
Published: (2026)

AgentHazard: A Benchmark for Evaluating Harmful Behavior in Computer-Use Agents
by: Feng, Yunhao, et al.
Published: (2026)

CaMeLs Can Use Computers Too: System-level Security for Computer Use Agents
by: Foerster, Hanna, et al.
Published: (2026)

S-Agents: Self-organizing Agents in Open-ended Environments
by: Chen, Jiaqi, et al.
Published: (2024)

Grounding Computer Use Agents on Human Demonstrations
by: Feizi, Aarash, et al.
Published: (2025)

IntentScore: Intent-Conditioned Action Evaluation for Computer-Use Agents
by: Chen, Rongqian, et al.
Published: (2026)

Tool-to-Tool Matching Analysis Based Difference Score Computation Methods for Semiconductor Manufacturing
by: H., Sameera Bharadwaja, et al.
Published: (2025)

The Agent Use of Agent Beings: Agent Cybernetics Is the Missing Science of Foundation Agents
by: Wang, Xinrun, et al.
Published: (2026)

DPO Learning with LLMs-Judge Signal for Computer Use Agents
by: Luo, Man, et al.
Published: (2025)

OpenComputer: Verifiable Software Worlds for Computer-Use Agents
by: Wei, Jinbiao, et al.
Published: (2026)

P-Guide: Parameter-Efficient Prior Steering for Single-Pass CFG Inference
by: Peng, Xin, et al.
Published: (2026)

Curie: Toward Rigorous and Automated Scientific Experimentation with AI Agents
by: Kon, Patrick Tser Jern, et al.
Published: (2025)

Reducing Cognitive Overhead in Tool Use via Multi-Small-Agent Reinforcement Learning
by: Wang, Dayu, et al.
Published: (2025)

C-World: A Computer Use Agent Environment Creator
by: Xi, Ziqiao, et al.
Published: (2026)

ComputerRL: Scaling End-to-End Online Reinforcement Learning for Computer Use Agents
by: Lai, Hanyu, et al.
Published: (2025)

Agent Alpha: Tree Search Unifying Generation, Exploration and Evaluation for Computer-Use Agents
by: Tang, Sizhe, et al.
Published: (2026)

DORAEMON: Decentralized Ontology-aware Reliable Agent with Enhanced Memory Oriented Navigation
by: Gu, Tianjun, et al.
Published: (2025)

Efficient Agent Training for Computer Use
by: He, Yanheng, et al.
Published: (2025)