:: Library Catalog

Bibliographic Details
Main Authors:	Borthwick, Andrew; Ash, Stephen; Galczak, Anthony
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2604.04347

Similar Items

RoboPhD: Self-Improving Text-to-SQL Through Autonomous Agent Evolution
by: Borthwick, Andrew, et al.
Published: (2026)

ORBIT: Scalable and Verifiable Data Generation for Search Agents on a Tight Budget
by: Thakur, Nandan, et al.
Published: (2026)

AgentGym: Evolving Large Language Model-based Agents across Diverse Environments
by: Xi, Zhiheng, et al.
Published: (2024)

ContextBudget: Budget-Aware Context Management for Long-Horizon Search Agents
by: Wu, Yong, et al.
Published: (2026)

RoboLayout: Differentiable 3D Scene Generation for Embodied Agents
by: Shamsaddinlou, Ali
Published: (2026)

PhD: A ChatGPT-Prompted Visual hallucination Evaluation Dataset
by: Liu, Jiazhen, et al.
Published: (2024)

Phase Transition for Budgeted Multi-Agent Synergy
by: Liu, Bang, et al.
Published: (2026)

Towards Goal-Oriented Agents for Evolving Problems Observed via Conversation
by: Free, Michael, et al.
Published: (2024)

SEA-Eval: A Benchmark for Evaluating Self-Evolving Agents Beyond Episodic Assessment
by: Jiang, Sihang, et al.
Published: (2026)

When Agents Evolve, Institutions Follow
by: Fei, Chao, et al.
Published: (2026)

RoboCertProb: Property Specification for Probabilistic RoboChart Models
by: Ye, Kangfeng, et al.
Published: (2024)

RoboCurate: Harnessing Diversity with Action-Verified Neural Trajectory for Robot Learning
by: Kim, Seungku, et al.
Published: (2026)

RoboWM-Bench: A Benchmark for Evaluating World Models in Robotic Manipulation
by: Jiang, Feng, et al.
Published: (2026)

Inference-Time Budget Control for LLM Search Agents
by: Fang, Zhengru, et al.
Published: (2026)

Beyond Perfect APIs: A Comprehensive Evaluation of LLM Agents Under Real-World API Complexity
by: Kim, Doyoung, et al.
Published: (2026)

Alita-G: Self-Evolving Generative Agent for Agent Generation
by: Qiu, Jiahao, et al.
Published: (2025)

Agents of Change: Self-Evolving LLM Agents for Strategic Planning
by: Belle, Nikolas, et al.
Published: (2025)

Self-Evolving Software Agents
by: Robol, Marco, et al.
Published: (2026)

Efficient Agent Evaluation via Diversity-Guided User Simulation
by: Nakash, Itay, et al.
Published: (2026)

RoboSpatial: Teaching Spatial Understanding to 2D and 3D Vision-Language Models for Robotics
by: Song, Chan Hee, et al.
Published: (2024)

EXG: Self-Evolving Agents with Experience Graphs
by: Jin, Yuxin, et al.
Published: (2026)

Autogenesis: A Self-Evolving Agent Protocol
by: Zhang, Wentao, et al.
Published: (2026)

Real-Time Reasoning Agents in Evolving Environments
by: Wen, Yule, et al.
Published: (2025)

DSGBench: A Diverse Strategic Game Benchmark for Evaluating LLM-based Agents in Complex Decision-Making Environments
by: Tang, Wenjie, et al.
Published: (2025)

Budget-Aware Tool-Use Enables Effective Agent Scaling
by: Liu, Tengxiao, et al.
Published: (2025)

Evolving-RL: End-to-End Optimization of Experience-Driven Self-Evolving Capability within Agents
by: Fan, Zhiyuan, et al.
Published: (2026)

AutoAgent: Evolving Cognition and Elastic Memory Orchestration for Adaptive Agents
by: Wang, Xiaoxing, et al.
Published: (2026)

AgentDevel: Reframing Self-Evolving LLM Agents as Release Engineering
by: Zhang, Di
Published: (2026)

AppAgentX: Evolving GUI Agents as Proficient Smartphone Users
by: Jiang, Wenjia, et al.
Published: (2025)

RoboSafe: Safeguarding Embodied Agents via Executable Safety Logic
by: Wang, Le, et al.
Published: (2025)

EvoTool: Self-Evolving Tool-Use Policy Optimization in LLM Agents via Blame-Aware Mutation and Diversity-Aware Selection
by: Yang, Shuo, et al.
Published: (2026)

A Study of Plasticity Loss in On-Policy Deep Reinforcement Learning
by: Juliani, Arthur, et al.
Published: (2024)

EVE-Agent: Evidence-Verifiable Self-Evolving Agents
by: Arai, Yamato, et al.
Published: (2026)

BAGEN: Are LLM Agents Budget-Aware?
by: Lin, Yuxiang, et al.
Published: (2026)

RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints
by: Qin, Yiran, et al.
Published: (2025)

Agent Alignment in Evolving Social Norms
by: Li, Shimin, et al.
Published: (2024)

SD-E$^2$: Semantic Exploration for Reasoning Under Token Budgets
by: Mishra, Kshitij, et al.
Published: (2026)

EvolveR: Self-Evolving LLM Agents through an Experience-Driven Lifecycle
by: Wu, Rong, et al.
Published: (2025)

Observation Denoising in CYRUS Soccer Simulation 2D Team For RoboCup 2024
by: Zare, Nader, et al.
Published: (2024)

CODESKILL: Learning Self-Evolving Skills for Coding Agents
by: Li, Yanzhou, et al.
Published: (2026)