Saved in:
| Main Authors: | Borthwick, Andrew, Ash, Stephen, Galczak, Anthony |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.04347 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
RoboPhD: Self-Improving Text-to-SQL Through Autonomous Agent Evolution
by: Borthwick, Andrew, et al.
Published: (2026)
by: Borthwick, Andrew, et al.
Published: (2026)
ORBIT: Scalable and Verifiable Data Generation for Search Agents on a Tight Budget
by: Thakur, Nandan, et al.
Published: (2026)
by: Thakur, Nandan, et al.
Published: (2026)
AgentGym: Evolving Large Language Model-based Agents across Diverse Environments
by: Xi, Zhiheng, et al.
Published: (2024)
by: Xi, Zhiheng, et al.
Published: (2024)
ContextBudget: Budget-Aware Context Management for Long-Horizon Search Agents
by: Wu, Yong, et al.
Published: (2026)
by: Wu, Yong, et al.
Published: (2026)
RoboLayout: Differentiable 3D Scene Generation for Embodied Agents
by: Shamsaddinlou, Ali
Published: (2026)
by: Shamsaddinlou, Ali
Published: (2026)
PhD: A ChatGPT-Prompted Visual hallucination Evaluation Dataset
by: Liu, Jiazhen, et al.
Published: (2024)
by: Liu, Jiazhen, et al.
Published: (2024)
Phase Transition for Budgeted Multi-Agent Synergy
by: Liu, Bang, et al.
Published: (2026)
by: Liu, Bang, et al.
Published: (2026)
Towards Goal-Oriented Agents for Evolving Problems Observed via Conversation
by: Free, Michael, et al.
Published: (2024)
by: Free, Michael, et al.
Published: (2024)
SEA-Eval: A Benchmark for Evaluating Self-Evolving Agents Beyond Episodic Assessment
by: Jiang, Sihang, et al.
Published: (2026)
by: Jiang, Sihang, et al.
Published: (2026)
When Agents Evolve, Institutions Follow
by: Fei, Chao, et al.
Published: (2026)
by: Fei, Chao, et al.
Published: (2026)
RoboCertProb: Property Specification for Probabilistic RoboChart Models
by: Ye, Kangfeng, et al.
Published: (2024)
by: Ye, Kangfeng, et al.
Published: (2024)
RoboCurate: Harnessing Diversity with Action-Verified Neural Trajectory for Robot Learning
by: Kim, Seungku, et al.
Published: (2026)
by: Kim, Seungku, et al.
Published: (2026)
RoboWM-Bench: A Benchmark for Evaluating World Models in Robotic Manipulation
by: Jiang, Feng, et al.
Published: (2026)
by: Jiang, Feng, et al.
Published: (2026)
Inference-Time Budget Control for LLM Search Agents
by: Fang, Zhengru, et al.
Published: (2026)
by: Fang, Zhengru, et al.
Published: (2026)
Beyond Perfect APIs: A Comprehensive Evaluation of LLM Agents Under Real-World API Complexity
by: Kim, Doyoung, et al.
Published: (2026)
by: Kim, Doyoung, et al.
Published: (2026)
Alita-G: Self-Evolving Generative Agent for Agent Generation
by: Qiu, Jiahao, et al.
Published: (2025)
by: Qiu, Jiahao, et al.
Published: (2025)
Agents of Change: Self-Evolving LLM Agents for Strategic Planning
by: Belle, Nikolas, et al.
Published: (2025)
by: Belle, Nikolas, et al.
Published: (2025)
Self-Evolving Software Agents
by: Robol, Marco, et al.
Published: (2026)
by: Robol, Marco, et al.
Published: (2026)
Efficient Agent Evaluation via Diversity-Guided User Simulation
by: Nakash, Itay, et al.
Published: (2026)
by: Nakash, Itay, et al.
Published: (2026)
RoboSpatial: Teaching Spatial Understanding to 2D and 3D Vision-Language Models for Robotics
by: Song, Chan Hee, et al.
Published: (2024)
by: Song, Chan Hee, et al.
Published: (2024)
EXG: Self-Evolving Agents with Experience Graphs
by: Jin, Yuxin, et al.
Published: (2026)
by: Jin, Yuxin, et al.
Published: (2026)
Autogenesis: A Self-Evolving Agent Protocol
by: Zhang, Wentao, et al.
Published: (2026)
by: Zhang, Wentao, et al.
Published: (2026)
Real-Time Reasoning Agents in Evolving Environments
by: Wen, Yule, et al.
Published: (2025)
by: Wen, Yule, et al.
Published: (2025)
DSGBench: A Diverse Strategic Game Benchmark for Evaluating LLM-based Agents in Complex Decision-Making Environments
by: Tang, Wenjie, et al.
Published: (2025)
by: Tang, Wenjie, et al.
Published: (2025)
Budget-Aware Tool-Use Enables Effective Agent Scaling
by: Liu, Tengxiao, et al.
Published: (2025)
by: Liu, Tengxiao, et al.
Published: (2025)
Evolving-RL: End-to-End Optimization of Experience-Driven Self-Evolving Capability within Agents
by: Fan, Zhiyuan, et al.
Published: (2026)
by: Fan, Zhiyuan, et al.
Published: (2026)
AutoAgent: Evolving Cognition and Elastic Memory Orchestration for Adaptive Agents
by: Wang, Xiaoxing, et al.
Published: (2026)
by: Wang, Xiaoxing, et al.
Published: (2026)
AgentDevel: Reframing Self-Evolving LLM Agents as Release Engineering
by: Zhang, Di
Published: (2026)
by: Zhang, Di
Published: (2026)
AppAgentX: Evolving GUI Agents as Proficient Smartphone Users
by: Jiang, Wenjia, et al.
Published: (2025)
by: Jiang, Wenjia, et al.
Published: (2025)
RoboSafe: Safeguarding Embodied Agents via Executable Safety Logic
by: Wang, Le, et al.
Published: (2025)
by: Wang, Le, et al.
Published: (2025)
EvoTool: Self-Evolving Tool-Use Policy Optimization in LLM Agents via Blame-Aware Mutation and Diversity-Aware Selection
by: Yang, Shuo, et al.
Published: (2026)
by: Yang, Shuo, et al.
Published: (2026)
A Study of Plasticity Loss in On-Policy Deep Reinforcement Learning
by: Juliani, Arthur, et al.
Published: (2024)
by: Juliani, Arthur, et al.
Published: (2024)
EVE-Agent: Evidence-Verifiable Self-Evolving Agents
by: Arai, Yamato, et al.
Published: (2026)
by: Arai, Yamato, et al.
Published: (2026)
BAGEN: Are LLM Agents Budget-Aware?
by: Lin, Yuxiang, et al.
Published: (2026)
by: Lin, Yuxiang, et al.
Published: (2026)
RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints
by: Qin, Yiran, et al.
Published: (2025)
by: Qin, Yiran, et al.
Published: (2025)
Agent Alignment in Evolving Social Norms
by: Li, Shimin, et al.
Published: (2024)
by: Li, Shimin, et al.
Published: (2024)
SD-E$^2$: Semantic Exploration for Reasoning Under Token Budgets
by: Mishra, Kshitij, et al.
Published: (2026)
by: Mishra, Kshitij, et al.
Published: (2026)
EvolveR: Self-Evolving LLM Agents through an Experience-Driven Lifecycle
by: Wu, Rong, et al.
Published: (2025)
by: Wu, Rong, et al.
Published: (2025)
Observation Denoising in CYRUS Soccer Simulation 2D Team For RoboCup 2024
by: Zare, Nader, et al.
Published: (2024)
by: Zare, Nader, et al.
Published: (2024)
CODESKILL: Learning Self-Evolving Skills for Coding Agents
by: Li, Yanzhou, et al.
Published: (2026)
by: Li, Yanzhou, et al.
Published: (2026)
Similar Items
-
RoboPhD: Self-Improving Text-to-SQL Through Autonomous Agent Evolution
by: Borthwick, Andrew, et al.
Published: (2026) -
ORBIT: Scalable and Verifiable Data Generation for Search Agents on a Tight Budget
by: Thakur, Nandan, et al.
Published: (2026) -
AgentGym: Evolving Large Language Model-based Agents across Diverse Environments
by: Xi, Zhiheng, et al.
Published: (2024) -
ContextBudget: Budget-Aware Context Management for Long-Horizon Search Agents
by: Wu, Yong, et al.
Published: (2026) -
RoboLayout: Differentiable 3D Scene Generation for Embodied Agents
by: Shamsaddinlou, Ali
Published: (2026)