| Main Authors: | Wang, Charles L., Dorchen, Keir, Jin, Peter |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.04399 |
Similar Items
Continual Harness: Online Adaptation for Self-Improving Foundation Agents
by: Karten, Seth, et al.
Published: (2026)
Self-Improving AI Agents through Self-Play
by: Chojecki, Przemyslaw
Published: (2025)
Information-Theoretic Limits of Safety Verification for Self-Improving Systems
by: Scrivens, Arsenios
Published: (2026)
VideoAgent: Self-Improving Video Generation
by: Soni, Achint, et al.
Published: (2024)
Experiential Reflective Learning for Self-Improving LLM Agents
by: Allard, Marc-Antoine, et al.
Published: (2026)
Soft Self-Consistency Improves Language Model Agents
by: Wang, Han, et al.
Published: (2024)
Self-Improving LLM Agents at Test-Time
by: Acikgoz, Emre Can, et al.
Published: (2025)
Enabling Self-Improving Agents to Learn at Test Time With Human-In-The-Loop Guidance
by: He, Yufei, et al.
Published: (2025)
The Limits of Predicting Agents from Behaviour
by: Bellot, Alexis, et al.
Published: (2025)
On the Limited Representational Power of Value Functions and its Links to Statistical (In)Efficiency
by: Cheikhi, David, et al.
Published: (2024)
Multi-Agent Conformal Prediction with Personalized Statistical Validity
by: Vejling, Martin V., et al.
Published: (2026)
Statistical Limits and Efficient Algorithms for Differentially Private Federated Learning
by: Auddy, Arnab, et al.
Published: (2026)
DrugSAGE: Self-evolving Agent Experience for Efficient State-of-the-Art Drug Discovery
by: Zhang, Yikun, et al.
Published: (2026)
On Limitation of Transformer for Learning HMMs
by: Hu, Jiachen, et al.
Published: (2024)
Combee: Scaling Prompt Learning for Self-Improving Language Model Agents
by: Li, Hanchen, et al.
Published: (2026)
Large Language Models Can Self-Improve At Web Agent Tasks
by: Patel, Ajay, et al.
Published: (2024)
Recursive Introspection: Teaching Language Model Agents How to Self-Improve
by: Qu, Yuxiao, et al.
Published: (2024)
On the Limits of Self-Improving in Large Language Models: The Singularity Is Not Near Without Symbolic Model Synthesis
by: Zenil, Hector
Published: (2026)
Annealing Self-Distillation Rectification Improves Adversarial Training
by: Wu, Yu-Yu, et al.
Published: (2023)
Self-Improved Learning for Scalable Neural Combinatorial Optimization
by: Luo, Fu, et al.
Published: (2024)
Hybrid Self-evolving Structured Memory for GUI Agents
by: Zhu, Sibo, et al.
Published: (2026)
How to Train Your LLM Web Agent: A Statistical Diagnosis
by: Vattikonda, Dheeraj, et al.
Published: (2025)
Self-Improving Robust Preference Optimization
by: Choi, Eugene, et al.
Published: (2024)
Training Agents to Self-Report Misbehavior
by: Lee, Bruce W., et al.
Published: (2026)
MIRA: Memory-Integrated Reinforcement Learning Agent with Limited LLM Guidance
by: Nourzad, Narjes, et al.
Published: (2026)
SGM: A Statistical Godel Machine for Risk-Controlled Recursive Self-Modification
by: Wu, Xuening, et al.
Published: (2025)
AgentOCR: Reimagining Agent History via Optical Self-Compression
by: Feng, Lang, et al.
Published: (2026)
World Modelling Improves Language Model Agents
by: Guo, Shangmin, et al.
Published: (2025)
APEX: Autonomous Policy Exploration for Self-Evolving LLM Agents
by: Li, Yibo, et al.
Published: (2026)
Self-Improving Diffusion Models with Synthetic Data
by: Alemohammad, Sina, et al.
Published: (2024)
On the Statistical Properties of Generative Adversarial Models for Low Intrinsic Data Dimension
by: Chakraborty, Saptarshi, et al.
Published: (2024)
Self-Training Meets Consistency: Improving LLMs' Reasoning with Consistency-Driven Rationale Evaluation
by: Lee, Jaehyeok, et al.
Published: (2024)
A Statistical Analysis of Deep Federated Learning for Intrinsically Low-dimensional Data
by: Chakraborty, Saptarshi, et al.
Published: (2024)
On the Unknowable Limits to Prediction
by: Yan, Jiani, et al.
Published: (2024)
MALT: Improving Reasoning with Multi-Agent LLM Training
by: Motwani, Sumeet Ramesh, et al.
Published: (2024)
What Are the Odds? Improving the foundations of Statistical Model Checking
by: Meggendorfer, Tobias, et al.
Published: (2024)
Robust Statistical Scaling of Outlier Scores: Improving the Quality of Outlier Probabilities for Outliers (Extended Version)
by: Röchner, Philipp, et al.
Published: (2024)
Self-Alignment Learning to Improve Myocardial Infarction Detection from Single-Lead ECG
by: Jin, Jiarui, et al.
Published: (2025)
BOAD: Discovering Hierarchical Software Engineering Agents via Bandit Optimization
by: Xu, Iris, et al.
Published: (2025)
Beyond Training: Enabling Self-Evolution of Agents with MOBIMEM
by: Liu, Zibin, et al.
Published: (2025)