| Main authors: | Zhang, Xiaoying; Liu, Zichen; Zhang, Yipeng; Hu, Xia; Shao, Wenqi |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online access: | https://arxiv.org/abs/2603.08561 |
Similar items
GDiffRetro: Retrosynthesis Prediction with Dual Graph Enhanced Molecular Representation and Diffusion Generation
by: Sun, Shengyin, et al.
Published: (2025)
Critique-GRPO: Advancing LLM Reasoning with Natural Language and Numerical Feedback
by: Zhang, Xiaoying, et al.
Published: (2025)
No More Stale Feedback: Co-Evolving Critics for Open-World Agent Learning
by: Li, Zhicong, et al.
Published: (2026)
HiAgent: Hierarchical Working Memory Management for Solving Long-Horizon Agent Tasks with Large Language Model
by: Hu, Mengkang, et al.
Published: (2024)
Deep Research of Deep Research: From Transformer to Agent, From AI to AI for Science
by: Yu, Yipeng
Published: (2026)
Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization
by: Zhang, Wenqi, et al.
Published: (2024)
Towards Feedback-to-Plan Decisions for Self-Evolving LLM Agents in CUDA Kernel Generation
by: Chong, Yee Hin, et al.
Published: (2026)
EvolveMem: Self-Evolving Memory Architecture via AutoResearch for LLM Agents
by: Liu, Jiaqi, et al.
Published: (2026)
MagicGUI-RMS: A Multi-Agent Reward Model System for Self-Evolving GUI Agents via Automated Feedback Reflux
by: Li, Zecheng, et al.
Published: (2026)
CODESKILL: Learning Self-Evolving Skills for Coding Agents
by: Li, Yanzhou, et al.
Published: (2026)
Evolving and Executing Research Plans via Double-Loop Multi-Agent Collaboration
by: Zhang, Zhi, et al.
Published: (2025)
Towards Self-Evolving Benchmarks: Synthesizing Agent Trajectories via Test-Time Exploration under Validate-by-Reproduce Paradigm
by: Guo, Dadi, et al.
Published: (2025)
Evolving-RL: End-to-End Optimization of Experience-Driven Self-Evolving Capability within Agents
by: Fan, Zhiyuan, et al.
Published: (2026)
EXG: Self-Evolving Agents with Experience Graphs
by: Jin, Yuxin, et al.
Published: (2026)
MorphAgent: Empowering Agents through Self-Evolving Profiles and Decentralized Collaboration
by: Lu, Siyuan, et al.
Published: (2024)
AgentDevel: Reframing Self-Evolving LLM Agents as Release Engineering
by: Zhang, Di
Published: (2026)
Building Self-Evolving Agents via Experience-Driven Lifelong Learning: A Framework and Benchmark
by: Cai, Yuxuan, et al.
Published: (2025)
Online Intrinsic Rewards for Decision Making Agents from Large Language Model Feedback
by: Zheng, Qinqing, et al.
Published: (2024)
UniPruning: Unifying Local Metric and Global Feedback for Scalable Sparse LLMs
by: Ding, Yizhuo, et al.
Published: (2025)
A Survey of Self-Evolving Agents: What, When, How, and Where to Evolve on the Path to Artificial Super Intelligence
by: Gao, Huan-ang, et al.
Published: (2025)
Attributing Emergence in Million-Agent Systems
by: Tang, Ling, et al.
Published: (2026)
SEDM: Scalable Self-Evolving Distributed Memory for Agents
by: Xu, Haoran, et al.
Published: (2025)
K-Search: LLM Kernel Generation via Co-Evolving Intrinsic World Model
by: Cao, Shiyi, et al.
Published: (2026)
BinCtx: Multi-Modal Representation Learning for Robust Android App Behavior Detection
by: Liu, Zichen, et al.
Published: (2025)
Improving Reinforcement Learning from Human Feedback Using Contrastive Rewards
by: Shen, Wei, et al.
Published: (2024)
Reinforcement Learning with Intrinsically Motivated Feedback Graph for Lost-sales Inventory Control
by: Liu, Zifan, et al.
Published: (2024)
Retro3D: A 3D-aware Template-free Method for Enhancing Retrosynthesis via Molecular Conformer Information
by: Zhuang, Jiaxi, et al.
Published: (2025)
AgentFactory: A Self-Evolving Framework Through Executable Subagent Accumulation and Reuse
by: Zhang, Zhang, et al.
Published: (2026)
Agent RL Scaling Law: Agent RL with Spontaneous Code Execution for Mathematical Problem Solving
by: Mai, Xinji, et al.
Published: (2025)
Group-Evolving Agents: Open-Ended Self-Improvement via Experience Sharing
by: Weng, Zhaotian, et al.
Published: (2026)
EVA: Red-Teaming GUI Agents via Evolving Indirect Prompt Injection
by: Lu, Yijie, et al.
Published: (2025)
Real-Time Reasoning Agents in Evolving Environments
by: Wen, Yule, et al.
Published: (2025)
CoMAS: Co-Evolving Multi-Agent Systems via Interaction Rewards
by: Xue, Xiangyuan, et al.
Published: (2025)
SEARL: Joint Optimization of Policy and Tool Graph Memory for Self-Evolving Agents
by: Feng, Xinshun, et al.
Published: (2026)
CoVe: Training Interactive Tool-Use Agents via Constraint-Guided Verification
by: Chen, Jinpeng, et al.
Published: (2026)
Retro-fallback: retrosynthetic planning in an uncertain world
by: Tripp, Austin, et al.
Published: (2023)
Retro-Expert: Collaborative Reasoning for Interpretable Retrosynthesis
by: Li, Xinyi, et al.
Published: (2025)
PragLocker: Protecting Agent Intellectual Property in Untrusted Deployments via Non-Portable Prompts
by: Li, Qinfeng, et al.
Published: (2026)
EE-MCP: Self-Evolving MCP-GUI Agents via Automated Environment Generation and Experience Learning
by: He, Tiantian, et al.
Published: (2026)
1D-Bench: A Benchmark for Iterative UI Code Generation with Visual Feedback in Real-World
by: Xu, Qiao, et al.
Published: (2026)