Guardado en:
| Autor principal: | Wu, Xiangfan |
|---|---|
| Formato: | Preprint |
| Publicado: |
2025
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2506.13358 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
Prompting Policies for Multi-step Reasoning and Tool-Use in Black-box LLMs with Iterative Distillation of Experience
por: Sayana, Krishna, et al.
Publicado: (2026)
por: Sayana, Krishna, et al.
Publicado: (2026)
Towards Global Optimality in Cooperative MARL with the Transformation And Distillation Framework
por: Ye, Jianing, et al.
Publicado: (2022)
por: Ye, Jianing, et al.
Publicado: (2022)
DeepFusion: Accelerating MoE Training via Federated Knowledge Distillation from Heterogeneous Edge Devices
por: Li, Songyuan, et al.
Publicado: (2026)
por: Li, Songyuan, et al.
Publicado: (2026)
Efficient support ticket resolution using Knowledge Graphs
por: Varghese, Sherwin, et al.
Publicado: (2024)
por: Varghese, Sherwin, et al.
Publicado: (2024)
Planner Matters! An Efficient and Unbalanced Multi-agent Collaboration Framework for Long-horizon Planning
por: Wu, Wenyi, et al.
Publicado: (2026)
por: Wu, Wenyi, et al.
Publicado: (2026)
Toward Autonomous Engineering Design: A Knowledge-Guided Multi-Agent Framework
por: Kumar, Varun, et al.
Publicado: (2025)
por: Kumar, Varun, et al.
Publicado: (2025)
HyperMARL: Adaptive Hypernetworks for Multi-Agent RL
por: Tessera, Kale-ab Abebe, et al.
Publicado: (2024)
por: Tessera, Kale-ab Abebe, et al.
Publicado: (2024)
EdgeAgentX: A Novel Framework for Agentic AI at the Edge in Military Communication Networks
por: Ray, Abir
Publicado: (2025)
por: Ray, Abir
Publicado: (2025)
Socratic: Enhancing Human Teamwork via AI-enabled Coaching
por: Seo, Sangwon, et al.
Publicado: (2025)
por: Seo, Sangwon, et al.
Publicado: (2025)
JaxMARL: Multi-Agent RL Environments and Algorithms in JAX
por: Rutherford, Alexander, et al.
Publicado: (2023)
por: Rutherford, Alexander, et al.
Publicado: (2023)
PTDE: Personalized Training with Distilled Execution for Multi-Agent Reinforcement Learning
por: Chen, Yiqun, et al.
Publicado: (2022)
por: Chen, Yiqun, et al.
Publicado: (2022)
Multi-Agent Path Finding via Offline RL and LLM Collaboration
por: Atasever, Merve, et al.
Publicado: (2025)
por: Atasever, Merve, et al.
Publicado: (2025)
FlickerFusion: Intra-trajectory Domain Generalizing Multi-Agent RL
por: Koh, Woosung, et al.
Publicado: (2024)
por: Koh, Woosung, et al.
Publicado: (2024)
KVComm: Enabling Efficient LLM Communication through Selective KV Sharing
por: Shi, Xiangyu, et al.
Publicado: (2025)
por: Shi, Xiangyu, et al.
Publicado: (2025)
SPIRAL: Symbolic LLM Planning via Grounded and Reflective Search
por: Zhang, Yifan, et al.
Publicado: (2025)
por: Zhang, Yifan, et al.
Publicado: (2025)
In-Context Curiosity: Distilling Exploration for Decision-Pretrained Transformers on Bandit Tasks
por: Yang, Huitao, et al.
Publicado: (2025)
por: Yang, Huitao, et al.
Publicado: (2025)
Iterative Graph Alignment
por: Yu, Fangyuan, et al.
Publicado: (2024)
por: Yu, Fangyuan, et al.
Publicado: (2024)
MESA: Cooperative Meta-Exploration in Multi-Agent Learning through Exploiting State-Action Space Structure
por: Zhang, Zhicheng, et al.
Publicado: (2024)
por: Zhang, Zhicheng, et al.
Publicado: (2024)
Adaptively Coordinating with Novel Partners via Learned Latent Strategies
por: Li, Benjamin, et al.
Publicado: (2025)
por: Li, Benjamin, et al.
Publicado: (2025)
EcoFair-CH-MARL: Scalable Constrained Hierarchical Multi-Agent RL with Real-Time Emission Budgets and Fairness Guarantees
por: Alqithami, Saad
Publicado: (2026)
por: Alqithami, Saad
Publicado: (2026)
Quantifying Skill and Chance: A Unified Framework for the Geometry of Games
por: Silver, David H.
Publicado: (2025)
por: Silver, David H.
Publicado: (2025)
KERAP: A Knowledge-Enhanced Reasoning Approach for Accurate Zero-shot Diagnosis Prediction Using Multi-agent LLMs
por: Xie, Yuzhang, et al.
Publicado: (2025)
por: Xie, Yuzhang, et al.
Publicado: (2025)
STACKFEED: Structured Textual Actor-Critic Knowledge Base Editing with FeedBack
por: Kirtania, Shashank, et al.
Publicado: (2024)
por: Kirtania, Shashank, et al.
Publicado: (2024)
A Unified Multi-Agent Framework for Universal Multimodal Understanding and Generation
por: Li, Jiulin, et al.
Publicado: (2025)
por: Li, Jiulin, et al.
Publicado: (2025)
Adaptability in Multi-Agent Reinforcement Learning: A Framework and Unified Review
por: Hu, Siyi, et al.
Publicado: (2025)
por: Hu, Siyi, et al.
Publicado: (2025)
KernelSkill: A Multi-Agent Framework for GPU Kernel Optimization
por: Sun, Qitong, et al.
Publicado: (2026)
por: Sun, Qitong, et al.
Publicado: (2026)
A Subgoal-driven Framework for Improving Long-Horizon LLM Agents
por: Wang, Taiyi, et al.
Publicado: (2026)
por: Wang, Taiyi, et al.
Publicado: (2026)
OptiMindTune: A Multi-Agent Framework for Intelligent Hyperparameter Optimization
por: Madiraju, Meher Bhaskar, et al.
Publicado: (2025)
por: Madiraju, Meher Bhaskar, et al.
Publicado: (2025)
OMAC: A Holistic Optimization Framework for LLM-Based Multi-Agent Collaboration
por: Li, Shijun, et al.
Publicado: (2025)
por: Li, Shijun, et al.
Publicado: (2025)
A Multi-Agent Reinforcement Learning Framework for Public Health Decision Analysis
por: Sharma, Dinesh, et al.
Publicado: (2023)
por: Sharma, Dinesh, et al.
Publicado: (2023)
The Overcooked Generalisation Challenge: Evaluating Cooperation with Novel Partners in Unknown Environments Using Unsupervised Environment Design
por: Ruhdorfer, Constantin, et al.
Publicado: (2024)
por: Ruhdorfer, Constantin, et al.
Publicado: (2024)
Talk Structurally, Act Hierarchically: A Collaborative Framework for LLM Multi-Agent Systems
por: Wang, Zhao, et al.
Publicado: (2025)
por: Wang, Zhao, et al.
Publicado: (2025)
PaperOrchestra: A Multi-Agent Framework for Automated AI Research Paper Writing
por: Song, Yiwen, et al.
Publicado: (2026)
por: Song, Yiwen, et al.
Publicado: (2026)
ScholarPeer: A Context-Aware Multi-Agent Framework for Automated Peer Review
por: Goyal, Palash, et al.
Publicado: (2026)
por: Goyal, Palash, et al.
Publicado: (2026)
Efficient Multi-agent Reinforcement Learning by Planning
por: Liu, Qihan, et al.
Publicado: (2024)
por: Liu, Qihan, et al.
Publicado: (2024)
A Hierarchical Deep Reinforcement Learning Framework for Traffic Signal Control with Predictable Cycle Planning
por: Gu, Hankang, et al.
Publicado: (2025)
por: Gu, Hankang, et al.
Publicado: (2025)
A Hierarchical Framework with Spatio-Temporal Consistency Learning for Emergence Detection in Complex Adaptive Systems
por: Chen, Siyuan, et al.
Publicado: (2024)
por: Chen, Siyuan, et al.
Publicado: (2024)
EngiAI: A Multi-Agent Framework and Benchmark Suite for LLM-Driven Engineering Design
por: Molinari, Gioele, et al.
Publicado: (2026)
por: Molinari, Gioele, et al.
Publicado: (2026)
Incorporating Human Flexibility through Reward Preferences in Human-AI Teaming
por: Bhambri, Siddhant, et al.
Publicado: (2023)
por: Bhambri, Siddhant, et al.
Publicado: (2023)
Episodic Memory in Agentic Frameworks: Suggesting Next Tasks
por: Fiorini, Sandro Rama, et al.
Publicado: (2025)
por: Fiorini, Sandro Rama, et al.
Publicado: (2025)
Ejemplares similares
-
Prompting Policies for Multi-step Reasoning and Tool-Use in Black-box LLMs with Iterative Distillation of Experience
por: Sayana, Krishna, et al.
Publicado: (2026) -
Towards Global Optimality in Cooperative MARL with the Transformation And Distillation Framework
por: Ye, Jianing, et al.
Publicado: (2022) -
DeepFusion: Accelerating MoE Training via Federated Knowledge Distillation from Heterogeneous Edge Devices
por: Li, Songyuan, et al.
Publicado: (2026) -
Efficient support ticket resolution using Knowledge Graphs
por: Varghese, Sherwin, et al.
Publicado: (2024) -
Planner Matters! An Efficient and Unbalanced Multi-agent Collaboration Framework for Long-horizon Planning
por: Wu, Wenyi, et al.
Publicado: (2026)