Saved in:
| Main Authors: | Liu, Xianyang, Gu, Shangding, Song, Dawn |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.06008 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Long Context, Less Focus: A Scaling Gap in LLMs Revealed through Privacy and Personalization
by: Gu, Shangding
Published: (2026)
by: Gu, Shangding
Published: (2026)
Understanding Agent Scaling in LLM-Based Multi-Agent Systems via Diversity
by: Yang, Yingxuan, et al.
Published: (2026)
by: Yang, Yingxuan, et al.
Published: (2026)
Agentic Web: Weaving the Next Web with AI Agents
by: Yang, Yingxuan, et al.
Published: (2025)
by: Yang, Yingxuan, et al.
Published: (2025)
Data Uniformity Improves Training Efficiency and More, with a Convergence Framework Beyond the NTK Regime
by: Wang, Yuqing, et al.
Published: (2025)
by: Wang, Yuqing, et al.
Published: (2025)
Safe and Balanced: A Framework for Constrained Multi-Objective Reinforcement Learning
by: Gu, Shangding, et al.
Published: (2024)
by: Gu, Shangding, et al.
Published: (2024)
What Makes a Sale? Rethinking End-to-End Seller--Buyer Retail Dynamics with LLM Agents
by: Choi, Jeonghwan, et al.
Published: (2026)
by: Choi, Jeonghwan, et al.
Published: (2026)
MemFail: Stress-Testing Failure Modes of LLM Memory Systems
by: Garg, Ishir, et al.
Published: (2026)
by: Garg, Ishir, et al.
Published: (2026)
Paying Less Generalization Tax: A Cross-Domain Generalization Study of RL Training for LLM Agents
by: Liu, Zhihan, et al.
Published: (2026)
by: Liu, Zhihan, et al.
Published: (2026)
LLMs Should Express Uncertainty Explicitly
by: Guo, Junyu, et al.
Published: (2026)
by: Guo, Junyu, et al.
Published: (2026)
StyleBench: Evaluating thinking styles in Large Language Models
by: Guo, Junyu, et al.
Published: (2025)
by: Guo, Junyu, et al.
Published: (2025)
A Review of Safe Reinforcement Learning: Methods, Theory and Applications
by: Gu, Shangding, et al.
Published: (2022)
by: Gu, Shangding, et al.
Published: (2022)
Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulation
by: Gu, Shangding, et al.
Published: (2024)
by: Gu, Shangding, et al.
Published: (2024)
AgentFlux: Decoupled Fine-Tuning & Inference for On-Device Agentic Systems
by: Kadekodi, Rohan, et al.
Published: (2025)
by: Kadekodi, Rohan, et al.
Published: (2025)
When Do Multi-Agent Systems Outperform? Analysing the Learning Efficiency of Agentic Systems
by: Su, Junwei, et al.
Published: (2026)
by: Su, Junwei, et al.
Published: (2026)
LeakAgent: RL-based Red-teaming Agent for LLM Privacy Leakage
by: Nie, Yuzhou, et al.
Published: (2024)
by: Nie, Yuzhou, et al.
Published: (2024)
Agentic Unlearning: When LLM Agent Meets Machine Unlearning
by: Wang, Bin, et al.
Published: (2026)
by: Wang, Bin, et al.
Published: (2026)
Safe Multi-Agent Reinforcement Learning with Bilevel Optimization in Autonomous Driving
by: Zheng, Zhi, et al.
Published: (2024)
by: Zheng, Zhi, et al.
Published: (2024)
Hybrid Agentic AI and Multi-Agent Systems in Smart Manufacturing
by: Farahani, Mojtaba A., et al.
Published: (2025)
by: Farahani, Mojtaba A., et al.
Published: (2025)
A Benchmark for Multi-Party Negotiation Games from Real Negotiation Data
by: Benac, Leo, et al.
Published: (2026)
by: Benac, Leo, et al.
Published: (2026)
Contextual Dynamic Pricing with Strategic Buyers
by: Liu, Pangpang, et al.
Published: (2023)
by: Liu, Pangpang, et al.
Published: (2023)
Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning
by: Gu, Shangding, et al.
Published: (2025)
by: Gu, Shangding, et al.
Published: (2025)
[Re] Benchmarking LLM Capabilities in Negotiation through Scoreable Games
by: Pollo, Jorge Carrasco, et al.
Published: (2026)
by: Pollo, Jorge Carrasco, et al.
Published: (2026)
When Reasoning Models Hurt Behavioral Simulation: A Solver-Sampler Mismatch in Multi-Agent LLM Negotiation
by: Andric, Sandro
Published: (2026)
by: Andric, Sandro
Published: (2026)
ARM: Discovering Agentic Reasoning Modules for Generalizable Multi-Agent Systems
by: Yao, Bohan, et al.
Published: (2025)
by: Yao, Bohan, et al.
Published: (2025)
Game-theoretic LLM: Agent Workflow for Negotiation Games
by: Hua, Wenyue, et al.
Published: (2024)
by: Hua, Wenyue, et al.
Published: (2024)
dLLM: Simple Diffusion Language Modeling
by: Zhou, Zhanhui, et al.
Published: (2026)
by: Zhou, Zhanhui, et al.
Published: (2026)
What Limits Agentic Systems Efficiency?
by: Bian, Song, et al.
Published: (2025)
by: Bian, Song, et al.
Published: (2025)
GoAgent: Group-of-Agents Communication Topology Generation for LLM-based Multi-Agent Systems
by: Chen, Hongjiang, et al.
Published: (2026)
by: Chen, Hongjiang, et al.
Published: (2026)
Multi-Agent Debate: A Unified Agentic Framework for Tabular Anomaly Detection
by: Wang, Pinqiao, et al.
Published: (2026)
by: Wang, Pinqiao, et al.
Published: (2026)
MASPO: Joint Prompt Optimization for LLM-based Multi-Agent Systems
by: Wang, Zhexuan, et al.
Published: (2026)
by: Wang, Zhexuan, et al.
Published: (2026)
Dr. MAS: Stable Reinforcement Learning for Multi-Agent LLM Systems
by: Feng, Lang, et al.
Published: (2026)
by: Feng, Lang, et al.
Published: (2026)
On the Importance of Task Complexity in Evaluating LLM-Based Multi-Agent Systems
by: Tang, Bohan, et al.
Published: (2025)
by: Tang, Bohan, et al.
Published: (2025)
Multi-View Encoders for Performance Prediction in LLM-Based Agentic Workflows
by: Trirat, Patara, et al.
Published: (2025)
by: Trirat, Patara, et al.
Published: (2025)
Harnessing Agentic Evolution
by: Zhang, Jiayi, et al.
Published: (2026)
by: Zhang, Jiayi, et al.
Published: (2026)
GSM-Agent: Understanding Agentic Reasoning Using Controllable Environments
by: Zhu, Hanlin, et al.
Published: (2025)
by: Zhu, Hanlin, et al.
Published: (2025)
Scaling Graph Chain-of-Thought Reasoning: A Multi-Agent Framework with Efficient LLM Serving
by: Huan, Chengying, et al.
Published: (2025)
by: Huan, Chengying, et al.
Published: (2025)
Fed-SE: Federated Self-Evolution for Privacy-Constrained Multi-Environment LLM Agents
by: Chen, Xiang, et al.
Published: (2025)
by: Chen, Xiang, et al.
Published: (2025)
Agentic Neural Networks: Self-Evolving Multi-Agent Systems via Textual Backpropagation
by: Ma, Xiaowen, et al.
Published: (2025)
by: Ma, Xiaowen, et al.
Published: (2025)
Pay Attention to Small Weights
by: Zhou, Chao, et al.
Published: (2025)
by: Zhou, Chao, et al.
Published: (2025)
Temporal-Aware Graph Attention Network for Cryptocurrency Transaction Fraud Detection
by: Zheng, Zhi, et al.
Published: (2025)
by: Zheng, Zhi, et al.
Published: (2025)
Similar Items
-
Long Context, Less Focus: A Scaling Gap in LLMs Revealed through Privacy and Personalization
by: Gu, Shangding
Published: (2026) -
Understanding Agent Scaling in LLM-Based Multi-Agent Systems via Diversity
by: Yang, Yingxuan, et al.
Published: (2026) -
Agentic Web: Weaving the Next Web with AI Agents
by: Yang, Yingxuan, et al.
Published: (2025) -
Data Uniformity Improves Training Efficiency and More, with a Convergence Framework Beyond the NTK Regime
by: Wang, Yuqing, et al.
Published: (2025) -
Safe and Balanced: A Framework for Constrained Multi-Objective Reinforcement Learning
by: Gu, Shangding, et al.
Published: (2024)