:: Library Catalog

Bibliographic Details
Main Authors:	Liu, Xianyang; Gu, Shangding; Song, Dawn
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence; Machine Learning
Online Access:	https://arxiv.org/abs/2602.06008

Similar Items

Long Context, Less Focus: A Scaling Gap in LLMs Revealed through Privacy and Personalization
by: Gu, Shangding
Published: (2026)

Understanding Agent Scaling in LLM-Based Multi-Agent Systems via Diversity
by: Yang, Yingxuan, et al.
Published: (2026)

Agentic Web: Weaving the Next Web with AI Agents
by: Yang, Yingxuan, et al.
Published: (2025)

Data Uniformity Improves Training Efficiency and More, with a Convergence Framework Beyond the NTK Regime
by: Wang, Yuqing, et al.
Published: (2025)

Safe and Balanced: A Framework for Constrained Multi-Objective Reinforcement Learning
by: Gu, Shangding, et al.
Published: (2024)

What Makes a Sale? Rethinking End-to-End Seller–Buyer Retail Dynamics with LLM Agents
by: Choi, Jeonghwan, et al.
Published: (2026)

MemFail: Stress-Testing Failure Modes of LLM Memory Systems
by: Garg, Ishir, et al.
Published: (2026)

Paying Less Generalization Tax: A Cross-Domain Generalization Study of RL Training for LLM Agents
by: Liu, Zhihan, et al.
Published: (2026)

LLMs Should Express Uncertainty Explicitly
by: Guo, Junyu, et al.
Published: (2026)

StyleBench: Evaluating thinking styles in Large Language Models
by: Guo, Junyu, et al.
Published: (2025)

A Review of Safe Reinforcement Learning: Methods, Theory and Applications
by: Gu, Shangding, et al.
Published: (2022)

Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulation
by: Gu, Shangding, et al.
Published: (2024)

AgentFlux: Decoupled Fine-Tuning & Inference for On-Device Agentic Systems
by: Kadekodi, Rohan, et al.
Published: (2025)

When Do Multi-Agent Systems Outperform? Analysing the Learning Efficiency of Agentic Systems
by: Su, Junwei, et al.
Published: (2026)

LeakAgent: RL-based Red-teaming Agent for LLM Privacy Leakage
by: Nie, Yuzhou, et al.
Published: (2024)

Agentic Unlearning: When LLM Agent Meets Machine Unlearning
by: Wang, Bin, et al.
Published: (2026)

Safe Multi-Agent Reinforcement Learning with Bilevel Optimization in Autonomous Driving
by: Zheng, Zhi, et al.
Published: (2024)

Hybrid Agentic AI and Multi-Agent Systems in Smart Manufacturing
by: Farahani, Mojtaba A., et al.
Published: (2025)

A Benchmark for Multi-Party Negotiation Games from Real Negotiation Data
by: Benac, Leo, et al.
Published: (2026)

Contextual Dynamic Pricing with Strategic Buyers
by: Liu, Pangpang, et al.
Published: (2023)

Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning
by: Gu, Shangding, et al.
Published: (2025)

[Re] Benchmarking LLM Capabilities in Negotiation through Scoreable Games
by: Pollo, Jorge Carrasco, et al.
Published: (2026)

When Reasoning Models Hurt Behavioral Simulation: A Solver-Sampler Mismatch in Multi-Agent LLM Negotiation
by: Andric, Sandro
Published: (2026)

ARM: Discovering Agentic Reasoning Modules for Generalizable Multi-Agent Systems
by: Yao, Bohan, et al.
Published: (2025)

Game-theoretic LLM: Agent Workflow for Negotiation Games
by: Hua, Wenyue, et al.
Published: (2024)

dLLM: Simple Diffusion Language Modeling
by: Zhou, Zhanhui, et al.
Published: (2026)

What Limits Agentic Systems Efficiency?
by: Bian, Song, et al.
Published: (2025)

GoAgent: Group-of-Agents Communication Topology Generation for LLM-based Multi-Agent Systems
by: Chen, Hongjiang, et al.
Published: (2026)

Multi-Agent Debate: A Unified Agentic Framework for Tabular Anomaly Detection
by: Wang, Pinqiao, et al.
Published: (2026)

MASPO: Joint Prompt Optimization for LLM-based Multi-Agent Systems
by: Wang, Zhexuan, et al.
Published: (2026)

Dr. MAS: Stable Reinforcement Learning for Multi-Agent LLM Systems
by: Feng, Lang, et al.
Published: (2026)

On the Importance of Task Complexity in Evaluating LLM-Based Multi-Agent Systems
by: Tang, Bohan, et al.
Published: (2025)

Multi-View Encoders for Performance Prediction in LLM-Based Agentic Workflows
by: Trirat, Patara, et al.
Published: (2025)

Harnessing Agentic Evolution
by: Zhang, Jiayi, et al.
Published: (2026)

GSM-Agent: Understanding Agentic Reasoning Using Controllable Environments
by: Zhu, Hanlin, et al.
Published: (2025)

Scaling Graph Chain-of-Thought Reasoning: A Multi-Agent Framework with Efficient LLM Serving
by: Huan, Chengying, et al.
Published: (2025)

Fed-SE: Federated Self-Evolution for Privacy-Constrained Multi-Environment LLM Agents
by: Chen, Xiang, et al.
Published: (2025)

Agentic Neural Networks: Self-Evolving Multi-Agent Systems via Textual Backpropagation
by: Ma, Xiaowen, et al.
Published: (2025)

Pay Attention to Small Weights
by: Zhou, Chao, et al.
Published: (2025)

Temporal-Aware Graph Attention Network for Cryptocurrency Transaction Fraud Detection
by: Zheng, Zhi, et al.
Published: (2025)