Library Catalog

Bibliographic Details
Main Authors:	Park, Chanwoo; Chen, Ziyang; Ozdaglar, Asuman; Zhang, Kaiqing
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2511.04393

Similar Items

Do LLM Agents Have Regret? A Case Study in Online Learning and Games
by: Park, Chanwoo, et al.
Published: (2024)

MAPoRL: Multi-Agent Post-Co-Training for Collaborative Large Language Models with Reinforcement Learning
by: Park, Chanwoo, et al.
Published: (2025)

RLHF from Heterogeneous Feedback via Personalization and Preference Aggregation
by: Park, Chanwoo, et al.
Published: (2024)

Multi-Player Zero-Sum Markov Games with Networked Separable Interactions
by: Park, Chanwoo, et al.
Published: (2023)

A Policy-Gradient Approach to Solving Imperfect-Information Games with Best-Iterate Convergence
by: Liu, Mingyang, et al.
Published: (2024)

LiteEFG: An Efficient Python Library for Solving Extensive-form Games
by: Liu, Mingyang, et al.
Published: (2024)

A Unified Linear Programming Framework for Offline Reward Learning from Human Demonstrations and Feedback
by: Kim, Kihyun, et al.
Published: (2024)

Differentially Private Equilibrium Finding in Polymatrix Games
by: Liu, Mingyang, et al.
Published: (2025)

Beyond RLHF and NLHF: Population-Proportional Alignment under an Axiomatic Framework
by: Kim, Kihyun, et al.
Published: (2025)

Computing Equilibrium beyond Unilateral Deviation
by: Liu, Mingyang, et al.
Published: (2026)

Offline Reinforcement Learning via Linear-Programming with Error-Bound Induced Constraints
by: Ozdaglar, Asuman, et al.
Published: (2022)

How AI Aggregation Affects Knowledge
by: Acemoglu, Daron, et al.
Published: (2026)

The Power of Regularization in Solving Extensive-Form Games
by: Liu, Mingyang, et al.
Published: (2022)

MDAgents: An Adaptive Collaboration of LLMs for Medical Decision-Making
by: Kim, Yubin, et al.
Published: (2024)

Last-Iterate Convergence of Payoff-Based Independent Learning in Zero-Sum Stochastic Games
by: Chen, Zaiwei, et al.
Published: (2024)

Reasoning-Based Approach with Chain-of-Thought for Alzheimer's Detection Using Speech and Large Language Models
by: Park, Chanwoo, et al.
Published: (2025)

Mirror Duality in Convex Optimization
by: Kim, Jaeyeon, et al.
Published: (2023)

Learning Decision-Sufficient Representations for Linear Optimization
by: Ye, Yuhan, et al.
Published: (2026)

Online Learning and Equilibrium Computation with Ranking Feedback
by: Liu, Mingyang, et al.
Published: (2026)

Beyond Line-Level Filtering for the Pretraining Corpora of LLMs
by: Park, Chanwoo, et al.
Published: (2025)

Can LLM Agents Simulate Dynamic Networks? A Case Study on Email Networks with Phishing Synthesis
by: Miao, Siqi, et al.
Published: (2026)

Achieving >97% on GSM8K: Deeply Understanding the Problems Makes LLMs Better Solvers for Math Word Problems
by: Zhong, Qihuang, et al.
Published: (2024)

Conformal Prediction and Human Decision Making
by: Hullman, Jessica, et al.
Published: (2025)

ToMPO: Training LLM Strategic Decision Making from a Multi-Agent Perspective
by: Zhang, Yiwen, et al.
Published: (2025)

Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making
by: Li, Manling, et al.
Published: (2024)

Better World Models Can Lead to Better Post-Training Performance
by: Gupta, Prakhar, et al.
Published: (2025)

Adversarial Testing in LLMs: Insights into Decision-Making Vulnerabilities
by: Zhang, Lili, et al.
Published: (2025)

Synthesizing Post-Training Data for LLMs through Multi-Agent Simulation
by: Tang, Shuo, et al.
Published: (2024)

Thunder-Tok: Minimizing Tokens per Word in Tokenizing Korean Texts for Generative Language Models
by: Cho, Gyeongje, et al.
Published: (2025)

Divide and Conquer: Grounding LLMs as Efficient Decision-Making Agents via Offline Hierarchical Reinforcement Learning
by: Hu, Zican, et al.
Published: (2025)

LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities
by: Schmied, Thomas, et al.
Published: (2025)

Position: Foundation Agents as the Paradigm Shift for Decision Making
by: Liu, Xiaoqian, et al.
Published: (2024)

Online Mixture of Experts: No-Regret Learning for Optimal Collective Decision-Making
by: Liu, Larkin, et al.
Published: (2025)

PIANIST: Learning Partially Observable World Models with LLMs for Multi-Agent Decision Making
by: Light, Jonathan, et al.
Published: (2024)

Cognitive Bias in Decision-Making with LLMs
by: Echterhoff, Jessica, et al.
Published: (2024)

LLMs for Explainable Business Decision-Making: A Reinforcement Learning Fine-Tuning Approach
by: Cheng, Xiang, et al.
Published: (2025)

Parallelizing Counterfactual Regret Minimization
by: Kim, Juho, et al.
Published: (2026)

How Far Are We on the Decision-Making of LLMs? Evaluating LLMs' Gaming Ability in Multi-Agent Environments
by: Huang, Jen-tse, et al.
Published: (2024)

MetaAgents: Large Language Model Based Agents for Decision-Making on Teaming
by: Li, Yuan, et al.
Published: (2023)

Controlling Large Language Model-based Agents for Large-Scale Decision-Making: An Actor-Critic Approach
by: Zhang, Bin, et al.
Published: (2023)