:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Park, Chanwoo, Han, Seungju, Guo, Xingzhi, Ozdaglar, Asuman, Zhang, Kaiqing, Kim, Joo-Kyung
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2502.18439
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Post-Training LLMs as Better Decision-Making Agents: A Regret-Minimization Approach
by: Park, Chanwoo, et al.
Published: (2025)

Multi-Player Zero-Sum Markov Games with Networked Separable Interactions
by: Park, Chanwoo, et al.
Published: (2023)

Do LLM Agents Have Regret? A Case Study in Online Learning and Games
by: Park, Chanwoo, et al.
Published: (2024)

RLHF from Heterogeneous Feedback via Personalization and Preference Aggregation
by: Park, Chanwoo, et al.
Published: (2024)

Offline Reinforcement Learning via Linear-Programming with Error-Bound Induced Constraints
by: Ozdaglar, Asuman, et al.
Published: (2022)

The Power of Regularization in Solving Extensive-Form Games
by: Liu, Mingyang, et al.
Published: (2022)

Mirror Duality in Convex Optimization
by: Kim, Jaeyeon, et al.
Published: (2023)

Last-Iterate Convergence of Payoff-Based Independent Learning in Zero-Sum Stochastic Games
by: Chen, Zaiwei, et al.
Published: (2024)

Optimism as Risk-Seeking in Multi-Agent Reinforcement Learning
by: Zhang, Runyu, et al.
Published: (2025)

Online Learning and Equilibrium Computation with Ranking Feedback
by: Liu, Mingyang, et al.
Published: (2026)

UFT: Unifying Supervised and Reinforcement Fine-Tuning
by: Liu, Mingyang, et al.
Published: (2025)

Equilibrium Selection for Multi-agent Reinforcement Learning: A Unified Framework
by: Zhang, Runyu, et al.
Published: (2024)

Learning Decision-Sufficient Representations for Linear Optimization
by: Ye, Yuhan, et al.
Published: (2026)

Uniformly Stable Algorithms for Adversarial Training and Beyond
by: Xiao, Jiancong, et al.
Published: (2024)

A Unified Linear Programming Framework for Offline Reward Learning from Human Demonstrations and Feedback
by: Kim, Kihyun, et al.
Published: (2024)

LiteEFG: An Efficient Python Library for Solving Extensive-form Games
by: Liu, Mingyang, et al.
Published: (2024)

Computing Equilibrium beyond Unilateral Deviation
by: Liu, Mingyang, et al.
Published: (2026)

A Policy-Gradient Approach to Solving Imperfect-Information Games with Best-Iterate Convergence
by: Liu, Mingyang, et al.
Published: (2024)

Differentially Private Equilibrium Finding in Polymatrix Games
by: Liu, Mingyang, et al.
Published: (2025)

Finite-Sample Guarantees for Learning Dynamics in Zero-Sum Polymatrix Games
by: Faizal, Fathima Zarin, et al.
Published: (2024)

Partially Observable Multi-Agent Reinforcement Learning with Information Sharing
by: Liu, Xiangyu, et al.
Published: (2023)

Beyond RLHF and NLHF: Population-Proportional Alignment under an Axiomatic Framework
by: Kim, Kihyun, et al.
Published: (2025)

Synthetic Mixed Training: Scaling Parametric Knowledge Acquisition Beyond RAG
by: Han, Seungju, et al.
Published: (2026)

A Bayesian Framework for Human-AI Collaboration: Complementarity and Correlation Neglect
by: Amin, Saurabh, et al.
Published: (2026)

Addressing misspecification in contextual optimization
by: Bennouna, Omar, et al.
Published: (2024)

How AI Aggregation Affects Knowledge
by: Acemoglu, Daron, et al.
Published: (2026)

What Data Enables Optimal Decisions? An Exact Characterization for Linear Optimization
by: Bennouna, Omar, et al.
Published: (2025)

Data Informativeness in Linear Optimization under Uncertainty
by: Bennouna, Omar, et al.
Published: (2026)

SafeSearch: Do Not Trade Safety for Utility in LLM Search Agents
by: Zhan, Qiusi, et al.
Published: (2025)

An Effective Energy Mask-based Adversarial Evasion Attacks against Misclassification in Speaker Recognition Systems
by: Park, Chanwoo, et al.
Published: (2026)

Reasoning-Based Approach with Chain-of-Thought for Alzheimer's Detection Using Speech and Large Language Models
by: Park, Chanwoo, et al.
Published: (2025)

Graph Elicitation for Guiding Multi-Step Reasoning in Large Language Models
by: Park, Jinyoung, et al.
Published: (2023)

Zeroth-Order Constrained Optimization from a Control Perspective via Feedback Linearization
by: Zhang, Runyu, et al.
Published: (2025)

Matching of Users and Creators in Two-Sided Markets with Departures
by: Huttenlocher, Daniel, et al.
Published: (2023)

Optimal interventions in opinion dynamics on large-scale, time-varying, random networks
by: Cianfanelli, Leonardo, et al.
Published: (2025)

Enhancing Robustness in Incremental Learning with Adversarial Training
by: Cho, Seungju, et al.
Published: (2023)

Enhancing Pre‐Service Teachers' Understanding of Quadrilaterals Through Wiki‐Supported Collaborative Learning
by: Asuman Duatepe‐Paksu
Published: (2026)

Prism: Spectral Parameter Sharing for Multi-Agent Reinforcement Learning
by: Kim, Kyungbeom, et al.
Published: (2026)

Co-Learning: Code Learning for Multi-Agent Reinforcement Collaborative Framework with Conversational Natural Language Interfaces
by: Yu, Jiapeng, et al.
Published: (2024)

Wikipedia Contributions in the Wake of ChatGPT
by: Lyu, Liang, et al.
Published: (2025)