| Main Authors: | Park, Chanwoo, Han, Seungju, Guo, Xingzhi, Ozdaglar, Asuman, Zhang, Kaiqing, Kim, Joo-Kyung |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.18439 |
Similar Items
Post-Training LLMs as Better Decision-Making Agents: A Regret-Minimization Approach
by: Park, Chanwoo, et al.
Published: (2025)
Multi-Player Zero-Sum Markov Games with Networked Separable Interactions
by: Park, Chanwoo, et al.
Published: (2023)
Do LLM Agents Have Regret? A Case Study in Online Learning and Games
by: Park, Chanwoo, et al.
Published: (2024)
RLHF from Heterogeneous Feedback via Personalization and Preference Aggregation
by: Park, Chanwoo, et al.
Published: (2024)
Offline Reinforcement Learning via Linear-Programming with Error-Bound Induced Constraints
by: Ozdaglar, Asuman, et al.
Published: (2022)
The Power of Regularization in Solving Extensive-Form Games
by: Liu, Mingyang, et al.
Published: (2022)
Mirror Duality in Convex Optimization
by: Kim, Jaeyeon, et al.
Published: (2023)
Last-Iterate Convergence of Payoff-Based Independent Learning in Zero-Sum Stochastic Games
by: Chen, Zaiwei, et al.
Published: (2024)
Optimism as Risk-Seeking in Multi-Agent Reinforcement Learning
by: Zhang, Runyu, et al.
Published: (2025)
Online Learning and Equilibrium Computation with Ranking Feedback
by: Liu, Mingyang, et al.
Published: (2026)
UFT: Unifying Supervised and Reinforcement Fine-Tuning
by: Liu, Mingyang, et al.
Published: (2025)
Equilibrium Selection for Multi-agent Reinforcement Learning: A Unified Framework
by: Zhang, Runyu, et al.
Published: (2024)
Learning Decision-Sufficient Representations for Linear Optimization
by: Ye, Yuhan, et al.
Published: (2026)
Uniformly Stable Algorithms for Adversarial Training and Beyond
by: Xiao, Jiancong, et al.
Published: (2024)
A Unified Linear Programming Framework for Offline Reward Learning from Human Demonstrations and Feedback
by: Kim, Kihyun, et al.
Published: (2024)
LiteEFG: An Efficient Python Library for Solving Extensive-form Games
by: Liu, Mingyang, et al.
Published: (2024)
Computing Equilibrium beyond Unilateral Deviation
by: Liu, Mingyang, et al.
Published: (2026)
A Policy-Gradient Approach to Solving Imperfect-Information Games with Best-Iterate Convergence
by: Liu, Mingyang, et al.
Published: (2024)
Differentially Private Equilibrium Finding in Polymatrix Games
by: Liu, Mingyang, et al.
Published: (2025)
Finite-Sample Guarantees for Learning Dynamics in Zero-Sum Polymatrix Games
by: Faizal, Fathima Zarin, et al.
Published: (2024)
Partially Observable Multi-Agent Reinforcement Learning with Information Sharing
by: Liu, Xiangyu, et al.
Published: (2023)
Beyond RLHF and NLHF: Population-Proportional Alignment under an Axiomatic Framework
by: Kim, Kihyun, et al.
Published: (2025)
Synthetic Mixed Training: Scaling Parametric Knowledge Acquisition Beyond RAG
by: Han, Seungju, et al.
Published: (2026)
A Bayesian Framework for Human-AI Collaboration: Complementarity and Correlation Neglect
by: Amin, Saurabh, et al.
Published: (2026)
Addressing misspecification in contextual optimization
by: Bennouna, Omar, et al.
Published: (2024)
How AI Aggregation Affects Knowledge
by: Acemoglu, Daron, et al.
Published: (2026)
What Data Enables Optimal Decisions? An Exact Characterization for Linear Optimization
by: Bennouna, Omar, et al.
Published: (2025)
Data Informativeness in Linear Optimization under Uncertainty
by: Bennouna, Omar, et al.
Published: (2026)
SafeSearch: Do Not Trade Safety for Utility in LLM Search Agents
by: Zhan, Qiusi, et al.
Published: (2025)
An Effective Energy Mask-based Adversarial Evasion Attacks against Misclassification in Speaker Recognition Systems
by: Park, Chanwoo, et al.
Published: (2026)
Reasoning-Based Approach with Chain-of-Thought for Alzheimer's Detection Using Speech and Large Language Models
by: Park, Chanwoo, et al.
Published: (2025)
Graph Elicitation for Guiding Multi-Step Reasoning in Large Language Models
by: Park, Jinyoung, et al.
Published: (2023)
Zeroth-Order Constrained Optimization from a Control Perspective via Feedback Linearization
by: Zhang, Runyu, et al.
Published: (2025)
Matching of Users and Creators in Two-Sided Markets with Departures
by: Huttenlocher, Daniel, et al.
Published: (2023)
Optimal interventions in opinion dynamics on large-scale, time-varying, random networks
by: Cianfanelli, Leonardo, et al.
Published: (2025)
Enhancing Robustness in Incremental Learning with Adversarial Training
by: Cho, Seungju, et al.
Published: (2023)
Enhancing Pre‐Service Teachers' Understanding of Quadrilaterals Through Wiki‐Supported Collaborative Learning
by: Asuman Duatepe‐Paksu
Published: (2026)
Prism: Spectral Parameter Sharing for Multi-Agent Reinforcement Learning
by: Kim, Kyungbeom, et al.
Published: (2026)
Co-Learning: Code Learning for Multi-Agent Reinforcement Collaborative Framework with Conversational Natural Language Interfaces
by: Yu, Jiapeng, et al.
Published: (2024)
Wikipedia Contributions in the Wake of ChatGPT
by: Lyu, Liang, et al.
Published: (2025)