:: Library Catalog

Bibliographic Details
Main Authors:	Park, Jung Yeon; Bhatt, Sujay; Zeng, Sihan; Wong, Lawson L. S.; Koppel, Alec; Ganesh, Sumitra; Walters, Robin
Format:	Preprint
Published:	2024
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2411.04225

Similar Items

A Hessian-Free Actor-Critic Algorithm for Bi-Level Reinforcement Learning with Applications to LLM Fine-Tuning
by: Zeng, Sihan, et al.
Published: (2026)

Partially Observable Contextual Bandits with Linear Payoffs
by: Zeng, Sihan, et al.
Published: (2024)

Regularized Proportional Fairness Mechanism for Resource Allocation Without Money
by: Zeng, Sihan, et al.
Published: (2025)

Learning Payment-Free Resource Allocation Mechanisms
by: Zeng, Sihan, et al.
Published: (2023)

Learning in Stackelberg Mean Field Games: A Non-Asymptotic Analysis
by: Zeng, Sihan, et al.
Published: (2025)

Rethinking Neural Network Learning Rates: A Stackelberg Perspective
by: Zeng, Sihan, et al.
Published: (2026)

Learning in Herding Mean Field Games: Single-Loop Algorithm with Finite-Time Convergence Analysis
by: Zeng, Sihan, et al.
Published: (2024)

Equivariant Action Sampling for Reinforcement Learning and Planning
by: Zhao, Linfeng, et al.
Published: (2024)

Efficient Inverse Multiagent Learning
by: Goktas, Denizalp, et al.
Published: (2025)

No One Size Fits All: QueryBandits for Hallucination Mitigation
by: Cho, Nicole, et al.
Published: (2026)

QueryBandits for Hallucination Mitigation: Exploiting Semantic Features for No-Regret Rewriting
by: Cho, Nicole, et al.
Published: (2025)

ADAGE: A generic two-layer framework for adaptive agent based modelling
by: Evans, Benjamin Patrick, et al.
Published: (2025)

Smoothness Errors in Dynamics Models and How to Avoid Them
by: Berman, Edward, et al.
Published: (2026)

Decentralized Convergence to Equilibrium Prices in Trading Networks
by: Lock, Edwin, et al.
Published: (2024)

Learning and Calibrating Heterogeneous Bounded Rational Market Behaviour with Multi-Agent Reinforcement Learning
by: Evans, Benjamin Patrick, et al.
Published: (2024)

Scalable Representation Learning for Multimodal Tabular Transactions
by: Raman, Natraj, et al.
Published: (2024)

Relaxed Equivariant Graph Neural Networks
by: Hofgard, Elyssa, et al.
Published: (2024)

On Uncertainty Calibration for Equivariant Functions
by: Berman, Edward, et al.
Published: (2025)

On Universality of Deep Equivariant Networks
by: Pacini, Marco, et al.
Published: (2025)

PARL: A Unified Framework for Policy Alignment in Reinforcement Learning from Human Feedback
by: Chakraborty, Souradip, et al.
Published: (2023)

Fast Two-Time-Scale Stochastic Gradient Method with Applications in Reinforcement Learning
by: Zeng, Sihan, et al.
Published: (2024)

Nonparametric Sparse Online Learning of the Koopman Operator
by: Hou, Boya, et al.
Published: (2025)

Byzantine-Resilient Decentralized Multi-Armed Bandits
by: Zhu, Jingxuan, et al.
Published: (2023)

Sharpened Lazy Incremental Quasi-Newton Method
by: Lahoti, Aakash, et al.
Published: (2023)

Towards Global Optimality for Practical Average Reward Reinforcement Learning without Mixing Time Oracles
by: Patel, Bhrij, et al.
Published: (2024)

Fourier Transporter: Bi-Equivariant Robotic Manipulation in 3D
by: Huang, Haojie, et al.
Published: (2024)

Entropy-informed Decoding: Adaptive Information-Driven Branching
by: Evans, Benjamin Patrick, et al.
Published: (2026)

Collab: Controlled Decoding using Mixture of Agents for LLM Alignment
by: Chakraborty, Souradip, et al.
Published: (2025)

Equivariant Diffusion Policy
by: Wang, Dian, et al.
Published: (2024)

Natural Policy Gradient and Actor Critic Methods for Constrained Multi-Task Reinforcement Learning
by: Zeng, Sihan, et al.
Published: (2024)

A Two-Time-Scale Stochastic Optimization Framework with Applications in Control and Reinforcement Learning
by: Zeng, Sihan, et al.
Published: (2021)

Enabling Approximate Joint Sampling in Diffusion LMs
by: Bansal, Parikshit, et al.
Published: (2025)

Decentralized Upper Confidence Bound Algorithms for Homogeneous Multi-Agent Multi-Armed Bandits
by: Zhu, Jingxuan, et al.
Published: (2021)

Learning to Navigate in Mazes with Novel Layouts using Abstract Top-down Maps
by: Zhao, Linfeng, et al.
Published: (2024)

Modelling bounded rational decision-making through Wasserstein constraints
by: Evans, Benjamin Patrick, et al.
Published: (2025)

Catoni-Style Change Point Detection for Regret Minimization in Non-Stationary Heavy-Tailed Bandits
by: Genalti, Gianmarco, et al.
Published: (2025)

Partially Equivariant Reinforcement Learning in Symmetry-Breaking Environments
by: Chang, Junwoo, et al.
Published: (2025)

Equivariant Offline Reinforcement Learning
by: Tangri, Arsh, et al.
Published: (2024)

FIRE-GNN: Force-informed, Relaxed Equivariance Graph Neural Network for Rapid and Accurate Prediction of Surface Properties
by: Hsu, Circe, et al.
Published: (2025)

Generating Structured Plan Representation of Procedures with LLMs
by: Garg, Deepeka, et al.
Published: (2025)