Saved in:
| Main Authors: | Park, Jung Yeon, Bhatt, Sujay, Zeng, Sihan, Wong, Lawson L. S., Koppel, Alec, Ganesh, Sumitra, Walters, Robin |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2411.04225 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
A Hessian-Free Actor-Critic Algorithm for Bi-Level Reinforcement Learning with Applications to LLM Fine-Tuning
by: Zeng, Sihan, et al.
Published: (2026)
by: Zeng, Sihan, et al.
Published: (2026)
Partially Observable Contextual Bandits with Linear Payoffs
by: Zeng, Sihan, et al.
Published: (2024)
by: Zeng, Sihan, et al.
Published: (2024)
Regularized Proportional Fairness Mechanism for Resource Allocation Without Money
by: Zeng, Sihan, et al.
Published: (2025)
by: Zeng, Sihan, et al.
Published: (2025)
Learning Payment-Free Resource Allocation Mechanisms
by: Zeng, Sihan, et al.
Published: (2023)
by: Zeng, Sihan, et al.
Published: (2023)
Learning in Stackelberg Mean Field Games: A Non-Asymptotic Analysis
by: Zeng, Sihan, et al.
Published: (2025)
by: Zeng, Sihan, et al.
Published: (2025)
Rethinking Neural Network Learning Rates: A Stackelberg Perspective
by: Zeng, Sihan, et al.
Published: (2026)
by: Zeng, Sihan, et al.
Published: (2026)
Learning in Herding Mean Field Games: Single-Loop Algorithm with Finite-Time Convergence Analysis
by: Zeng, Sihan, et al.
Published: (2024)
by: Zeng, Sihan, et al.
Published: (2024)
Equivariant Action Sampling for Reinforcement Learning and Planning
by: Zhao, Linfeng, et al.
Published: (2024)
by: Zhao, Linfeng, et al.
Published: (2024)
Efficient Inverse Multiagent Learning
by: Goktas, Denizalp, et al.
Published: (2025)
by: Goktas, Denizalp, et al.
Published: (2025)
No One Size Fits All: QueryBandits for Hallucination Mitigation
by: Cho, Nicole, et al.
Published: (2026)
by: Cho, Nicole, et al.
Published: (2026)
QueryBandits for Hallucination Mitigation: Exploiting Semantic Features for No-Regret Rewriting
by: Cho, Nicole, et al.
Published: (2025)
by: Cho, Nicole, et al.
Published: (2025)
ADAGE: A generic two-layer framework for adaptive agent based modelling
by: Evans, Benjamin Patrick, et al.
Published: (2025)
by: Evans, Benjamin Patrick, et al.
Published: (2025)
Smoothness Errors in Dynamics Models and How to Avoid Them
by: Berman, Edward, et al.
Published: (2026)
by: Berman, Edward, et al.
Published: (2026)
Decentralized Convergence to Equilibrium Prices in Trading Networks
by: Lock, Edwin, et al.
Published: (2024)
by: Lock, Edwin, et al.
Published: (2024)
Learning and Calibrating Heterogeneous Bounded Rational Market Behaviour with Multi-Agent Reinforcement Learning
by: Evans, Benjamin Patrick, et al.
Published: (2024)
by: Evans, Benjamin Patrick, et al.
Published: (2024)
Scalable Representation Learning for Multimodal Tabular Transactions
by: Raman, Natraj, et al.
Published: (2024)
by: Raman, Natraj, et al.
Published: (2024)
Relaxed Equivariant Graph Neural Networks
by: Hofgard, Elyssa, et al.
Published: (2024)
by: Hofgard, Elyssa, et al.
Published: (2024)
On Uncertainty Calibration for Equivariant Functions
by: Berman, Edward, et al.
Published: (2025)
by: Berman, Edward, et al.
Published: (2025)
On Universality of Deep Equivariant Networks
by: Pacini, Marco, et al.
Published: (2025)
by: Pacini, Marco, et al.
Published: (2025)
PARL: A Unified Framework for Policy Alignment in Reinforcement Learning from Human Feedback
by: Chakraborty, Souradip, et al.
Published: (2023)
by: Chakraborty, Souradip, et al.
Published: (2023)
Fast Two-Time-Scale Stochastic Gradient Method with Applications in Reinforcement Learning
by: Zeng, Sihan, et al.
Published: (2024)
by: Zeng, Sihan, et al.
Published: (2024)
Nonparametric Sparse Online Learning of the Koopman Operator
by: Hou, Boya, et al.
Published: (2025)
by: Hou, Boya, et al.
Published: (2025)
Byzantine-Resilient Decentralized Multi-Armed Bandits
by: Zhu, Jingxuan, et al.
Published: (2023)
by: Zhu, Jingxuan, et al.
Published: (2023)
Sharpened Lazy Incremental Quasi-Newton Method
by: Lahoti, Aakash, et al.
Published: (2023)
by: Lahoti, Aakash, et al.
Published: (2023)
Towards Global Optimality for Practical Average Reward Reinforcement Learning without Mixing Time Oracles
by: Patel, Bhrij, et al.
Published: (2024)
by: Patel, Bhrij, et al.
Published: (2024)
Fourier Transporter: Bi-Equivariant Robotic Manipulation in 3D
by: Huang, Haojie, et al.
Published: (2024)
by: Huang, Haojie, et al.
Published: (2024)
Entropy-informed Decoding: Adaptive Information-Driven Branching
by: Evans, Benjamin Patrick, et al.
Published: (2026)
by: Evans, Benjamin Patrick, et al.
Published: (2026)
Collab: Controlled Decoding using Mixture of Agents for LLM Alignment
by: Chakraborty, Souradip, et al.
Published: (2025)
by: Chakraborty, Souradip, et al.
Published: (2025)
Equivariant Diffusion Policy
by: Wang, Dian, et al.
Published: (2024)
by: Wang, Dian, et al.
Published: (2024)
Natural Policy Gradient and Actor Critic Methods for Constrained Multi-Task Reinforcement Learning
by: Zeng, Sihan, et al.
Published: (2024)
by: Zeng, Sihan, et al.
Published: (2024)
A Two-Time-Scale Stochastic Optimization Framework with Applications in Control and Reinforcement Learning
by: Zeng, Sihan, et al.
Published: (2021)
by: Zeng, Sihan, et al.
Published: (2021)
Enabling Approximate Joint Sampling in Diffusion LMs
by: Bansal, Parikshit, et al.
Published: (2025)
by: Bansal, Parikshit, et al.
Published: (2025)
Decentralized Upper Confidence Bound Algorithms for Homogeneous Multi-Agent Multi-Armed Bandits
by: Zhu, Jingxuan, et al.
Published: (2021)
by: Zhu, Jingxuan, et al.
Published: (2021)
Learning to Navigate in Mazes with Novel Layouts using Abstract Top-down Maps
by: Zhao, Linfeng, et al.
Published: (2024)
by: Zhao, Linfeng, et al.
Published: (2024)
Modelling bounded rational decision-making through Wasserstein constraints
by: Evans, Benjamin Patrick, et al.
Published: (2025)
by: Evans, Benjamin Patrick, et al.
Published: (2025)
Catoni-Style Change Point Detection for Regret Minimization in Non-Stationary Heavy-Tailed Bandits
by: Genalti, Gianmarco, et al.
Published: (2025)
by: Genalti, Gianmarco, et al.
Published: (2025)
Partially Equivariant Reinforcement Learning in Symmetry-Breaking Environments
by: Chang, Junwoo, et al.
Published: (2025)
by: Chang, Junwoo, et al.
Published: (2025)
Equivariant Offline Reinforcement Learning
by: Tangri, Arsh, et al.
Published: (2024)
by: Tangri, Arsh, et al.
Published: (2024)
FIRE-GNN: Force-informed, Relaxed Equivariance Graph Neural Network for Rapid and Accurate Prediction of Surface Properties
by: Hsu, Circe, et al.
Published: (2025)
by: Hsu, Circe, et al.
Published: (2025)
Generating Structured Plan Representation of Procedures with LLMs
by: Garg, Deepeka, et al.
Published: (2025)
by: Garg, Deepeka, et al.
Published: (2025)
Similar Items
-
A Hessian-Free Actor-Critic Algorithm for Bi-Level Reinforcement Learning with Applications to LLM Fine-Tuning
by: Zeng, Sihan, et al.
Published: (2026) -
Partially Observable Contextual Bandits with Linear Payoffs
by: Zeng, Sihan, et al.
Published: (2024) -
Regularized Proportional Fairness Mechanism for Resource Allocation Without Money
by: Zeng, Sihan, et al.
Published: (2025) -
Learning Payment-Free Resource Allocation Mechanisms
by: Zeng, Sihan, et al.
Published: (2023) -
Learning in Stackelberg Mean Field Games: A Non-Asymptotic Analysis
by: Zeng, Sihan, et al.
Published: (2025)