:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Zamir, Nida, Hou, I-Hong
Format:	Preprint
Published:	2024
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2408.07205
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Restless Bandits with Individual Penalty Constraints: Near-Optimal Indices and Deep Reinforcement Learning
by: Zamir, Nida, et al.
Published: (2026)

Neural Index Policies for Restless Multi-Action Bandits with Heterogeneous Budgets
by: Pandey, Himadri S., et al.
Published: (2025)

GINO-Q: Learning an Asymptotically Optimal Index Policy for Restless Multi-armed Bandits
by: Chen, Gongpu, et al.
Published: (2024)

Lagrangian Index Policy for Restless Bandits with Average Reward
by: Avrachenkov, Konstantin, et al.
Published: (2024)

Finite-Horizon Single-Pull Restless Bandits: An Efficient Index Policy For Scarce Resource Allocation
by: Xiong, Guojun, et al.
Published: (2025)

IRL for Restless Multi-Armed Bandits with Applications in Maternal and Child Health
by: Jain, Gauri, et al.
Published: (2024)

Fairness of Exposure in Online Restless Multi-armed Bandits
by: Sood, Archit, et al.
Published: (2024)

Global Rewards in Restless Multi-Armed Bandits
by: Raman, Naveen, et al.
Published: (2024)

Networked Restless Multi-Arm Bandits with Reinforcement Learning
by: Zhang, Hanmo, et al.
Published: (2025)

Lagrangian Relaxation for Multi-Action Partially Observable Restless Bandits: Heuristic Policies and Indexability
by: Meshram, Rahul, et al.
Published: (2025)

MARBLE: Multi-Armed Restless Bandits in Latent Markovian Environment
by: Amiri, Mohsen, et al.
Published: (2025)

Non-Stationary Restless Multi-Armed Bandits with Provable Guarantee
by: Hung, Yu-Heng, et al.
Published: (2025)

Whittle Index Learning Algorithms for Restless Bandits with Constant Stepsizes
by: Mittal, Vishesh, et al.
Published: (2024)

Restless Linear Bandits
by: Khaleghi, Azadeh
Published: (2024)

Fairness for Workers Who Pull the Arms: An Index Based Policy for Allocation of Restless Bandit Tasks
by: Biswas, Arpita, et al.
Published: (2023)

The Bandit Whisperer: Communication Learning for Restless Bandits
by: Zhao, Yunfan, et al.
Published: (2024)

Distributed No-Regret Learning for Multi-Stage Systems with End-to-End Bandit Feedback
by: Hou, I-Hong
Published: (2024)

A Federated Online Restless Bandit Framework for Cooperative Resource Allocation
by: Tong, Jingwen, et al.
Published: (2024)

Provably Efficient Reinforcement Learning for Adversarial Restless Multi-Armed Bandits with Unknown Transitions and Bandit Feedback
by: Xiong, Guojun, et al.
Published: (2024)

Optimal Control of Fluid Restless Multi-armed Bandits: A Machine Learning Approach
by: Bertsimas, Dimitris, et al.
Published: (2025)

Towards a Pretrained Model for Restless Bandits via Multi-arm Generalization
by: Zhao, Yunfan, et al.
Published: (2023)

ContextWIN: Whittle Index Based Mixture-of-Experts Neural Model For Restless Bandits Via Deep RL
by: Guo, Zhanqiu, et al.
Published: (2024)

Model Predictive Control is almost Optimal for Heterogeneous Restless Multi-armed Bandits
by: Narasimha, Dheeraj, et al.
Published: (2025)

Contextual Restless Multi-Armed Bandits with Application to Demand Response Decision-Making
by: Chen, Xin, et al.
Published: (2024)

Faster Q-Learning Algorithms for Restless Bandits
by: Kakarapalli, Parvish, et al.
Published: (2024)

Bridging Rested and Restless Bandits with Graph-Triggering: Rising and Rotting
by: Genalti, Gianmarco, et al.
Published: (2024)

General Formulation and PCL-Analysis for Restless Bandits with Limited Observability
by: Liu, Keqin, et al.
Published: (2023)

A Modularized Framework for Piecewise-Stationary Restless Bandits
by: Li, Kuan-Ta, et al.
Published: (2026)

Low-Complexity Algorithm for Restless Bandits with Imperfect Observations
by: Liu, Keqin, et al.
Published: (2021)

Model Predictive Control is Almost Optimal for Restless Bandit
by: Gast, Nicolas, et al.
Published: (2024)

Adapter-Augmented Bandits for Online Multi-Constrained Multi-Modal Inference Scheduling
by: Zhang, Xianzhi, et al.
Published: (2026)

A Decision-Language Model (DLM) for Dynamic Restless Multi-Armed Bandit Tasks in Public Health
by: Behari, Nikhil, et al.
Published: (2024)

Optimal Best Arm Identification with Fixed Confidence in Restless Bandits
by: Karthik, P. N., et al.
Published: (2023)

Multi-Action Restless Bandits with Weakly Coupled Constraints: Simultaneous Learning and Control
by: Fu, Jing, et al.
Published: (2024)

Restless Bandits with Average Reward: Breaking the Uniform Global Attractor Assumption
by: Hong, Yige, et al.
Published: (2023)

Online Learning of Whittle Indices for Restless Bandits with Non-Stationary Transition Kernels
by: Shisher, Md Kamran Chowdhury, et al.
Published: (2025)

From Restless to Contextual: A Thresholding Bandit Reformulation For Finite-horizon Improvement
by: Xu, Jiamin, et al.
Published: (2025)

DOPL: Direct Online Preference Learning for Restless Bandits with Preference Feedback
by: Xiong, Guojun, et al.
Published: (2024)

Decisions and Deployment: The Five-Year SAHELI Project (2020-2025) on Restless Multi-Armed Bandits for Improving Maternal and Child Health
by: Verma, Shresth, et al.
Published: (2026)

Distributed Learning in Markovian Restless Bandits over Interference Graphs for Stable Spectrum Sharing
by: Didi, Liad Lea, et al.
Published: (2025)