Saved in:
| Main Authors: | Zamir, Nida, Hou, I-Hong |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2408.07205 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Restless Bandits with Individual Penalty Constraints: Near-Optimal Indices and Deep Reinforcement Learning
by: Zamir, Nida, et al.
Published: (2026)
by: Zamir, Nida, et al.
Published: (2026)
Neural Index Policies for Restless Multi-Action Bandits with Heterogeneous Budgets
by: Pandey, Himadri S., et al.
Published: (2025)
by: Pandey, Himadri S., et al.
Published: (2025)
GINO-Q: Learning an Asymptotically Optimal Index Policy for Restless Multi-armed Bandits
by: Chen, Gongpu, et al.
Published: (2024)
by: Chen, Gongpu, et al.
Published: (2024)
Lagrangian Index Policy for Restless Bandits with Average Reward
by: Avrachenkov, Konstantin, et al.
Published: (2024)
by: Avrachenkov, Konstantin, et al.
Published: (2024)
Finite-Horizon Single-Pull Restless Bandits: An Efficient Index Policy For Scarce Resource Allocation
by: Xiong, Guojun, et al.
Published: (2025)
by: Xiong, Guojun, et al.
Published: (2025)
IRL for Restless Multi-Armed Bandits with Applications in Maternal and Child Health
by: Jain, Gauri, et al.
Published: (2024)
by: Jain, Gauri, et al.
Published: (2024)
Fairness of Exposure in Online Restless Multi-armed Bandits
by: Sood, Archit, et al.
Published: (2024)
by: Sood, Archit, et al.
Published: (2024)
Global Rewards in Restless Multi-Armed Bandits
by: Raman, Naveen, et al.
Published: (2024)
by: Raman, Naveen, et al.
Published: (2024)
Networked Restless Multi-Arm Bandits with Reinforcement Learning
by: Zhang, Hanmo, et al.
Published: (2025)
by: Zhang, Hanmo, et al.
Published: (2025)
Lagrangian Relaxation for Multi-Action Partially Observable Restless Bandits: Heuristic Policies and Indexability
by: Meshram, Rahul, et al.
Published: (2025)
by: Meshram, Rahul, et al.
Published: (2025)
MARBLE: Multi-Armed Restless Bandits in Latent Markovian Environment
by: Amiri, Mohsen, et al.
Published: (2025)
by: Amiri, Mohsen, et al.
Published: (2025)
Non-Stationary Restless Multi-Armed Bandits with Provable Guarantee
by: Hung, Yu-Heng, et al.
Published: (2025)
by: Hung, Yu-Heng, et al.
Published: (2025)
Whittle Index Learning Algorithms for Restless Bandits with Constant Stepsizes
by: Mittal, Vishesh, et al.
Published: (2024)
by: Mittal, Vishesh, et al.
Published: (2024)
Restless Linear Bandits
by: Khaleghi, Azadeh
Published: (2024)
by: Khaleghi, Azadeh
Published: (2024)
Fairness for Workers Who Pull the Arms: An Index Based Policy for Allocation of Restless Bandit Tasks
by: Biswas, Arpita, et al.
Published: (2023)
by: Biswas, Arpita, et al.
Published: (2023)
The Bandit Whisperer: Communication Learning for Restless Bandits
by: Zhao, Yunfan, et al.
Published: (2024)
by: Zhao, Yunfan, et al.
Published: (2024)
Distributed No-Regret Learning for Multi-Stage Systems with End-to-End Bandit Feedback
by: Hou, I-Hong
Published: (2024)
by: Hou, I-Hong
Published: (2024)
A Federated Online Restless Bandit Framework for Cooperative Resource Allocation
by: Tong, Jingwen, et al.
Published: (2024)
by: Tong, Jingwen, et al.
Published: (2024)
Provably Efficient Reinforcement Learning for Adversarial Restless Multi-Armed Bandits with Unknown Transitions and Bandit Feedback
by: Xiong, Guojun, et al.
Published: (2024)
by: Xiong, Guojun, et al.
Published: (2024)
Optimal Control of Fluid Restless Multi-armed Bandits: A Machine Learning Approach
by: Bertsimas, Dimitris, et al.
Published: (2025)
by: Bertsimas, Dimitris, et al.
Published: (2025)
Towards a Pretrained Model for Restless Bandits via Multi-arm Generalization
by: Zhao, Yunfan, et al.
Published: (2023)
by: Zhao, Yunfan, et al.
Published: (2023)
ContextWIN: Whittle Index Based Mixture-of-Experts Neural Model For Restless Bandits Via Deep RL
by: Guo, Zhanqiu, et al.
Published: (2024)
by: Guo, Zhanqiu, et al.
Published: (2024)
Model Predictive Control is almost Optimal for Heterogeneous Restless Multi-armed Bandits
by: Narasimha, Dheeraj, et al.
Published: (2025)
by: Narasimha, Dheeraj, et al.
Published: (2025)
Contextual Restless Multi-Armed Bandits with Application to Demand Response Decision-Making
by: Chen, Xin, et al.
Published: (2024)
by: Chen, Xin, et al.
Published: (2024)
Faster Q-Learning Algorithms for Restless Bandits
by: Kakarapalli, Parvish, et al.
Published: (2024)
by: Kakarapalli, Parvish, et al.
Published: (2024)
Bridging Rested and Restless Bandits with Graph-Triggering: Rising and Rotting
by: Genalti, Gianmarco, et al.
Published: (2024)
by: Genalti, Gianmarco, et al.
Published: (2024)
General Formulation and PCL-Analysis for Restless Bandits with Limited Observability
by: Liu, Keqin, et al.
Published: (2023)
by: Liu, Keqin, et al.
Published: (2023)
A Modularized Framework for Piecewise-Stationary Restless Bandits
by: Li, Kuan-Ta, et al.
Published: (2026)
by: Li, Kuan-Ta, et al.
Published: (2026)
Low-Complexity Algorithm for Restless Bandits with Imperfect Observations
by: Liu, Keqin, et al.
Published: (2021)
by: Liu, Keqin, et al.
Published: (2021)
Model Predictive Control is Almost Optimal for Restless Bandit
by: Gast, Nicolas, et al.
Published: (2024)
by: Gast, Nicolas, et al.
Published: (2024)
Adapter-Augmented Bandits for Online Multi-Constrained Multi-Modal Inference Scheduling
by: Zhang, Xianzhi, et al.
Published: (2026)
by: Zhang, Xianzhi, et al.
Published: (2026)
A Decision-Language Model (DLM) for Dynamic Restless Multi-Armed Bandit Tasks in Public Health
by: Behari, Nikhil, et al.
Published: (2024)
by: Behari, Nikhil, et al.
Published: (2024)
Optimal Best Arm Identification with Fixed Confidence in Restless Bandits
by: Karthik, P. N., et al.
Published: (2023)
by: Karthik, P. N., et al.
Published: (2023)
Multi-Action Restless Bandits with Weakly Coupled Constraints: Simultaneous Learning and Control
by: Fu, Jing, et al.
Published: (2024)
by: Fu, Jing, et al.
Published: (2024)
Restless Bandits with Average Reward: Breaking the Uniform Global Attractor Assumption
by: Hong, Yige, et al.
Published: (2023)
by: Hong, Yige, et al.
Published: (2023)
Online Learning of Whittle Indices for Restless Bandits with Non-Stationary Transition Kernels
by: Shisher, Md Kamran Chowdhury, et al.
Published: (2025)
by: Shisher, Md Kamran Chowdhury, et al.
Published: (2025)
From Restless to Contextual: A Thresholding Bandit Reformulation For Finite-horizon Improvement
by: Xu, Jiamin, et al.
Published: (2025)
by: Xu, Jiamin, et al.
Published: (2025)
DOPL: Direct Online Preference Learning for Restless Bandits with Preference Feedback
by: Xiong, Guojun, et al.
Published: (2024)
by: Xiong, Guojun, et al.
Published: (2024)
Decisions and Deployment: The Five-Year SAHELI Project (2020-2025) on Restless Multi-Armed Bandits for Improving Maternal and Child Health
by: Verma, Shresth, et al.
Published: (2026)
by: Verma, Shresth, et al.
Published: (2026)
Distributed Learning in Markovian Restless Bandits over Interference Graphs for Stable Spectrum Sharing
by: Didi, Liad Lea, et al.
Published: (2025)
by: Didi, Liad Lea, et al.
Published: (2025)
Similar Items
-
Restless Bandits with Individual Penalty Constraints: Near-Optimal Indices and Deep Reinforcement Learning
by: Zamir, Nida, et al.
Published: (2026) -
Neural Index Policies for Restless Multi-Action Bandits with Heterogeneous Budgets
by: Pandey, Himadri S., et al.
Published: (2025) -
GINO-Q: Learning an Asymptotically Optimal Index Policy for Restless Multi-armed Bandits
by: Chen, Gongpu, et al.
Published: (2024) -
Lagrangian Index Policy for Restless Bandits with Average Reward
by: Avrachenkov, Konstantin, et al.
Published: (2024) -
Finite-Horizon Single-Pull Restless Bandits: An Efficient Index Policy For Scarce Resource Allocation
by: Xiong, Guojun, et al.
Published: (2025)