| Main Authors: | Banerjee, Siddhartha, Sinclair, Sean R., Tambe, Milind, Xu, Lily, Yu, Christina Lee |
|---|---|
| Format: | Preprint |
| Published: | 2022 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2210.00025 |
Similar Items
Adaptive Discretization in Online Reinforcement Learning
by: Sinclair, Sean R., et al.
Published: (2021)
Context in Public Health for Underserved Communities: A Bayesian Approach to Online Restless Bandits
by: Liang, Biyonka, et al.
Published: (2024)
Dual-Mandate Patrols: Multi-Armed Bandits for Green Security
by: Xu, Lily, et al.
Published: (2020)
Combining Diverse Information for Coordinated Action: Stochastic Bandit Algorithms for Heterogeneous Agents
by: Gordon, Lucia, et al.
Published: (2024)
The Bandit Whisperer: Communication Learning for Restless Bandits
by: Zhao, Yunfan, et al.
Published: (2024)
Reinforcement learning with combinatorial actions for coupled restless bandits
by: Xu, Lily, et al.
Published: (2025)
Online Fair Allocation of Perishable Resources
by: Banerjee, Siddhartha, et al.
Published: (2024)
Balancing Act: Prioritization Strategies for LLM-Designed Restless Bandit Rewards
by: Verma, Shresth, et al.
Published: (2024)
IRL for Restless Multi-Armed Bandits with Applications in Maternal and Child Health
by: Jain, Gauri, et al.
Published: (2024)
Bayesian Collaborative Bandits with Thompson Sampling for Improved Outreach in Maternal Health Program
by: Dasgupta, Arpan, et al.
Published: (2024)
Decisions and Deployment: The Five-Year SAHELI Project (2020-2025) on Restless Multi-Armed Bandits for Improving Maternal and Child Health
by: Verma, Shresth, et al.
Published: (2026)
Analyzing Cost-Sensitive Surrogate Losses via $\mathcal{H}$-calibration
by: Shah, Sanket, et al.
Published: (2025)
A Decision-Language Model (DLM) for Dynamic Restless Multi-Armed Bandit Tasks in Public Health
by: Behari, Nikhil, et al.
Published: (2024)
Fairness for Workers Who Pull the Arms: An Index Based Policy for Allocation of Restless Bandit Tasks
by: Biswas, Arpita, et al.
Published: (2023)
Generative AI Against Poaching: Latent Composite Flow Matching for Wildlife Conservation
by: Kong, Lingkai, et al.
Published: (2025)
Finite-Horizon Single-Pull Restless Bandits: An Efficient Index Policy For Scarce Resource Allocation
by: Xiong, Guojun, et al.
Published: (2025)
The SMART approach to instance-optimal online learning
by: Banerjee, Siddhartha, et al.
Published: (2024)
Towards a Pretrained Model for Restless Bandits via Multi-arm Generalization
by: Zhao, Yunfan, et al.
Published: (2023)
Composite Flow Matching for Reinforcement Learning with Shifted-Dynamics Data
by: Kong, Lingkai, et al.
Published: (2025)
Latent Spherical Flow Policy for Reinforcement Learning with Combinatorial Actions
by: Kong, Lingkai, et al.
Published: (2026)
Multilinguality in LLM-Designed Reward Functions for Restless Bandits: Effects on Task Performance and Fairness
by: Parthasarathy, Ambreesh, et al.
Published: (2025)
Improving Health Information Access in the World's Largest Maternal Mobile Health Program via Bandit Algorithms
by: Lalan, Arshika, et al.
Published: (2024)
Leaving the Nest: Going Beyond Local Loss Functions for Predict-Then-Optimize
by: Shah, Sanket, et al.
Published: (2023)
Efficient Public Health Intervention Planning Using Decomposition-Based Decision-Focused Learning
by: Shah, Sanket, et al.
Published: (2024)
Lightweight Robust Direct Preference Optimization
by: Kim, Cheol Woo, et al.
Published: (2025)
Preference Robustness for DPO with Applications to Public Health
by: Kim, Cheol Woo, et al.
Published: (2025)
The Data-Driven Censored Newsvendor Problem
by: Hssaine, Chamsi, et al.
Published: (2024)
Contrasting local and global modeling with machine learning and satellite data: A case study estimating tree canopy height in African savannas
by: Rolf, Esther, et al.
Published: (2024)
A Reduction Algorithm for Markovian Contextual Linear Bandits
by: Buyukkalayci, Kaan, et al.
Published: (2026)
On Diffusion Models for Multi-Agent Partial Observability: Shared Attractors, Error Bounds, and Composite Flow
by: Wang, Tonghan, et al.
Published: (2024)
Improving the Prediction of Individual Engagement in Recommendations Using Cognitive Models
by: Seow, Roderick, et al.
Published: (2024)
Rule-Bottleneck Reinforcement Learning: Joint Explanation and Decision Optimization for Resource Allocation with Language Agents
by: Tec, Mauricio, et al.
Published: (2025)
SelfReplay: Adapting Self-Supervised Sensory Models via Adaptive Meta-Task Replay
by: Yoon, Hyungjun, et al.
Published: (2024)
Beyond Majority Voting: LLM Aggregation by Leveraging Higher-Order Information
by: Ai, Rui, et al.
Published: (2025)
Reward Shaping for Inference-Time Alignment: A Stackelberg Game Perspective
by: Wang, Haichuan, et al.
Published: (2026)
Reinforcement Learning in MDPs with Information-Ordered Policies
by: Zhang, Zhongjun, et al.
Published: (2025)
Offline-Online Reinforcement Learning for Linear Mixture MDPs
by: Zhang, Zhongjun, et al.
Published: (2026)
What is the Right Notion of Distance between Predict-then-Optimize Tasks?
by: Rodriguez-Diaz, Paula, et al.
Published: (2024)
Evaluating the Effectiveness of Index-Based Treatment Allocation
by: Boehmer, Niclas, et al.
Published: (2024)
Navigating the Social Welfare Frontier: Portfolios for Multi-objective Reinforcement Learning
by: Kim, Cheol Woo, et al.
Published: (2025)