:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Banerjee, Siddhartha, Sinclair, Sean R., Tambe, Milind, Xu, Lily, Yu, Christina Lee
Format:	Preprint
Published:	2022
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2210.00025
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Adaptive Discretization in Online Reinforcement Learning
by: Sinclair, Sean R., et al.
Published: (2021)

Context in Public Health for Underserved Communities: A Bayesian Approach to Online Restless Bandits
by: Liang, Biyonka, et al.
Published: (2024)

Dual-Mandate Patrols: Multi-Armed Bandits for Green Security
by: Xu, Lily, et al.
Published: (2020)

Combining Diverse Information for Coordinated Action: Stochastic Bandit Algorithms for Heterogeneous Agents
by: Gordon, Lucia, et al.
Published: (2024)

The Bandit Whisperer: Communication Learning for Restless Bandits
by: Zhao, Yunfan, et al.
Published: (2024)

Reinforcement learning with combinatorial actions for coupled restless bandits
by: Xu, Lily, et al.
Published: (2025)

Online Fair Allocation of Perishable Resources
by: Banerjee, Siddhartha, et al.
Published: (2024)

Balancing Act: Prioritization Strategies for LLM-Designed Restless Bandit Rewards
by: Verma, Shresth, et al.
Published: (2024)

IRL for Restless Multi-Armed Bandits with Applications in Maternal and Child Health
by: Jain, Gauri, et al.
Published: (2024)

Bayesian Collaborative Bandits with Thompson Sampling for Improved Outreach in Maternal Health Program
by: Dasgupta, Arpan, et al.
Published: (2024)

Decisions and Deployment: The Five-Year SAHELI Project (2020-2025) on Restless Multi-Armed Bandits for Improving Maternal and Child Health
by: Verma, Shresth, et al.
Published: (2026)

Analyzing Cost-Sensitive Surrogate Losses via $\mathcal{H}$-calibration
by: Shah, Sanket, et al.
Published: (2025)

A Decision-Language Model (DLM) for Dynamic Restless Multi-Armed Bandit Tasks in Public Health
by: Behari, Nikhil, et al.
Published: (2024)

Fairness for Workers Who Pull the Arms: An Index Based Policy for Allocation of Restless Bandit Tasks
by: Biswas, Arpita, et al.
Published: (2023)

Generative AI Against Poaching: Latent Composite Flow Matching for Wildlife Conservation
by: Kong, Lingkai, et al.
Published: (2025)

Finite-Horizon Single-Pull Restless Bandits: An Efficient Index Policy For Scarce Resource Allocation
by: Xiong, Guojun, et al.
Published: (2025)

The SMART approach to instance-optimal online learning
by: Banerjee, Siddhartha, et al.
Published: (2024)

Towards a Pretrained Model for Restless Bandits via Multi-arm Generalization
by: Zhao, Yunfan, et al.
Published: (2023)

Composite Flow Matching for Reinforcement Learning with Shifted-Dynamics Data
by: Kong, Lingkai, et al.
Published: (2025)

Latent Spherical Flow Policy for Reinforcement Learning with Combinatorial Actions
by: Kong, Lingkai, et al.
Published: (2026)

Multilinguality in LLM-Designed Reward Functions for Restless Bandits: Effects on Task Performance and Fairness
by: Parthasarathy, Ambreesh, et al.
Published: (2025)

Improving Health Information Access in the World's Largest Maternal Mobile Health Program via Bandit Algorithms
by: Lalan, Arshika, et al.
Published: (2024)

Leaving the Nest: Going Beyond Local Loss Functions for Predict-Then-Optimize
by: Shah, Sanket, et al.
Published: (2023)

Efficient Public Health Intervention Planning Using Decomposition-Based Decision-Focused Learning
by: Shah, Sanket, et al.
Published: (2024)

Lightweight Robust Direct Preference Optimization
by: Kim, Cheol Woo, et al.
Published: (2025)

Preference Robustness for DPO with Applications to Public Health
by: Kim, Cheol Woo, et al.
Published: (2025)

The Data-Driven Censored Newsvendor Problem
by: Hssaine, Chamsi, et al.
Published: (2024)

Contrasting local and global modeling with machine learning and satellite data: A case study estimating tree canopy height in African savannas
by: Rolf, Esther, et al.
Published: (2024)

A Reduction Algorithm for Markovian Contextual Linear Bandits
by: Buyukkalayci, Kaan, et al.
Published: (2026)

On Diffusion Models for Multi-Agent Partial Observability: Shared Attractors, Error Bounds, and Composite Flow
by: Wang, Tonghan, et al.
Published: (2024)

Improving the Prediction of Individual Engagement in Recommendations Using Cognitive Models
by: Seow, Roderick, et al.
Published: (2024)

Rule-Bottleneck Reinforcement Learning: Joint Explanation and Decision Optimization for Resource Allocation with Language Agents
by: Tec, Mauricio, et al.
Published: (2025)

SelfReplay: Adapting Self-Supervised Sensory Models via Adaptive Meta-Task Replay
by: Yoon, Hyungjun, et al.
Published: (2024)

Beyond Majority Voting: LLM Aggregation by Leveraging Higher-Order Information
by: Ai, Rui, et al.
Published: (2025)

Reward Shaping for Inference-Time Alignment: A Stackelberg Game Perspective
by: Wang, Haichuan, et al.
Published: (2026)

Reinforcement Learning in MDPs with Information-Ordered Policies
by: Zhang, Zhongjun, et al.
Published: (2025)

Offline-Online Reinforcement Learning for Linear Mixture MDPs
by: Zhang, Zhongjun, et al.
Published: (2026)

What is the Right Notion of Distance between Predict-then-Optimize Tasks?
by: Rodriguez-Diaz, Paula, et al.
Published: (2024)

Evaluating the Effectiveness of Index-Based Treatment Allocation
by: Boehmer, Niclas, et al.
Published: (2024)

Navigating the Social Welfare Frontier: Portfolios for Multi-objective Reinforcement Learning
by: Kim, Cheol Woo, et al.
Published: (2025)