:: Library Catalog

Bibliographic Details
Main Authors:	Donâncio, Henrique; Barrier, Antoine; South, Leah F.; Forbes, Florence
Format:	Preprint
Published:	2024
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2410.12598

Similar Items

The Pump Scheduling Problem: A Real-World Scenario for Reinforcement Learning
by: Donâncio, Henrique, et al.
Published: (2022)

Adaptive Action Duration with Contextual Bandits for Deep Reinforcement Learning in Dynamic Environments
by: Verma, Abhishek, et al.
Published: (2025)

Learning Rate Optimization for Deep Neural Networks Using Lipschitz Bandits
by: Priyanka, Padma, et al.
Published: (2024)

The Polynomial Stein Discrepancy for Assessing Moment Convergence
by: Srinivasan, Narayan, et al.
Published: (2024)

Uniform Last-Iterate Guarantee for Bandits and Reinforcement Learning
by: Liu, Junyan, et al.
Published: (2024)

PSAT: Pediatric Segmentation Approaches via Adult Augmentations and Transfer Learning
by: Kirscher, Tristan, et al.
Published: (2025)

Restless Bandits with Individual Penalty Constraints: Near-Optimal Indices and Deep Reinforcement Learning
by: Zamir, Nida, et al.
Published: (2026)

LLMs Are In-Context Bandit Reinforcement Learners
by: Monea, Giovanni, et al.
Published: (2024)

Deep Reinforcement Learning based Triggering Function for Early Classifiers of Time Series
by: Renault, Aurélien, et al.
Published: (2025)

Convergence of projected stochastic natural gradient variational inference for various step size and sample or batch size schedules
by: Guilmeau, Thomas, et al.
Published: (2026)

Networked Restless Multi-Arm Bandits with Reinforcement Learning
by: Zhang, Hanmo, et al.
Published: (2025)

Contextual Rollout Bandits for Reinforcement Learning with Verifiable Rewards
by: Lu, Xiaodong, et al.
Published: (2026)

Deep Reinforcement Learning: A Convex Optimization Approach
by: Gattami, Ather
Published: (2024)

Multi-Objective Adaptive Rate Limiting in Microservices Using Deep Reinforcement Learning
by: Lyu, Ning, et al.
Published: (2025)

Bayesian Experimental Design via Contrastive Diffusions
by: Iollo, Jacopo, et al.
Published: (2024)

Learning for Bandits under Action Erasures
by: Hanna, Osama, et al.
Published: (2024)

MARVEL: MR Fingerprinting with Additional micRoVascular Estimates using bidirectional LSTMs
by: Barrier, Antoine, et al.
Published: (2024)

Dynamic Trust Calibration Using Contextual Bandits
by: Henrique, Bruno M., et al.
Published: (2025)

Pure Exploration for a Good Policy in Reinforcement Learning with Bandit Feedback
by: Li, Zitian, et al.
Published: (2026)

Unified Framework of Distributional Regret in Multi-Armed Bandits and Reinforcement Learning
by: Lee, Harin, et al.
Published: (2026)

Learning to Attack: A Bandit Approach to Adversarial Context Poisoning
by: Telikani, Ray, et al.
Published: (2026)

On the Hardness of Bandit Learning
by: Brukhim, Nataly, et al.
Published: (2025)

Provably Efficient Reinforcement Learning for Adversarial Restless Multi-Armed Bandits with Unknown Transitions and Bandit Feedback
by: Xiong, Guojun, et al.
Published: (2024)

Online Learning to Rank under Corruption: A Robust Cascading Bandits Approach
by: Ghaffari, Fatemeh, et al.
Published: (2025)

Combinatorial Multivariant Multi-Armed Bandits with Applications to Episodic Reinforcement Learning and Beyond
by: Liu, Xutong, et al.
Published: (2024)

Incentivized Learning in Principal-Agent Bandit Games
by: Scheid, Antoine, et al.
Published: (2024)

Faster Rates for Private Adversarial Bandits
by: Asi, Hilal, et al.
Published: (2025)

Solving The Dynamic Volatility Fitting Problem: A Deep Reinforcement Learning Approach
by: Gnabeyeu, Emmanuel, et al.
Published: (2024)

Symmetry-Preserving Architecture for Multi-NUMA Environments (SPANE): A Deep Reinforcement Learning Approach for Dynamic VM Scheduling
by: Chan, Tin Ping, et al.
Published: (2025)

Rating-based Reinforcement Learning
by: White, Devin, et al.
Published: (2023)

Reinforcement Learning for Machine Learning Model Deployment: Evaluating Multi-Armed Bandits in ML Ops Environments
by: McClendon, S. Aaron, et al.
Published: (2025)

PASOA- PArticle baSed Bayesian Optimal Adaptive design
by: Iollo, Jacopo, et al.
Published: (2024)

Predictable Reinforcement Learning Dynamics through Entropy Rate Minimization
by: Ornia, Daniel Jarne, et al.
Published: (2023)

The Bandit Whisperer: Communication Learning for Restless Bandits
by: Zhao, Yunfan, et al.
Published: (2024)

Enhancing Robustness in Deep Reinforcement Learning: A Lyapunov Exponent Approach
by: Young, Rory, et al.
Published: (2024)

Jointly-Learned Exit and Inference for a Dynamic Neural Network : JEI-DNN
by: Regol, Florence, et al.
Published: (2023)

Few-Shot Learning for Dynamic Operations of Automated Electric Taxi Fleets under Evolving Charging Infrastructure: A Meta-Deep Reinforcement Learning Approach
by: Li, Xiaozhuang, et al.
Published: (2026)

Fast Rates for Inverse Reinforcement Learning
by: Schlaginhaufen, Andreas, et al.
Published: (2026)

Deep Proxy Causal Learning and its Application to Confounded Bandit Policy Evaluation
by: Xu, Liyuan, et al.
Published: (2021)

Optimal Control of Fluid Restless Multi-armed Bandits: A Machine Learning Approach
by: Bertsimas, Dimitris, et al.
Published: (2025)