Saved in:
| Main Authors: | Donâncio, Henrique, Barrier, Antoine, South, Leah F., Forbes, Florence |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2410.12598 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
The Pump Scheduling Problem: A Real-World Scenario for Reinforcement Learning
by: Donâncio, Henrique, et al.
Published: (2022)
by: Donâncio, Henrique, et al.
Published: (2022)
Adaptive Action Duration with Contextual Bandits for Deep Reinforcement Learning in Dynamic Environments
by: Verma, Abhishek, et al.
Published: (2025)
by: Verma, Abhishek, et al.
Published: (2025)
Learning Rate Optimization for Deep Neural Networks Using Lipschitz Bandits
by: Priyanka, Padma, et al.
Published: (2024)
by: Priyanka, Padma, et al.
Published: (2024)
The Polynomial Stein Discrepancy for Assessing Moment Convergence
by: Srinivasan, Narayan, et al.
Published: (2024)
by: Srinivasan, Narayan, et al.
Published: (2024)
Uniform Last-Iterate Guarantee for Bandits and Reinforcement Learning
by: Liu, Junyan, et al.
Published: (2024)
by: Liu, Junyan, et al.
Published: (2024)
PSAT: Pediatric Segmentation Approaches via Adult Augmentations and Transfer Learning
by: Kirscher, Tristan, et al.
Published: (2025)
by: Kirscher, Tristan, et al.
Published: (2025)
Restless Bandits with Individual Penalty Constraints: Near-Optimal Indices and Deep Reinforcement Learning
by: Zamir, Nida, et al.
Published: (2026)
by: Zamir, Nida, et al.
Published: (2026)
LLMs Are In-Context Bandit Reinforcement Learners
by: Monea, Giovanni, et al.
Published: (2024)
by: Monea, Giovanni, et al.
Published: (2024)
Deep Reinforcement Learning based Triggering Function for Early Classifiers of Time Series
by: Renault, Aurélien, et al.
Published: (2025)
by: Renault, Aurélien, et al.
Published: (2025)
Convergence of projected stochastic natural gradient variational inference for various step size and sample or batch size schedules
by: Guilmeau, Thomas, et al.
Published: (2026)
by: Guilmeau, Thomas, et al.
Published: (2026)
Networked Restless Multi-Arm Bandits with Reinforcement Learning
by: Zhang, Hanmo, et al.
Published: (2025)
by: Zhang, Hanmo, et al.
Published: (2025)
Contextual Rollout Bandits for Reinforcement Learning with Verifiable Rewards
by: Lu, Xiaodong, et al.
Published: (2026)
by: Lu, Xiaodong, et al.
Published: (2026)
Deep Reinforcement Learning: A Convex Optimization Approach
by: Gattami, Ather
Published: (2024)
by: Gattami, Ather
Published: (2024)
Multi-Objective Adaptive Rate Limiting in Microservices Using Deep Reinforcement Learning
by: Lyu, Ning, et al.
Published: (2025)
by: Lyu, Ning, et al.
Published: (2025)
Bayesian Experimental Design via Contrastive Diffusions
by: Iollo, Jacopo, et al.
Published: (2024)
by: Iollo, Jacopo, et al.
Published: (2024)
Learning for Bandits under Action Erasures
by: Hanna, Osama, et al.
Published: (2024)
by: Hanna, Osama, et al.
Published: (2024)
MARVEL: MR Fingerprinting with Additional micRoVascular Estimates using bidirectional LSTMs
by: Barrier, Antoine, et al.
Published: (2024)
by: Barrier, Antoine, et al.
Published: (2024)
Dynamic Trust Calibration Using Contextual Bandits
by: Henrique, Bruno M., et al.
Published: (2025)
by: Henrique, Bruno M., et al.
Published: (2025)
Pure Exploration for a Good Policy in Reinforcement Learning with Bandit Feedback
by: Li, Zitian, et al.
Published: (2026)
by: Li, Zitian, et al.
Published: (2026)
Unified Framework of Distributional Regret in Multi-Armed Bandits and Reinforcement Learning
by: Lee, Harin, et al.
Published: (2026)
by: Lee, Harin, et al.
Published: (2026)
Learning to Attack: A Bandit Approach to Adversarial Context Poisoning
by: Telikani, Ray, et al.
Published: (2026)
by: Telikani, Ray, et al.
Published: (2026)
On the Hardness of Bandit Learning
by: Brukhim, Nataly, et al.
Published: (2025)
by: Brukhim, Nataly, et al.
Published: (2025)
Provably Efficient Reinforcement Learning for Adversarial Restless Multi-Armed Bandits with Unknown Transitions and Bandit Feedback
by: Xiong, Guojun, et al.
Published: (2024)
by: Xiong, Guojun, et al.
Published: (2024)
Online Learning to Rank under Corruption: A Robust Cascading Bandits Approach
by: Ghaffari, Fatemeh, et al.
Published: (2025)
by: Ghaffari, Fatemeh, et al.
Published: (2025)
Combinatorial Multivariant Multi-Armed Bandits with Applications to Episodic Reinforcement Learning and Beyond
by: Liu, Xutong, et al.
Published: (2024)
by: Liu, Xutong, et al.
Published: (2024)
Incentivized Learning in Principal-Agent Bandit Games
by: Scheid, Antoine, et al.
Published: (2024)
by: Scheid, Antoine, et al.
Published: (2024)
Faster Rates for Private Adversarial Bandits
by: Asi, Hilal, et al.
Published: (2025)
by: Asi, Hilal, et al.
Published: (2025)
Solving The Dynamic Volatility Fitting Problem: A Deep Reinforcement Learning Approach
by: Gnabeyeu, Emmanuel, et al.
Published: (2024)
by: Gnabeyeu, Emmanuel, et al.
Published: (2024)
Symmetry-Preserving Architecture for Multi-NUMA Environments (SPANE): A Deep Reinforcement Learning Approach for Dynamic VM Scheduling
by: Chan, Tin Ping, et al.
Published: (2025)
by: Chan, Tin Ping, et al.
Published: (2025)
Rating-based Reinforcement Learning
by: White, Devin, et al.
Published: (2023)
by: White, Devin, et al.
Published: (2023)
Reinforcement Learning for Machine Learning Model Deployment: Evaluating Multi-Armed Bandits in ML Ops Environments
by: McClendon, S. Aaron, et al.
Published: (2025)
by: McClendon, S. Aaron, et al.
Published: (2025)
PASOA- PArticle baSed Bayesian Optimal Adaptive design
by: Iollo, Jacopo, et al.
Published: (2024)
by: Iollo, Jacopo, et al.
Published: (2024)
Predictable Reinforcement Learning Dynamics through Entropy Rate Minimization
by: Ornia, Daniel Jarne, et al.
Published: (2023)
by: Ornia, Daniel Jarne, et al.
Published: (2023)
The Bandit Whisperer: Communication Learning for Restless Bandits
by: Zhao, Yunfan, et al.
Published: (2024)
by: Zhao, Yunfan, et al.
Published: (2024)
Enhancing Robustness in Deep Reinforcement Learning: A Lyapunov Exponent Approach
by: Young, Rory, et al.
Published: (2024)
by: Young, Rory, et al.
Published: (2024)
Jointly-Learned Exit and Inference for a Dynamic Neural Network : JEI-DNN
by: Regol, Florence, et al.
Published: (2023)
by: Regol, Florence, et al.
Published: (2023)
Few-Shot Learning for Dynamic Operations of Automated Electric Taxi Fleets under Evolving Charging Infrastructure: A Meta-Deep Reinforcement Learning Approach
by: Li, Xiaozhuang, et al.
Published: (2026)
by: Li, Xiaozhuang, et al.
Published: (2026)
Fast Rates for Inverse Reinforcement Learning
by: Schlaginhaufen, Andreas, et al.
Published: (2026)
by: Schlaginhaufen, Andreas, et al.
Published: (2026)
Deep Proxy Causal Learning and its Application to Confounded Bandit Policy Evaluation
by: Xu, Liyuan, et al.
Published: (2021)
by: Xu, Liyuan, et al.
Published: (2021)
Optimal Control of Fluid Restless Multi-armed Bandits: A Machine Learning Approach
by: Bertsimas, Dimitris, et al.
Published: (2025)
by: Bertsimas, Dimitris, et al.
Published: (2025)
Similar Items
-
The Pump Scheduling Problem: A Real-World Scenario for Reinforcement Learning
by: Donâncio, Henrique, et al.
Published: (2022) -
Adaptive Action Duration with Contextual Bandits for Deep Reinforcement Learning in Dynamic Environments
by: Verma, Abhishek, et al.
Published: (2025) -
Learning Rate Optimization for Deep Neural Networks Using Lipschitz Bandits
by: Priyanka, Padma, et al.
Published: (2024) -
The Polynomial Stein Discrepancy for Assessing Moment Convergence
by: Srinivasan, Narayan, et al.
Published: (2024) -
Uniform Last-Iterate Guarantee for Bandits and Reinforcement Learning
by: Liu, Junyan, et al.
Published: (2024)