Saved in:
| Main Authors: | Blanchard, Moïse, Goyal, Vineet |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.10313 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Individual Regret in Cooperative Stochastic Multi-Armed Bandits
by: Barnea, Idan, et al.
Published: (2024)
by: Barnea, Idan, et al.
Published: (2024)
Graph-Dependent Regret Bounds in Multi-Armed Bandits with Interference
by: Jamshidi, Fateme, et al.
Published: (2025)
by: Jamshidi, Fateme, et al.
Published: (2025)
Bandit Max-Min Fair Allocation
by: Harada, Tsubasa, et al.
Published: (2025)
by: Harada, Tsubasa, et al.
Published: (2025)
Unified Framework of Distributional Regret in Multi-Armed Bandits and Reinforcement Learning
by: Lee, Harin, et al.
Published: (2026)
by: Lee, Harin, et al.
Published: (2026)
Collaborating in Multi-Armed Bandits with Strategic Agents
by: Barnea, Idan, et al.
Published: (2026)
by: Barnea, Idan, et al.
Published: (2026)
Stochastic Multi-Objective Multi-Armed Bandits: Regret Definition and Algorithm
by: Davoodi, Mansoor, et al.
Published: (2025)
by: Davoodi, Mansoor, et al.
Published: (2025)
Near-Optimal Regret for KL-Regularized Multi-Armed Bandits
by: Ji, Kaixuan, et al.
Published: (2026)
by: Ji, Kaixuan, et al.
Published: (2026)
On the Benefits of Free Exploration for Regret Minimization in Multi-Armed Bandits
by: Hou, Yunlong, et al.
Published: (2026)
by: Hou, Yunlong, et al.
Published: (2026)
Asymptotically and Minimax Optimal Regret Bounds for Multi-Armed Bandits with Abstention
by: Yang, Junwen, et al.
Published: (2024)
by: Yang, Junwen, et al.
Published: (2024)
Materials Discovery using Max K-Armed Bandit
by: Kikkawa, Nobuaki, et al.
Published: (2022)
by: Kikkawa, Nobuaki, et al.
Published: (2022)
Understanding Memory-Regret Trade-Off for Streaming Stochastic Multi-Armed Bandits
by: He, Yuchen, et al.
Published: (2024)
by: He, Yuchen, et al.
Published: (2024)
Agnostic Smoothed Online Learning without Knowledge of the Base Measure
by: Blanchard, Moïse
Published: (2024)
by: Blanchard, Moïse
Published: (2024)
Multi-Armed Bandits with Interference
by: Jia, Su, et al.
Published: (2024)
by: Jia, Su, et al.
Published: (2024)
Imprecise Multi-Armed Bandits
by: Kosoy, Vanessa
Published: (2024)
by: Kosoy, Vanessa
Published: (2024)
MNL-Bandit with Knapsacks: a near-optimal algorithm
by: Aznag, Abdellah, et al.
Published: (2021)
by: Aznag, Abdellah, et al.
Published: (2021)
Collaborative Multi-Agent Heterogeneous Multi-Armed Bandits
by: Chawla, Ronshee, et al.
Published: (2023)
by: Chawla, Ronshee, et al.
Published: (2023)
Flickering Multi-Armed Bandits
by: Chakraborty, Sourav, et al.
Published: (2026)
by: Chakraborty, Sourav, et al.
Published: (2026)
Online Min-Max Optimization: From Individual Regrets to Cumulative Saddle Points
by: Vyas, Abhijeet, et al.
Published: (2026)
by: Vyas, Abhijeet, et al.
Published: (2026)
Multi-Armed Bandits with Network Interference
by: Agarwal, Abhineet, et al.
Published: (2024)
by: Agarwal, Abhineet, et al.
Published: (2024)
Introduction to Multi-Armed Bandits
by: Slivkins, Aleksandrs
Published: (2019)
by: Slivkins, Aleksandrs
Published: (2019)
Gradient Descent is Pareto-Optimal in the Oracle Complexity and Memory Tradeoff for Feasibility Problems
by: Blanchard, Moise
Published: (2024)
by: Blanchard, Moise
Published: (2024)
Near-Optimal Privacy-Preserving Learning for Max-Min Fair Multi-Agent Bandits
by: Leshem, Amir
Published: (2023)
by: Leshem, Amir
Published: (2023)
On the Regularity and Fairness of Combinatorial Multi-Armed Bandit
by: Wu, Xiaoyi, et al.
Published: (2025)
by: Wu, Xiaoyi, et al.
Published: (2025)
Rising Multi-Armed Bandits with Known Horizons
by: Song, Seockbean, et al.
Published: (2026)
by: Song, Seockbean, et al.
Published: (2026)
Optimal Streaming Algorithms for Multi-Armed Bandits
by: Jin, Tianyuan, et al.
Published: (2024)
by: Jin, Tianyuan, et al.
Published: (2024)
MinMaxMin $Q$-learning
by: Soffair, Nitsan, et al.
Published: (2024)
by: Soffair, Nitsan, et al.
Published: (2024)
Tight Gap-Dependent Memory-Regret Trade-Off for Single-Pass Streaming Stochastic Multi-Armed Bandits
by: Ye, Zichun, et al.
Published: (2025)
by: Ye, Zichun, et al.
Published: (2025)
Autonomous Drug Design with Multi-Armed Bandits
by: Svensson, Hampus Gummesson, et al.
Published: (2022)
by: Svensson, Hampus Gummesson, et al.
Published: (2022)
Put CASH on Bandits: A Max K-Armed Problem for Automated Machine Learning
by: Balef, Amir Rezaei, et al.
Published: (2025)
by: Balef, Amir Rezaei, et al.
Published: (2025)
Byzantine-Resilient Decentralized Multi-Armed Bandits
by: Zhu, Jingxuan, et al.
Published: (2023)
by: Zhu, Jingxuan, et al.
Published: (2023)
Multi-Armed Bandits With Best-Action Queries
by: Bacchiocchi, Francesco, et al.
Published: (2026)
by: Bacchiocchi, Francesco, et al.
Published: (2026)
Stochastic Multi-Armed Bandits with Limited Control Variates
by: Verma, Arun, et al.
Published: (2026)
by: Verma, Arun, et al.
Published: (2026)
Optimism in the Face of Ambiguity Principle for Multi-Armed Bandits
by: Li, Mengmeng, et al.
Published: (2024)
by: Li, Mengmeng, et al.
Published: (2024)
Federated Multi-Armed Bandits Under Byzantine Attacks
by: Saday, Artun, et al.
Published: (2022)
by: Saday, Artun, et al.
Published: (2022)
Distributed Algorithms for Multi-Agent Multi-Armed Bandits with Collision
by: Zhou, Daoyuan, et al.
Published: (2025)
by: Zhou, Daoyuan, et al.
Published: (2025)
Distributionally-Constrained Adversaries in Online Learning
by: Blanchard, Moïse, et al.
Published: (2025)
by: Blanchard, Moïse, et al.
Published: (2025)
Adversarial Attacks on Combinatorial Multi-Armed Bandits
by: Balasubramanian, Rishab, et al.
Published: (2023)
by: Balasubramanian, Rishab, et al.
Published: (2023)
Unlearning Offline Stochastic Multi-Armed Bandits
by: Ye, Zichun, et al.
Published: (2026)
by: Ye, Zichun, et al.
Published: (2026)
Distribution-Free Sequential Prediction with Abstentions
by: Yu, Jialin, et al.
Published: (2026)
by: Yu, Jialin, et al.
Published: (2026)
Global Rewards in Restless Multi-Armed Bandits
by: Raman, Naveen, et al.
Published: (2024)
by: Raman, Naveen, et al.
Published: (2024)
Similar Items
-
Individual Regret in Cooperative Stochastic Multi-Armed Bandits
by: Barnea, Idan, et al.
Published: (2024) -
Graph-Dependent Regret Bounds in Multi-Armed Bandits with Interference
by: Jamshidi, Fateme, et al.
Published: (2025) -
Bandit Max-Min Fair Allocation
by: Harada, Tsubasa, et al.
Published: (2025) -
Unified Framework of Distributional Regret in Multi-Armed Bandits and Reinforcement Learning
by: Lee, Harin, et al.
Published: (2026) -
Collaborating in Multi-Armed Bandits with Strategic Agents
by: Barnea, Idan, et al.
Published: (2026)