Saved in:
| Main Authors: | Ashkezari, Sajad, Ben-David, Shai |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.17103 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Robust Online Learning
by: Ashkezari, Sajad
Published: (2026)
by: Ashkezari, Sajad
Published: (2026)
Beyond Bandit Feedback in Online Multiclass Classification
by: van der Hoeven, Dirk, et al.
Published: (2021)
by: van der Hoeven, Dirk, et al.
Published: (2021)
Multiclass Online Learnability under Bandit Feedback
by: Raman, Ananth, et al.
Published: (2023)
by: Raman, Ananth, et al.
Published: (2023)
Bandit-Feedback Online Multiclass Classification: Variants and Tradeoffs
by: Filmus, Yuval, et al.
Published: (2024)
by: Filmus, Yuval, et al.
Published: (2024)
Multiclass Transductive Online Learning
by: Hanneke, Steve, et al.
Published: (2024)
by: Hanneke, Steve, et al.
Published: (2024)
Universal Multiclass Transductive Online Learning
by: Hanneke, Steve, et al.
Published: (2026)
by: Hanneke, Steve, et al.
Published: (2026)
The Sample Complexity of Multiclass and Sparse Contextual Bandits
by: Erez, Liad, et al.
Published: (2026)
by: Erez, Liad, et al.
Published: (2026)
The Real Price of Bandit Information in Multiclass Classification
by: Erez, Liad, et al.
Published: (2024)
by: Erez, Liad, et al.
Published: (2024)
Fast Rates for Bandit PAC Multiclass Classification
by: Erez, Liad, et al.
Published: (2024)
by: Erez, Liad, et al.
Published: (2024)
Understanding Aggregations of Proper Learners in Multiclass Classification
by: Asilis, Julian, et al.
Published: (2024)
by: Asilis, Julian, et al.
Published: (2024)
Learning from positive and unlabeled examples -Finite size sample bounds
by: Mansouri, Farnam, et al.
Published: (2025)
by: Mansouri, Farnam, et al.
Published: (2025)
Online Budget Allocation with Censored Semi-Bandit Feedback
by: Bachoc, François, et al.
Published: (2025)
by: Bachoc, François, et al.
Published: (2025)
A Novel Data-Dependent Learning Paradigm for Large Hypothesis Classes
by: Pour, Alireza F., et al.
Published: (2025)
by: Pour, Alireza F., et al.
Published: (2025)
How Many Iterations to Jailbreak? Dynamic Budget Allocation for Multi-Turn LLM Evaluation
by: Feldman, Shai, et al.
Published: (2026)
by: Feldman, Shai, et al.
Published: (2026)
Principal-Agent Bandit Games with Self-Interested and Exploratory Learning Agents
by: Liu, Junyan, et al.
Published: (2024)
by: Liu, Junyan, et al.
Published: (2024)
Confounded Budgeted Causal Bandits
by: Jamshidi, Fateme, et al.
Published: (2024)
by: Jamshidi, Fateme, et al.
Published: (2024)
Multi-Task Combinatorial Bandits for Budget Allocation
by: Ge, Lin, et al.
Published: (2024)
by: Ge, Lin, et al.
Published: (2024)
Online Bandit Learning with Offline Preference Data for Improved RLHF
by: Agnihotri, Akhil, et al.
Published: (2024)
by: Agnihotri, Akhil, et al.
Published: (2024)
Multi-Agent Lipschitz Bandits
by: Chakraborty, Sourav, et al.
Published: (2026)
by: Chakraborty, Sourav, et al.
Published: (2026)
Online Structured Prediction with Fenchel--Young Losses and Improved Surrogate Regret for Online Multiclass Classification with Logistic Loss
by: Sakaue, Shinsaku, et al.
Published: (2024)
by: Sakaue, Shinsaku, et al.
Published: (2024)
Learning with Multiple Correct Answers -- A Trichotomy of Regret Bounds under Different Feedback Models
by: Pour, Alireza F., et al.
Published: (2026)
by: Pour, Alireza F., et al.
Published: (2026)
Improving Thompson Sampling via Information Relaxation for Budgeted Multi-armed Bandits
by: Jeong, Woojin, et al.
Published: (2024)
by: Jeong, Woojin, et al.
Published: (2024)
Improved Online Confidence Bounds for Multinomial Logistic Bandits
by: Lee, Joongkyu, et al.
Published: (2025)
by: Lee, Joongkyu, et al.
Published: (2025)
Incentivized Learning in Principal-Agent Bandit Games
by: Scheid, Antoine, et al.
Published: (2024)
by: Scheid, Antoine, et al.
Published: (2024)
Contextual Bandits with Budgeted Information Reveal
by: Gan, Kyra, et al.
Published: (2023)
by: Gan, Kyra, et al.
Published: (2023)
On the Computability of Multiclass PAC Learning
by: Gourdeau, Pascale, et al.
Published: (2025)
by: Gourdeau, Pascale, et al.
Published: (2025)
Regularization and Optimal Multiclass Learning
by: Asilis, Julian, et al.
Published: (2023)
by: Asilis, Julian, et al.
Published: (2023)
Improved Regret Bounds for Online Fair Division with Bandit Learning
by: Schiffer, Benjamin, et al.
Published: (2025)
by: Schiffer, Benjamin, et al.
Published: (2025)
Collaborating in Multi-Armed Bandits with Strategic Agents
by: Barnea, Idan, et al.
Published: (2026)
by: Barnea, Idan, et al.
Published: (2026)
Active learning from positive and unlabeled examples
by: Mansouri, Farnam, et al.
Published: (2026)
by: Mansouri, Farnam, et al.
Published: (2026)
Bandit Pareto Set Identification: the Fixed Budget Setting
by: Kone, Cyrille, et al.
Published: (2023)
by: Kone, Cyrille, et al.
Published: (2023)
LLMs Are In-Context Bandit Reinforcement Learners
by: Monea, Giovanni, et al.
Published: (2024)
by: Monea, Giovanni, et al.
Published: (2024)
Multi-Objective Multi-Agent Bandits: From Learning Efficiency to Fairness Optimization
by: Wang, John, et al.
Published: (2026)
by: Wang, John, et al.
Published: (2026)
Multi-Agent Stochastic Bandits Robust to Adversarial Corruptions
by: Ghaffari, Fatemeh, et al.
Published: (2024)
by: Ghaffari, Fatemeh, et al.
Published: (2024)
Adaptive Sample Sharing for Multi Agent Linear Bandits
by: Cherkaoui, Hamza, et al.
Published: (2023)
by: Cherkaoui, Hamza, et al.
Published: (2023)
BAGEN: Are LLM Agents Budget-Aware?
by: Lin, Yuxiang, et al.
Published: (2026)
by: Lin, Yuxiang, et al.
Published: (2026)
Personalized Reinforcement Learning with a Budget of Policies
by: Ivanov, Dmitry, et al.
Published: (2024)
by: Ivanov, Dmitry, et al.
Published: (2024)
GOAL: A Generalist Combinatorial Optimization Agent Learner
by: Drakulic, Darko, et al.
Published: (2024)
by: Drakulic, Darko, et al.
Published: (2024)
Fixed-Budget Constrained Best Arm Identification in Grouped Bandits
by: Mukherjee, Raunak, et al.
Published: (2026)
by: Mukherjee, Raunak, et al.
Published: (2026)
Fixed-Budget Change Point Identification in Piecewise Constant Bandits
by: Lazzaro, Joseph, et al.
Published: (2025)
by: Lazzaro, Joseph, et al.
Published: (2025)
Similar Items
-
Robust Online Learning
by: Ashkezari, Sajad
Published: (2026) -
Beyond Bandit Feedback in Online Multiclass Classification
by: van der Hoeven, Dirk, et al.
Published: (2021) -
Multiclass Online Learnability under Bandit Feedback
by: Raman, Ananth, et al.
Published: (2023) -
Bandit-Feedback Online Multiclass Classification: Variants and Tradeoffs
by: Filmus, Yuval, et al.
Published: (2024) -
Multiclass Transductive Online Learning
by: Hanneke, Steve, et al.
Published: (2024)