Saved in:
| Main Authors: | Ji, Wenlong, Pan, Yihan, Zhu, Ruihao, Lei, Lihua |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.16658 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Near-Optimal Regret for KL-Regularized Multi-Armed Bandits
by: Ji, Kaixuan, et al.
Published: (2026)
by: Ji, Kaixuan, et al.
Published: (2026)
On the Optimal Sample Complexity of Offline Multi-Armed Bandits with KL Regularization
by: Ji, Kaixuan, et al.
Published: (2026)
by: Ji, Kaixuan, et al.
Published: (2026)
Identifying All ε-Best Arms in (Misspecified) Linear Bandits
by: Li, Zhekai, et al.
Published: (2025)
by: Li, Zhekai, et al.
Published: (2025)
Kernel $ε$-Greedy for Multi-Armed Bandits with Covariates
by: Arya, Sakshi, et al.
Published: (2023)
by: Arya, Sakshi, et al.
Published: (2023)
Batched Single-Index Global Multi-Armed Bandits with Covariates
by: Arya, Sakshi, et al.
Published: (2025)
by: Arya, Sakshi, et al.
Published: (2025)
Transfer Learning for Contextual Multi-armed Bandits
by: Cai, Changxiao, et al.
Published: (2022)
by: Cai, Changxiao, et al.
Published: (2022)
Multi-Armed Sequential Hypothesis Testing by Betting
by: Sandoval, Ricardo J., et al.
Published: (2026)
by: Sandoval, Ricardo J., et al.
Published: (2026)
On Lai's Upper Confidence Bound in Multi-Armed Bandits
by: Ren, Huachen, et al.
Published: (2024)
by: Ren, Huachen, et al.
Published: (2024)
Optimal Batched Linear Bandits
by: Ren, Xuanfei, et al.
Published: (2024)
by: Ren, Xuanfei, et al.
Published: (2024)
Locally Optimal Fixed-Budget Best Arm Identification in Two-Armed Gaussian Bandits with Unknown Variances
by: Kato, Masahiro
Published: (2023)
by: Kato, Masahiro
Published: (2023)
Model-Agnostic Covariate-Assisted Inference on Partially Identified Causal Effects
by: Ji, Wenlong, et al.
Published: (2023)
by: Ji, Wenlong, et al.
Published: (2023)
Spectral Estimators for Structured Generalized Linear Models via Approximate Message Passing
by: Zhang, Yihan, et al.
Published: (2023)
by: Zhang, Yihan, et al.
Published: (2023)
Optimal Estimation in Orthogonally Invariant Generalized Linear Models: Spectral Initialization and Approximate Message Passing
by: Zhang, Yihan, et al.
Published: (2026)
by: Zhang, Yihan, et al.
Published: (2026)
Predictions as Surrogates: Revisiting Surrogate Outcomes in the Age of AI
by: Ji, Wenlong, et al.
Published: (2025)
by: Ji, Wenlong, et al.
Published: (2025)
Design Experiments to Compare Multi-armed Bandit Algorithms
by: Meng, Huiling, et al.
Published: (2026)
by: Meng, Huiling, et al.
Published: (2026)
Multitask Learning and Bandits via Robust Statistics
by: Xu, Kan, et al.
Published: (2021)
by: Xu, Kan, et al.
Published: (2021)
Precise Asymptotics for Spectral Methods in Mixed Generalized Linear Models
by: Zhang, Yihan, et al.
Published: (2022)
by: Zhang, Yihan, et al.
Published: (2022)
Balancing Accuracy and Speed: A Multi-Fidelity Ensemble Kalman Filter with a Machine Learning Surrogate Model
by: van der Voort, Jeffrey, et al.
Published: (2025)
by: van der Voort, Jeffrey, et al.
Published: (2025)
Extended UCB Policies for Multi-armed Bandit Problems
by: Liu, Keqin, et al.
Published: (2011)
by: Liu, Keqin, et al.
Published: (2011)
Asymptotically Optimal Problem-Dependent Bandit Policies for Transfer Learning
by: Prevost, Adrien, et al.
Published: (2025)
by: Prevost, Adrien, et al.
Published: (2025)
Spectral Estimators for Multi-Index Models: Precise Asymptotics and Optimal Weak Recovery
by: Kovačević, Filip, et al.
Published: (2025)
by: Kovačević, Filip, et al.
Published: (2025)
Statistical Inference under Performativity
by: Li, Xiang, et al.
Published: (2025)
by: Li, Xiang, et al.
Published: (2025)
Local Asymptotic Normality for Multi-Armed Bandits
by: Akker, Ramon van den, et al.
Published: (2025)
by: Akker, Ramon van den, et al.
Published: (2025)
Sharp One-Dimensional Sub-Gaussian Comparison in Convex Order
by: Zhang, Yihan
Published: (2026)
by: Zhang, Yihan
Published: (2026)
Towards a Sharp Analysis of Offline Policy Learning for $f$-Divergence-Regularized Contextual Bandits
by: Zhao, Qingyue, et al.
Published: (2025)
by: Zhao, Qingyue, et al.
Published: (2025)
The Fragility of Optimized Bandit Algorithms
by: Fan, Lin, et al.
Published: (2021)
by: Fan, Lin, et al.
Published: (2021)
Batched Nonparametric Contextual Bandits
by: Jiang, Rong, et al.
Published: (2024)
by: Jiang, Rong, et al.
Published: (2024)
To bootstrap or to rollout? An optimal and adaptive interpolation
by: Mou, Wenlong, et al.
Published: (2024)
by: Mou, Wenlong, et al.
Published: (2024)
Worst-Case Optimal Multi-Armed Gaussian Best Arm Identification with a Fixed Budget
by: Kato, Masahiro
Published: (2023)
by: Kato, Masahiro
Published: (2023)
A Simple and Optimal Policy Design with Safety against Heavy-Tailed Risk for Stochastic Bandits
by: Simchi-Levi, David, et al.
Published: (2022)
by: Simchi-Levi, David, et al.
Published: (2022)
Adaptive Smooth Non-Stationary Bandits
by: Suk, Joe
Published: (2024)
by: Suk, Joe
Published: (2024)
Adversarial Surrogate Risk Bounds for Binary Classification
by: Frank, Natalie S.
Published: (2025)
by: Frank, Natalie S.
Published: (2025)
The Adversarial Consistency of Surrogate Risks for Binary Classification
by: Frank, Natalie, et al.
Published: (2023)
by: Frank, Natalie, et al.
Published: (2023)
Testing for Outliers with Conformal p-values
by: Bates, Stephen, et al.
Published: (2021)
by: Bates, Stephen, et al.
Published: (2021)
Continuous-time reinforcement learning: ellipticity enables model-free value function approximation
by: Mou, Wenlong
Published: (2026)
by: Mou, Wenlong
Published: (2026)
Testing the Feasibility of Linear Programs with Bandit Feedback
by: Gangrade, Aditya, et al.
Published: (2024)
by: Gangrade, Aditya, et al.
Published: (2024)
Truncated LinUCB for Stochastic Linear Bandits
by: Song, Yanglei, et al.
Published: (2022)
by: Song, Yanglei, et al.
Published: (2022)
Learning Spectral Methods by Transformers
by: He, Yihan, et al.
Published: (2025)
by: He, Yihan, et al.
Published: (2025)
Regret Distribution in Stochastic Bandits: Optimal Trade-off between Expectation and Tail Risk
by: Simchi-Levi, David, et al.
Published: (2023)
by: Simchi-Levi, David, et al.
Published: (2023)
Online Clustering of Data Sequences with Bandit Information
by: Chandran, G Dhinesh, et al.
Published: (2025)
by: Chandran, G Dhinesh, et al.
Published: (2025)
Similar Items
-
Near-Optimal Regret for KL-Regularized Multi-Armed Bandits
by: Ji, Kaixuan, et al.
Published: (2026) -
On the Optimal Sample Complexity of Offline Multi-Armed Bandits with KL Regularization
by: Ji, Kaixuan, et al.
Published: (2026) -
Identifying All ε-Best Arms in (Misspecified) Linear Bandits
by: Li, Zhekai, et al.
Published: (2025) -
Kernel $ε$-Greedy for Multi-Armed Bandits with Covariates
by: Arya, Sakshi, et al.
Published: (2023) -
Batched Single-Index Global Multi-Armed Bandits with Covariates
by: Arya, Sakshi, et al.
Published: (2025)