:: Library Catalog

Bibliographic Details
Main Authors:	Ji, Wenlong; Pan, Yihan; Zhu, Ruihao; Lei, Lihua
Format:	Preprint
Published:	2025
Subjects:	Statistics Theory; Machine Learning
Online Access:	https://arxiv.org/abs/2506.16658

Similar Items

Near-Optimal Regret for KL-Regularized Multi-Armed Bandits
by: Ji, Kaixuan, et al.
Published: (2026)

On the Optimal Sample Complexity of Offline Multi-Armed Bandits with KL Regularization
by: Ji, Kaixuan, et al.
Published: (2026)

Identifying All ε-Best Arms in (Misspecified) Linear Bandits
by: Li, Zhekai, et al.
Published: (2025)

Kernel $ε$-Greedy for Multi-Armed Bandits with Covariates
by: Arya, Sakshi, et al.
Published: (2023)

Batched Single-Index Global Multi-Armed Bandits with Covariates
by: Arya, Sakshi, et al.
Published: (2025)

Transfer Learning for Contextual Multi-armed Bandits
by: Cai, Changxiao, et al.
Published: (2022)

Multi-Armed Sequential Hypothesis Testing by Betting
by: Sandoval, Ricardo J., et al.
Published: (2026)

On Lai's Upper Confidence Bound in Multi-Armed Bandits
by: Ren, Huachen, et al.
Published: (2024)

Optimal Batched Linear Bandits
by: Ren, Xuanfei, et al.
Published: (2024)

Locally Optimal Fixed-Budget Best Arm Identification in Two-Armed Gaussian Bandits with Unknown Variances
by: Kato, Masahiro
Published: (2023)

Model-Agnostic Covariate-Assisted Inference on Partially Identified Causal Effects
by: Ji, Wenlong, et al.
Published: (2023)

Spectral Estimators for Structured Generalized Linear Models via Approximate Message Passing
by: Zhang, Yihan, et al.
Published: (2023)

Optimal Estimation in Orthogonally Invariant Generalized Linear Models: Spectral Initialization and Approximate Message Passing
by: Zhang, Yihan, et al.
Published: (2026)

Predictions as Surrogates: Revisiting Surrogate Outcomes in the Age of AI
by: Ji, Wenlong, et al.
Published: (2025)

Design Experiments to Compare Multi-armed Bandit Algorithms
by: Meng, Huiling, et al.
Published: (2026)

Multitask Learning and Bandits via Robust Statistics
by: Xu, Kan, et al.
Published: (2021)

Precise Asymptotics for Spectral Methods in Mixed Generalized Linear Models
by: Zhang, Yihan, et al.
Published: (2022)

Balancing Accuracy and Speed: A Multi-Fidelity Ensemble Kalman Filter with a Machine Learning Surrogate Model
by: van der Voort, Jeffrey, et al.
Published: (2025)

Extended UCB Policies for Multi-armed Bandit Problems
by: Liu, Keqin, et al.
Published: (2011)

Asymptotically Optimal Problem-Dependent Bandit Policies for Transfer Learning
by: Prevost, Adrien, et al.
Published: (2025)

Spectral Estimators for Multi-Index Models: Precise Asymptotics and Optimal Weak Recovery
by: Kovačević, Filip, et al.
Published: (2025)

Statistical Inference under Performativity
by: Li, Xiang, et al.
Published: (2025)

Local Asymptotic Normality for Multi-Armed Bandits
by: Akker, Ramon van den, et al.
Published: (2025)

Sharp One-Dimensional Sub-Gaussian Comparison in Convex Order
by: Zhang, Yihan
Published: (2026)

Towards a Sharp Analysis of Offline Policy Learning for $f$-Divergence-Regularized Contextual Bandits
by: Zhao, Qingyue, et al.
Published: (2025)

The Fragility of Optimized Bandit Algorithms
by: Fan, Lin, et al.
Published: (2021)

Batched Nonparametric Contextual Bandits
by: Jiang, Rong, et al.
Published: (2024)

To bootstrap or to rollout? An optimal and adaptive interpolation
by: Mou, Wenlong, et al.
Published: (2024)

Worst-Case Optimal Multi-Armed Gaussian Best Arm Identification with a Fixed Budget
by: Kato, Masahiro
Published: (2023)

A Simple and Optimal Policy Design with Safety against Heavy-Tailed Risk for Stochastic Bandits
by: Simchi-Levi, David, et al.
Published: (2022)

Adaptive Smooth Non-Stationary Bandits
by: Suk, Joe
Published: (2024)

Adversarial Surrogate Risk Bounds for Binary Classification
by: Frank, Natalie S.
Published: (2025)

The Adversarial Consistency of Surrogate Risks for Binary Classification
by: Frank, Natalie, et al.
Published: (2023)

Testing for Outliers with Conformal p-values
by: Bates, Stephen, et al.
Published: (2021)

Continuous-time reinforcement learning: ellipticity enables model-free value function approximation
by: Mou, Wenlong
Published: (2026)

Testing the Feasibility of Linear Programs with Bandit Feedback
by: Gangrade, Aditya, et al.
Published: (2024)

Truncated LinUCB for Stochastic Linear Bandits
by: Song, Yanglei, et al.
Published: (2022)

Learning Spectral Methods by Transformers
by: He, Yihan, et al.
Published: (2025)

Regret Distribution in Stochastic Bandits: Optimal Trade-off between Expectation and Tail Risk
by: Simchi-Levi, David, et al.
Published: (2023)

Online Clustering of Data Sequences with Bandit Information
by: Chandran, G Dhinesh, et al.
Published: (2025)