:: Library Catalog

Bibliographic Details
Main Authors:	Li, Yinan; Jun, Kwang-Sung
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2507.12584

Similar Items

Improved Offline Contextual Bandits with Second-Order Bounds: Betting and Freezing
by: Ryu, J. Jon, et al.
Published: (2025)

$\varepsilon$-Good Action Identification in Fixed-Budget Monte Carlo Tree Search
by: Li, Yinan, et al.
Published: (2026)

Improved Regret Bounds of (Multinomial) Logistic Bandits via Regret-to-Confidence-Set Conversion
by: Lee, Junghyun, et al.
Published: (2023)

Kullback-Leibler Maillard Sampling for Multi-armed Bandits with Bounded Rewards
by: Qin, Hao, et al.
Published: (2023)

HAVER: Instance-Dependent Error Bounds for Maximum Mean Estimation and Applications to Q-Learning and Monte Carlo Tree Search
by: Nguyen, Tuan Ngo, et al.
Published: (2024)

Noise-Augmented $\ell_0$ Regularization of Tensor Regression with Tucker Decomposition
by: Yan, Tian, et al.
Published: (2023)

Better-than-KL PAC-Bayes Bounds
by: Kuzborskij, Ilja, et al.
Published: (2024)

Minimum Empirical Divergence for Sub-Gaussian Linear Bandits
by: Balagopalan, Kapilan, et al.
Published: (2024)

Nearly Optimal Active Preference Learning and Its Application to LLM Alignment
by: Zhao, Yao, et al.
Published: (2026)

Noise-Adaptive Confidence Sets for Linear Bandits and Application to Bayesian Optimization
by: Jun, Kwang-Sung, et al.
Published: (2024)

Fixed Budget is No Harder Than Fixed Confidence in Best-Arm Identification up to Logarithmic Factors
by: Balagopalan, Kapilan, et al.
Published: (2026)

GL-LowPopArt: A Nearly Instance-Wise Minimax-Optimal Estimator for Generalized Low-Rank Trace Regression
by: Lee, Junghyun, et al.
Published: (2025)

I Bet You Did Not Mean That: Testing Semantic Importance via Betting
by: Teneggi, Jacopo, et al.
Published: (2024)

Online Detection of LLM-Generated Texts via Sequential Hypothesis Testing by Betting
by: Chen, Can, et al.
Published: (2024)

Efficient Low-Rank Matrix Estimation, Experimental Design, and Arm-Set-Dependent Low-Rank Bandits
by: Jang, Kyoungseok, et al.
Published: (2024)

Second Order Bounds for Contextual Bandits with Function Approximation
by: Pacchiano, Aldo
Published: (2024)

More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning
by: Wang, Kaiwen, et al.
Published: (2024)

A Unified Confidence Sequence for Generalized Linear Models, with Applications to Bandits
by: Lee, Junghyun, et al.
Published: (2024)

Optimistic Interior Point Methods for Sequential Hypothesis Testing by Betting
by: Chen, Can, et al.
Published: (2025)

HawkEye: Advancing Robust Regression with Bounded, Smooth, and Insensitive Loss Function
by: Akhtar, Mushir, et al.
Published: (2024)

Sample Compression Unleashed: New Generalization Bounds for Real Valued Losses
by: Bazinet, Mathieu, et al.
Published: (2024)

STaR-Bets: Sequential Target-Recalculating Bets for Tighter Confidence Intervals
by: Voráček, Václav, et al.
Published: (2025)

Achieving adaptivity and optimality for multi-armed bandits using Exponential-Kullback Leibler Maillard Sampling
by: Qin, Hao, et al.
Published: (2025)

Prediction via Shapley Value Regression
by: Alkhatib, Amr, et al.
Published: (2025)

Learning Explainable Dense Reward Shapes via Bayesian Optimization
by: Koo, Ryan, et al.
Published: (2025)

Model-based RL as a Minimalist Approach to Horizon-Free and Second-Order Bounds
by: Wang, Zhiyong, et al.
Published: (2024)

Adaptive Conformal Inference by Betting
by: Podkopaev, Aleksandr, et al.
Published: (2024)

Adaptive Experimentation When You Can't Experiment
by: Zhao, Yao, et al.
Published: (2024)

Fixing the Loose Brake: Exponential-Tailed Stopping Time in Best Arm Identification
by: Balagopalan, Kapilan, et al.
Published: (2024)

Coverage Improvement and Fast Convergence of On-policy Preference Learning
by: Kim, Juno, et al.
Published: (2026)

Mitigating Task-Order Sensitivity and Forgetting via Hierarchical Second-Order Consolidation
by: Nag, Protik, et al.
Published: (2026)

Transformers Learn to Achieve Second-Order Convergence Rates for In-Context Linear Regression
by: Fu, Deqing, et al.
Published: (2023)

Gradient Aligned Regression via Pairwise Losses
by: Zhu, Dixian, et al.
Published: (2024)

Order Optimal Bounds for One-Shot Federated Learning over non-Convex Loss Functions
by: Sharifnassab, Arsalan, et al.
Published: (2021)

From Betting to Empirical Bernstein LIL
by: Orabona, Francesco
Published: (2026)

Unleashing High-Quality Image Generation in Diffusion Sampling Using Second-Order Levenberg-Marquardt-Langevin
by: Wang, Fangyikang, et al.
Published: (2025)

Semi-Supervised Hypothesis Testing by Betting on Predictions
by: Tenzer, Yaniv, et al.
Published: (2026)

Risk Bounds For Distributional Regression
by: Padilla, Carlos Misael Madrid, et al.
Published: (2025)

Enhancing Differentially Private Linear Regression via Public Second-Moment
by: Cao, Zilong, et al.
Published: (2025)

Auditing Fairness by Betting
by: Chugg, Ben, et al.
Published: (2023)