Saved in:
| Main Authors: | Li, Yinan, Jun, Kwang-Sung |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2507.12584 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Improved Offline Contextual Bandits with Second-Order Bounds: Betting and Freezing
by: Ryu, J. Jon, et al.
Published: (2025)
by: Ryu, J. Jon, et al.
Published: (2025)
$\varepsilon$-Good Action Identification in Fixed-Budget Monte Carlo Tree Search
by: Li, Yinan, et al.
Published: (2026)
by: Li, Yinan, et al.
Published: (2026)
Improved Regret Bounds of (Multinomial) Logistic Bandits via Regret-to-Confidence-Set Conversion
by: Lee, Junghyun, et al.
Published: (2023)
by: Lee, Junghyun, et al.
Published: (2023)
Kullback-Leibler Maillard Sampling for Multi-armed Bandits with Bounded Rewards
by: Qin, Hao, et al.
Published: (2023)
by: Qin, Hao, et al.
Published: (2023)
HAVER: Instance-Dependent Error Bounds for Maximum Mean Estimation and Applications to Q-Learning and Monte Carlo Tree Search
by: Nguyen, Tuan Ngo, et al.
Published: (2024)
by: Nguyen, Tuan Ngo, et al.
Published: (2024)
Noise-Augmented $\ell_0$ Regularization of Tensor Regression with Tucker Decomposition
by: Yan, Tian, et al.
Published: (2023)
by: Yan, Tian, et al.
Published: (2023)
Better-than-KL PAC-Bayes Bounds
by: Kuzborskij, Ilja, et al.
Published: (2024)
by: Kuzborskij, Ilja, et al.
Published: (2024)
Minimum Empirical Divergence for Sub-Gaussian Linear Bandits
by: Balagopalan, Kapilan, et al.
Published: (2024)
by: Balagopalan, Kapilan, et al.
Published: (2024)
Nearly Optimal Active Preference Learning and Its Application to LLM Alignment
by: Zhao, Yao, et al.
Published: (2026)
by: Zhao, Yao, et al.
Published: (2026)
Noise-Adaptive Confidence Sets for Linear Bandits and Application to Bayesian Optimization
by: Jun, Kwang-Sung, et al.
Published: (2024)
by: Jun, Kwang-Sung, et al.
Published: (2024)
Fixed Budget is No Harder Than Fixed Confidence in Best-Arm Identification up to Logarithmic Factors
by: Balagopalan, Kapilan, et al.
Published: (2026)
by: Balagopalan, Kapilan, et al.
Published: (2026)
GL-LowPopArt: A Nearly Instance-Wise Minimax-Optimal Estimator for Generalized Low-Rank Trace Regression
by: Lee, Junghyun, et al.
Published: (2025)
by: Lee, Junghyun, et al.
Published: (2025)
I Bet You Did Not Mean That: Testing Semantic Importance via Betting
by: Teneggi, Jacopo, et al.
Published: (2024)
by: Teneggi, Jacopo, et al.
Published: (2024)
Online Detection of LLM-Generated Texts via Sequential Hypothesis Testing by Betting
by: Chen, Can, et al.
Published: (2024)
by: Chen, Can, et al.
Published: (2024)
Efficient Low-Rank Matrix Estimation, Experimental Design, and Arm-Set-Dependent Low-Rank Bandits
by: Jang, Kyoungseok, et al.
Published: (2024)
by: Jang, Kyoungseok, et al.
Published: (2024)
Second Order Bounds for Contextual Bandits with Function Approximation
by: Pacchiano, Aldo
Published: (2024)
by: Pacchiano, Aldo
Published: (2024)
More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning
by: Wang, Kaiwen, et al.
Published: (2024)
by: Wang, Kaiwen, et al.
Published: (2024)
A Unified Confidence Sequence for Generalized Linear Models, with Applications to Bandits
by: Lee, Junghyun, et al.
Published: (2024)
by: Lee, Junghyun, et al.
Published: (2024)
Optimistic Interior Point Methods for Sequential Hypothesis Testing by Betting
by: Chen, Can, et al.
Published: (2025)
by: Chen, Can, et al.
Published: (2025)
HawkEye: Advancing Robust Regression with Bounded, Smooth, and Insensitive Loss Function
by: Akhtar, Mushir, et al.
Published: (2024)
by: Akhtar, Mushir, et al.
Published: (2024)
Sample Compression Unleashed: New Generalization Bounds for Real Valued Losses
by: Bazinet, Mathieu, et al.
Published: (2024)
by: Bazinet, Mathieu, et al.
Published: (2024)
STaR-Bets: Sequential Target-Recalculating Bets for Tighter Confidence Intervals
by: Voráček, Václav, et al.
Published: (2025)
by: Voráček, Václav, et al.
Published: (2025)
Achieving adaptivity and optimality for multi-armed bandits using Exponential-Kullback Leibler Maillard Sampling
by: Qin, Hao, et al.
Published: (2025)
by: Qin, Hao, et al.
Published: (2025)
Prediction via Shapley Value Regression
by: Alkhatib, Amr, et al.
Published: (2025)
by: Alkhatib, Amr, et al.
Published: (2025)
Learning Explainable Dense Reward Shapes via Bayesian Optimization
by: Koo, Ryan, et al.
Published: (2025)
by: Koo, Ryan, et al.
Published: (2025)
Model-based RL as a Minimalist Approach to Horizon-Free and Second-Order Bounds
by: Wang, Zhiyong, et al.
Published: (2024)
by: Wang, Zhiyong, et al.
Published: (2024)
Adaptive Conformal Inference by Betting
by: Podkopaev, Aleksandr, et al.
Published: (2024)
by: Podkopaev, Aleksandr, et al.
Published: (2024)
Adaptive Experimentation When You Can't Experiment
by: Zhao, Yao, et al.
Published: (2024)
by: Zhao, Yao, et al.
Published: (2024)
Fixing the Loose Brake: Exponential-Tailed Stopping Time in Best Arm Identification
by: Balagopalan, Kapilan, et al.
Published: (2024)
by: Balagopalan, Kapilan, et al.
Published: (2024)
Coverage Improvement and Fast Convergence of On-policy Preference Learning
by: Kim, Juno, et al.
Published: (2026)
by: Kim, Juno, et al.
Published: (2026)
Mitigating Task-Order Sensitivity and Forgetting via Hierarchical Second-Order Consolidation
by: Nag, Protik, et al.
Published: (2026)
by: Nag, Protik, et al.
Published: (2026)
Transformers Learn to Achieve Second-Order Convergence Rates for In-Context Linear Regression
by: Fu, Deqing, et al.
Published: (2023)
by: Fu, Deqing, et al.
Published: (2023)
Gradient Aligned Regression via Pairwise Losses
by: Zhu, Dixian, et al.
Published: (2024)
by: Zhu, Dixian, et al.
Published: (2024)
Order Optimal Bounds for One-Shot Federated Learning over non-Convex Loss Functions
by: Sharifnassab, Arsalan, et al.
Published: (2021)
by: Sharifnassab, Arsalan, et al.
Published: (2021)
From Betting to Empirical Bernstein LIL
by: Orabona, Francesco
Published: (2026)
by: Orabona, Francesco
Published: (2026)
Unleashing High-Quality Image Generation in Diffusion Sampling Using Second-Order Levenberg-Marquardt-Langevin
by: Wang, Fangyikang, et al.
Published: (2025)
by: Wang, Fangyikang, et al.
Published: (2025)
Semi-Supervised Hypothesis Testing by Betting on Predictions
by: Tenzer, Yaniv, et al.
Published: (2026)
by: Tenzer, Yaniv, et al.
Published: (2026)
Risk Bounds For Distributional Regression
by: Padilla, Carlos Misael Madrid, et al.
Published: (2025)
by: Padilla, Carlos Misael Madrid, et al.
Published: (2025)
Enhancing Differentially Private Linear Regression via Public Second-Moment
by: Cao, Zilong, et al.
Published: (2025)
by: Cao, Zilong, et al.
Published: (2025)
Auditing Fairness by Betting
by: Chugg, Ben, et al.
Published: (2023)
by: Chugg, Ben, et al.
Published: (2023)
Similar Items
-
Improved Offline Contextual Bandits with Second-Order Bounds: Betting and Freezing
by: Ryu, J. Jon, et al.
Published: (2025) -
$\varepsilon$-Good Action Identification in Fixed-Budget Monte Carlo Tree Search
by: Li, Yinan, et al.
Published: (2026) -
Improved Regret Bounds of (Multinomial) Logistic Bandits via Regret-to-Confidence-Set Conversion
by: Lee, Junghyun, et al.
Published: (2023) -
Kullback-Leibler Maillard Sampling for Multi-armed Bandits with Bounded Rewards
by: Qin, Hao, et al.
Published: (2023) -
HAVER: Instance-Dependent Error Bounds for Maximum Mean Estimation and Applications to Q-Learning and Monte Carlo Tree Search
by: Nguyen, Tuan Ngo, et al.
Published: (2024)