:: Library Catalog

Bibliographic Details
Main authors:	Halder, Budhaditya; Pan, Shubhayan; Khamaru, Koulik
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online access:	https://arxiv.org/abs/2505.23260

Similar Items

Stability and Robustness via Regularization: Bandit Inference via Regularized Stochastic Mirror Descent
by: Halder, Budhaditya, et al.
Published: (2026)

Avoiding the Price of Adaptivity: Inference in Linear Contextual Bandits via Stability
by: Praharaj, Samya, et al.
Published: (2025)

Stochastic Optimization with Constraints: A Non-asymptotic Instance-Dependent Analysis
by: Khamaru, Koulik
Published: (2024)

Inference with the Upper Confidence Bound Algorithm
by: Khamaru, Koulik, et al.
Published: (2024)

On Instability of Minimax Optimal Optimism-Based Bandit Algorithms
by: Praharaj, Samya, et al.
Published: (2025)

Efficient Inference after Directionally Stable Adaptive Experiments
by: Shen, Zikai, et al.
Published: (2026)

Bandit Simulation for Average Reward Inference
by: Praharaj, Samya, et al.
Published: (2026)

Semi-parametric inference based on adaptively collected data
by: Lin, Licong, et al.
Published: (2023)

UCB algorithms for multi-armed bandits: Precise regret and adaptive inference
by: Han, Qiyang, et al.
Published: (2024)

Uncertainty Quantification With Multiple Sources
by: Ying, Mufang, et al.
Published: (2024)

Design Stability in Adaptive Experiments: Implications for Treatment Effect Estimation
by: Sengupta, Saikat, et al.
Published: (2025)

Variance-Aware Feel-Good Thompson Sampling for Contextual Bandits
by: Li, Xuheng, et al.
Published: (2025)

Counterfactual Inference under Thompson Sampling
by: Jeunen, Olivier
Published: (2025)

A New Framework for Convex Clustering in Kernel Spaces: Finite Sample Bounds, Consistency and Performance Insights
by: Pan, Shubhayan, et al.
Published: (2025)

VITS : Variational Inference Thompson Sampling for contextual bandits
by: Clavier, Pierre, et al.
Published: (2023)

Quantum Reinforcement Learning in Non-Abelian Environments: Unveiling Novel Formulations and Quantum Advantage Exploration
by: Ghosal, Shubhayan
Published: (2024)

PICS: A sequential approach to obtain optimal designs for non-linear models leveraging closed-form solutions for faster convergence
by: Ghosh, Suvrojit, et al.
Published: (2024)

Distilled Thompson Sampling: Practical and Efficient Thompson Sampling via Imitation Learning
by: Namkoong, Hongseok, et al.
Published: (2020)

Optimism Stabilizes Thompson Sampling for Adaptive Inference
by: Yan, Shunxing, et al.
Published: (2026)

Variance-sensitive Thompson sampling for generalised linear bandits, revisited
by: Perneczky, Tom, et al.
Published: (2026)

Thompson Sampling in Function Spaces via Neural Operators
by: Oliveira, Rafael, et al.
Published: (2025)

Thompson Sampling via Fine-Tuning of LLMs
by: Menet, Nicolas, et al.
Published: (2025)

Gaussian Process Thompson Sampling via Rootfinding
by: Adebiyi, Taiwo A., et al.
Published: (2024)

Thompson Sampling for Repeated Newsvendor
by: Chen, Li, et al.
Published: (2025)

Constrained Linear Thompson Sampling
by: Gangrade, Aditya, et al.
Published: (2025)

Worst-Case Regret Bounds for Combinatorial Thompson Sampling in Sleeping Semi-Bandits
by: Huang, Zhiming, et al.
Published: (2026)

Regenerative Particle Thompson Sampling
by: Zhou, Zeyu, et al.
Published: (2022)

Contextual Thompson Sampling via Generation of Missing Data
by: Zhang, Kelly W., et al.
Published: (2025)

Adaptive Data Augmentation for Thompson Sampling
by: Kim, Wonyoung
Published: (2025)

A Broader View of Thompson Sampling
by: Qu, Yanlin, et al.
Published: (2025)

Graph Neural Thompson Sampling
by: Wu, Shuang, et al.
Published: (2024)

Rethinking Langevin Thompson Sampling from A Stochastic Approximation Perspective
by: Wang, Weixin, et al.
Published: (2025)

Fast, Precise Thompson Sampling for Bayesian Optimization
by: Sweet, David
Published: (2024)

Thompson Sampling in Partially Observable Contextual Bandits
by: Park, Hongju, et al.
Published: (2024)

Online Learning of Decision Trees with Thompson Sampling
by: Chaouki, Ayman, et al.
Published: (2024)

On Regret Bounds of Thompson Sampling for Bayesian Optimization
by: Takeno, Shion, et al.
Published: (2026)

MINTS: Minimalist Thompson Sampling
by: Wang, Kaizheng
Published: (2026)

Thompson Sampling For Combinatorial Bandits: Polynomial Regret and Mismatched Sampling Paradox
by: Zhang, Raymond, et al.
Published: (2024)

FedRTS: Federated Robust Pruning via Combinatorial Thompson Sampling
by: Huang, Hong, et al.
Published: (2025)

Thompson Sampling Itself is Differentially Private
by: Ou, Tingting, et al.
Published: (2024)