Guardado en:
| Autores principales: | Zhou, Zeyu, Hajek, Bruce, Choi, Nakjung, Walid, Anwar |
|---|---|
| Formato: | Preprint |
| Publicado: |
2022
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2203.08082 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
Is Thompson Sampling Susceptible to Algorithmic Collusion?
por: Xiong, Yi, et al.
Publicado: (2024)
por: Xiong, Yi, et al.
Publicado: (2024)
Learning-augmented Online Algorithm for Two-level Ski-rental Problem
por: Zhang, Keyuan, et al.
Publicado: (2024)
por: Zhang, Keyuan, et al.
Publicado: (2024)
Optimistic Thompson Sampling for No-Regret Learning in Unknown Games
por: Li, Yingru, et al.
Publicado: (2024)
por: Li, Yingru, et al.
Publicado: (2024)
Thompson Sampling for Repeated Newsvendor
por: Chen, Li, et al.
Publicado: (2025)
por: Chen, Li, et al.
Publicado: (2025)
Constrained Linear Thompson Sampling
por: Gangrade, Aditya, et al.
Publicado: (2025)
por: Gangrade, Aditya, et al.
Publicado: (2025)
Distilled Thompson Sampling: Practical and Efficient Thompson Sampling via Imitation Learning
por: Namkoong, Hongseok, et al.
Publicado: (2020)
por: Namkoong, Hongseok, et al.
Publicado: (2020)
Deep Reinforcement Learning for Automated Stock Trading: An Ensemble Strategy
por: Yang, Hongyang, et al.
Publicado: (2025)
por: Yang, Hongyang, et al.
Publicado: (2025)
Adaptive Data Augmentation for Thompson Sampling
por: Kim, Wonyoung
Publicado: (2025)
por: Kim, Wonyoung
Publicado: (2025)
A Broader View of Thompson Sampling
por: Qu, Yanlin, et al.
Publicado: (2025)
por: Qu, Yanlin, et al.
Publicado: (2025)
Graph Neural Thompson Sampling
por: Wu, Shuang, et al.
Publicado: (2024)
por: Wu, Shuang, et al.
Publicado: (2024)
FinGPT-HPC: Efficient Pretraining and Finetuning Large Language Models for Financial Applications with High-Performance Computing
por: Liu, Xiao-Yang, et al.
Publicado: (2024)
por: Liu, Xiao-Yang, et al.
Publicado: (2024)
Fast, Precise Thompson Sampling for Bayesian Optimization
por: Sweet, David
Publicado: (2024)
por: Sweet, David
Publicado: (2024)
Thompson Sampling in Partially Observable Contextual Bandits
por: Park, Hongju, et al.
Publicado: (2024)
por: Park, Hongju, et al.
Publicado: (2024)
Online Learning of Decision Trees with Thompson Sampling
por: Chaouki, Ayman, et al.
Publicado: (2024)
por: Chaouki, Ayman, et al.
Publicado: (2024)
On Regret Bounds of Thompson Sampling for Bayesian Optimization
por: Takeno, Shion, et al.
Publicado: (2026)
por: Takeno, Shion, et al.
Publicado: (2026)
Gated Graph Attention Networks for Predicting Duration of Large Scale Power Outages Induced by Natural Disasters
por: Duan, Chenghao, et al.
Publicado: (2026)
por: Duan, Chenghao, et al.
Publicado: (2026)
MINTS: Minimalist Thompson Sampling
por: Wang, Kaizheng
Publicado: (2026)
por: Wang, Kaizheng
Publicado: (2026)
Diffusion Models for Solving Inverse Problems via Posterior Sampling with Piecewise Guidance
por: Mohseni-Sehdeh, Saeed, et al.
Publicado: (2025)
por: Mohseni-Sehdeh, Saeed, et al.
Publicado: (2025)
TS-Insight: Visualizing Thompson Sampling for Verification and XAI
por: Vares, Parsa, et al.
Publicado: (2025)
por: Vares, Parsa, et al.
Publicado: (2025)
Thompson Sampling For Combinatorial Bandits: Polynomial Regret and Mismatched Sampling Paradox
por: Zhang, Raymond, et al.
Publicado: (2024)
por: Zhang, Raymond, et al.
Publicado: (2024)
Reinterpreting 'the Company a Word Keeps': Towards Explainable and Ontologically Grounded Language Models
por: Saba, Walid S.
Publicado: (2024)
por: Saba, Walid S.
Publicado: (2024)
Thompson Sampling Itself is Differentially Private
por: Ou, Tingting, et al.
Publicado: (2024)
por: Ou, Tingting, et al.
Publicado: (2024)
Counterfactual Inference under Thompson Sampling
por: Jeunen, Olivier
Publicado: (2025)
por: Jeunen, Olivier
Publicado: (2025)
Learning from Random Demonstrations: Offline Reinforcement Learning with Importance-Sampled Diffusion Models
por: Fang, Zeyu, et al.
Publicado: (2024)
por: Fang, Zeyu, et al.
Publicado: (2024)
Thompson Sampling for Multi-Objective Linear Contextual Bandit
por: Park, Somangchan, et al.
Publicado: (2025)
por: Park, Somangchan, et al.
Publicado: (2025)
An Information-Theoretic Analysis of Thompson Sampling for Logistic Bandits
por: Gouverneur, Amaury, et al.
Publicado: (2024)
por: Gouverneur, Amaury, et al.
Publicado: (2024)
On Thompson Sampling and Bilateral Uncertainty in Additive Bayesian Optimization
por: Wycoff, Nathan
Publicado: (2025)
por: Wycoff, Nathan
Publicado: (2025)
Thompson Sampling in Online RLHF with General Function Approximation
por: Feng, Songtao, et al.
Publicado: (2025)
por: Feng, Songtao, et al.
Publicado: (2025)
Thompson Sampling-like Algorithms for Stochastic Rising Bandits
por: Fiandri, Marco, et al.
Publicado: (2025)
por: Fiandri, Marco, et al.
Publicado: (2025)
Sliding-Window Thompson Sampling for Non-Stationary Settings
por: Fiandri, Marco, et al.
Publicado: (2024)
por: Fiandri, Marco, et al.
Publicado: (2024)
Stochastically Constrained Best Arm Identification with Thompson Sampling
por: Yang, Le, et al.
Publicado: (2025)
por: Yang, Le, et al.
Publicado: (2025)
VITS : Variational Inference Thompson Sampling for contextual bandits
por: Clavier, Pierre, et al.
Publicado: (2023)
por: Clavier, Pierre, et al.
Publicado: (2023)
Thompson Sampling in Function Spaces via Neural Operators
por: Oliveira, Rafael, et al.
Publicado: (2025)
por: Oliveira, Rafael, et al.
Publicado: (2025)
Prior-Aligned Meta-RL: Thompson Sampling with Learned Priors and Guarantees in Finite-Horizon MDPs
por: Zhou, Runlin, et al.
Publicado: (2025)
por: Zhou, Runlin, et al.
Publicado: (2025)
Thompson Sampling via Fine-Tuning of LLMs
por: Menet, Nicolas, et al.
Publicado: (2025)
por: Menet, Nicolas, et al.
Publicado: (2025)
Gaussian Process Thompson Sampling via Rootfinding
por: Adebiyi, Taiwo A., et al.
Publicado: (2024)
por: Adebiyi, Taiwo A., et al.
Publicado: (2024)
Epsilon-Greedy Thompson Sampling to Bayesian Optimization
por: Do, Bach, et al.
Publicado: (2024)
por: Do, Bach, et al.
Publicado: (2024)
Sample-Mean Anchored Thompson Sampling for Offline-to-Online Learning with Distribution Shift
por: Li, Bochao, et al.
Publicado: (2026)
por: Li, Bochao, et al.
Publicado: (2026)
BFTS: Thompson Sampling with Bayesian Additive Regression Trees
por: Deng, Ruizhe, et al.
Publicado: (2026)
por: Deng, Ruizhe, et al.
Publicado: (2026)
Robust Thompson Sampling Algorithms Against Reward Poisoning Attacks
por: Xu, Yinglun, et al.
Publicado: (2024)
por: Xu, Yinglun, et al.
Publicado: (2024)
Ejemplares similares
-
Is Thompson Sampling Susceptible to Algorithmic Collusion?
por: Xiong, Yi, et al.
Publicado: (2024) -
Learning-augmented Online Algorithm for Two-level Ski-rental Problem
por: Zhang, Keyuan, et al.
Publicado: (2024) -
Optimistic Thompson Sampling for No-Regret Learning in Unknown Games
por: Li, Yingru, et al.
Publicado: (2024) -
Thompson Sampling for Repeated Newsvendor
por: Chen, Li, et al.
Publicado: (2025) -
Constrained Linear Thompson Sampling
por: Gangrade, Aditya, et al.
Publicado: (2025)