| Main Authors: | Ji, Jingwei, Xu, Renyuan, Zhu, Ruihao |
|---|---|
| Format: | Preprint |
| Published: | 2022 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2208.02389 |
Similar Items
Multi-Task Dynamic Pricing in Credit Market with Contextual Information
by: Javanmard, Adel, et al.
Published: (2024)
Efficient and Interpretable Bandit Algorithms
by: Mukherjee, Subhojyoti, et al.
Published: (2023)
Multi-Armed Bandits With Machine Learning-Generated Surrogate Rewards
by: Ji, Wenlong, et al.
Published: (2025)
Identifying All ε-Best Arms in (Misspecified) Linear Bandits
by: Li, Zhekai, et al.
Published: (2025)
Satisficing Regret Minimization in Bandits: Constant Rate and Light-Tailed Distribution
by: Feng, Qing, et al.
Published: (2024)
Risk-sensitive Markov Decision Process and Learning under General Utility Functions
by: Wu, Zhengqi, et al.
Published: (2023)
Fast Policy Learning for Linear Quadratic Control with Entropy Regularization
by: Guo, Xin, et al.
Published: (2023)
Policy Gradient Converges to the Globally Optimal Policy for Nearly Linear-Quadratic Regulators
by: Han, Yinbin, et al.
Published: (2023)
Risk-Aware Decision Making in Restless Bandits: Theory and Algorithms for Planning and Learning
by: Akbarzadeh, Nima, et al.
Published: (2024)
Linear Contextual Bandits with Interference
by: Xu, Yang, et al.
Published: (2024)
Pessimistic Risk-Aware Policy Learning in Contextual Bandits
by: Wan, Yilong, et al.
Published: (2026)
Direction-Aware Offline-to-Online Learning in Linear Contextual Bandits
by: Han, Zean, et al.
Published: (2026)
Latent Order Bandits
by: Carlsson, Emil, et al.
Published: (2026)
Optimal Batched Linear Bandits
by: Ren, Xuanfei, et al.
Published: (2024)
Prior Diffusiveness and Regret in the Linear-Gaussian Bandit
by: Zhu, Yifan, et al.
Published: (2026)
Variance-Aware Linear UCB with Deep Representation for Neural Contextual Bandits
by: Bui, Ha Manh, et al.
Published: (2024)
Geometry-Aware Approaches for Balancing Performance and Theoretical Guarantees in Linear Bandits
by: Luo, Yuwei, et al.
Published: (2023)
Risk-Aware Continuous Control with Neural Contextual Bandits
by: Ayala-Romero, Jose A., et al.
Published: (2023)
Neural Network-Based Score Estimation in Diffusion Models: Optimization and Generalization
by: Han, Yinbin, et al.
Published: (2024)
Adversarial Bandits with Multi-User Delayed Feedback: Theory and Application
by: Li, Yandi, et al.
Published: (2023)
A Unified Confidence Sequence for Generalized Linear Models, with Applications to Bandits
by: Lee, Junghyun, et al.
Published: (2024)
Noise-Adaptive Confidence Sets for Linear Bandits and Application to Bayesian Optimization
by: Jun, Kwang-Sung, et al.
Published: (2024)
Differentially Private Linear Bandits with Partial Distributed Feedback
by: Li, Fengjiao, et al.
Published: (2022)
Strategic Linear Contextual Bandits
by: Buening, Thomas Kleine, et al.
Published: (2024)
HR-Bandit: Human-AI Collaborated Linear Recourse Bandit
by: Cao, Junyu, et al.
Published: (2024)
Bayesian Bandit Algorithms with Approximate Inference in Stochastic Linear Bandits
by: Huang, Ziyi, et al.
Published: (2024)
Infrequent Exploration in Linear Bandits
by: Lee, Harin, et al.
Published: (2025)
Optimal Thresholding Linear Bandit
by: Rivera, Eduardo Ochoa, et al.
Published: (2024)
Federated Linear Dueling Bandits
by: Huang, Xuhan, et al.
Published: (2025)
Restless Linear Bandits
by: Khaleghi, Azadeh
Published: (2024)
Latency-Aware Contextual Bandit: Application to Cryo-EM Data Collection
by: Wei, Lai, et al.
Published: (2024)
On the Power of Adaptivity for $\varepsilon$-Best Arm Identification in Linear Bandits
by: Maiti, Arnab, et al.
Published: (2026)
Stochastic Control for Fine-tuning Diffusion Models: Optimality, Regularity, and Convergence
by: Han, Yinbin, et al.
Published: (2024)
Contextual Online Pricing with (Biased) Offline Data
by: Zhang, Yixuan, et al.
Published: (2025)
On the Peril of (Even a Little) Nonstationarity in Satisficing Regret Minimization
by: Zhang, Yixuan, et al.
Published: (2026)
Generalized Linear Bandits with Limited Adaptivity
by: Sawarni, Ayush, et al.
Published: (2024)
Symmetric Linear Bandits with Hidden Symmetry
by: Tran, Nam Phuong, et al.
Published: (2024)
Pure Exploration in Bandits with Linear Constraints
by: Carlsson, Emil, et al.
Published: (2023)
Directional Optimism for Safe Linear Bandits
by: Hutchinson, Spencer, et al.
Published: (2023)
Linear Bandits with Partially Observable Features
by: Kim, Wonyoung, et al.
Published: (2025)