Saved in:
| Main Authors: | Li, Yexin, Mu, Zhancun, Qi, Siyuan |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2407.00567 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Contextual Combinatorial Bandits with Probabilistically Triggered Arms
by: Liu, Xutong, et al.
Published: (2023)
by: Liu, Xutong, et al.
Published: (2023)
Effective Off-Policy Evaluation and Learning in Contextual Combinatorial Bandits
by: Shimizu, Tatsuhiro, et al.
Published: (2024)
by: Shimizu, Tatsuhiro, et al.
Published: (2024)
Preserve Support, Not Correspondence: Dynamic Routing for Offline Reinforcement Learning
by: Mu, Zhancun, et al.
Published: (2026)
by: Mu, Zhancun, et al.
Published: (2026)
From Contextual Combinatorial Semi-Bandits to Bandit List Classification: Improved Sample Complexity with Sparse Rewards
by: Erez, Liad, et al.
Published: (2025)
by: Erez, Liad, et al.
Published: (2025)
CAE: Repurposing the Critic as an Explorer in Deep Reinforcement Learning
by: Li, Yexin
Published: (2025)
by: Li, Yexin
Published: (2025)
ROCKET-2: Steering Visuomotor Policy via Cross-View Goal Alignment
by: Cai, Shaofei, et al.
Published: (2025)
by: Cai, Shaofei, et al.
Published: (2025)
Tree Ensembles for Contextual Bandits
by: Nilsson, Hannes, et al.
Published: (2024)
by: Nilsson, Hannes, et al.
Published: (2024)
Federated Linear Contextual Bandits with Heterogeneous Clients
by: Blaser, Ethan, et al.
Published: (2024)
by: Blaser, Ethan, et al.
Published: (2024)
Neural Combinatorial Clustered Bandits for Recommendation Systems
by: Atalar, Baran, et al.
Published: (2024)
by: Atalar, Baran, et al.
Published: (2024)
Bayesian Analysis of Combinatorial Gaussian Process Bandits
by: Sandberg, Jack, et al.
Published: (2023)
by: Sandberg, Jack, et al.
Published: (2023)
Causal Contextual Bandits with Adaptive Context
by: Madhavan, Rahul, et al.
Published: (2024)
by: Madhavan, Rahul, et al.
Published: (2024)
Diffusion Models Meet Contextual Bandits
by: Aouali, Imad
Published: (2024)
by: Aouali, Imad
Published: (2024)
Learning When to Trust in Contextual Bandits
by: Ghasemi, Majid, et al.
Published: (2026)
by: Ghasemi, Majid, et al.
Published: (2026)
Conservative Contextual Bandits: Beyond Linear Representations
by: Deb, Rohan, et al.
Published: (2024)
by: Deb, Rohan, et al.
Published: (2024)
Linear Contextual Bandits with Hybrid Payoff: Revisited
by: Das, Nirjhar, et al.
Published: (2024)
by: Das, Nirjhar, et al.
Published: (2024)
The Sample Complexity of Multiclass and Sparse Contextual Bandits
by: Erez, Liad, et al.
Published: (2026)
by: Erez, Liad, et al.
Published: (2026)
DeFlow: Decoupling Manifold Modeling and Value Maximization for Offline Policy Extraction
by: Mu, Zhancun
Published: (2026)
by: Mu, Zhancun
Published: (2026)
Adaptive Budget Optimization for Multichannel Advertising Using Combinatorial Bandits
by: Gangopadhyay, Briti, et al.
Published: (2025)
by: Gangopadhyay, Briti, et al.
Published: (2025)
Leveraging Offline Data in Linear Latent Contextual Bandits
by: Kausik, Chinmaya, et al.
Published: (2024)
by: Kausik, Chinmaya, et al.
Published: (2024)
Second Order Bounds for Contextual Bandits with Function Approximation
by: Pacchiano, Aldo
Published: (2024)
by: Pacchiano, Aldo
Published: (2024)
Variance-Dependent Regret Lower Bounds for Contextual Bandits
by: He, Jiafan, et al.
Published: (2025)
by: He, Jiafan, et al.
Published: (2025)
Contextual Rollout Bandits for Reinforcement Learning with Verifiable Rewards
by: Lu, Xiaodong, et al.
Published: (2026)
by: Lu, Xiaodong, et al.
Published: (2026)
A Comprehensive Survey of Mixture-of-Experts: Algorithms, Theory, and Applications
by: Mu, Siyuan, et al.
Published: (2025)
by: Mu, Siyuan, et al.
Published: (2025)
Online Prompt Pricing based on Combinatorial Multi-Armed Bandit and Hierarchical Stackelberg Game
by: Li, Meiling, et al.
Published: (2024)
by: Li, Meiling, et al.
Published: (2024)
Leveraging the Power of Conversations: Optimal Key Term Selection in Conversational Contextual Bandits
by: Liu, Maoli, et al.
Published: (2025)
by: Liu, Maoli, et al.
Published: (2025)
Federated Contextual Cascading Bandits with Asynchronous Communication and Heterogeneous Users
by: Yang, Hantao, et al.
Published: (2024)
by: Yang, Hantao, et al.
Published: (2024)
Provable Anytime Ensemble Sampling Algorithms in Nonlinear Contextual Bandits
by: Sun, Jiazheng, et al.
Published: (2025)
by: Sun, Jiazheng, et al.
Published: (2025)
COBRA: Contextual Bandit Algorithm for Ensuring Truthful Strategic Agents
by: Verma, Arun, et al.
Published: (2025)
by: Verma, Arun, et al.
Published: (2025)
Calibration-Gated LLM Pseudo-Observations for Online Contextual Bandits
by: Pershin, Maksim, et al.
Published: (2026)
by: Pershin, Maksim, et al.
Published: (2026)
Demystifying Design Choices of Reinforcement Fine-tuning: A Batched Contextual Bandit Learning Perspective
by: Xie, Hong, et al.
Published: (2026)
by: Xie, Hong, et al.
Published: (2026)
Contextual Linear Bandits under Noisy Features: Towards Bayesian Oracles
by: Kim, Jung-hun, et al.
Published: (2017)
by: Kim, Jung-hun, et al.
Published: (2017)
AdaptEx: A Self-Service Contextual Bandit Platform
by: Black, William, et al.
Published: (2023)
by: Black, William, et al.
Published: (2023)
Contextual Bandit with Herding Effects: Algorithms and Recommendation Applications
by: Xu, Luyue, et al.
Published: (2024)
by: Xu, Luyue, et al.
Published: (2024)
Contextual Budget Bandit for Food Rescue Volunteer Engagement
by: Tang, Ariana, et al.
Published: (2025)
by: Tang, Ariana, et al.
Published: (2025)
Bi-Level Contextual Bandits for Individualized Resource Allocation under Delayed Feedback
by: Almasi, Mohammadsina, et al.
Published: (2025)
by: Almasi, Mohammadsina, et al.
Published: (2025)
Adaptive Action Duration with Contextual Bandits for Deep Reinforcement Learning in Dynamic Environments
by: Verma, Abhishek, et al.
Published: (2025)
by: Verma, Abhishek, et al.
Published: (2025)
Dynamic Trust Calibration Using Contextual Bandits
by: Henrique, Bruno M., et al.
Published: (2025)
by: Henrique, Bruno M., et al.
Published: (2025)
Combinatorial Multi-armed Bandits: Arm Selection via Group Testing
by: Mukherjee, Arpan, et al.
Published: (2024)
by: Mukherjee, Arpan, et al.
Published: (2024)
CoCoB: Adaptive Collaborative Combinatorial Bandits for Online Recommendation
by: Yan, Cairong, et al.
Published: (2025)
by: Yan, Cairong, et al.
Published: (2025)
Federated Combinatorial Multi-Agent Multi-Armed Bandits
by: Fourati, Fares, et al.
Published: (2024)
by: Fourati, Fares, et al.
Published: (2024)
Similar Items
-
Contextual Combinatorial Bandits with Probabilistically Triggered Arms
by: Liu, Xutong, et al.
Published: (2023) -
Effective Off-Policy Evaluation and Learning in Contextual Combinatorial Bandits
by: Shimizu, Tatsuhiro, et al.
Published: (2024) -
Preserve Support, Not Correspondence: Dynamic Routing for Offline Reinforcement Learning
by: Mu, Zhancun, et al.
Published: (2026) -
From Contextual Combinatorial Semi-Bandits to Bandit List Classification: Improved Sample Complexity with Sparse Rewards
by: Erez, Liad, et al.
Published: (2025) -
CAE: Repurposing the Critic as an Explorer in Deep Reinforcement Learning
by: Li, Yexin
Published: (2025)