Saved in:
| Main Authors: | Rastogi, Richa, Saito, Yuta, Joachims, Thorsten |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2503.17674 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Fairness in Ranking under Disparate Uncertainty
by: Rastogi, Richa, et al.
Published: (2023)
by: Rastogi, Richa, et al.
Published: (2023)
Prompt Optimization with Logged Bandit Data
by: Kiyohara, Haruka, et al.
Published: (2025)
by: Kiyohara, Haruka, et al.
Published: (2025)
POTEC: Off-Policy Learning for Large Action Spaces via Two-Stage Policy Decomposition
by: Saito, Yuta, et al.
Published: (2024)
by: Saito, Yuta, et al.
Published: (2024)
Thompson Sampling for Multi-Objective Linear Contextual Bandit
by: Park, Somangchan, et al.
Published: (2025)
by: Park, Somangchan, et al.
Published: (2025)
Offline Contextual Bandits in the Presence of New Actions
by: Kishimoto, Ren, et al.
Published: (2026)
by: Kishimoto, Ren, et al.
Published: (2026)
Effective Off-Policy Evaluation and Learning in Contextual Combinatorial Bandits
by: Shimizu, Tatsuhiro, et al.
Published: (2024)
by: Shimizu, Tatsuhiro, et al.
Published: (2024)
From Restless to Contextual: A Thresholding Bandit Reformulation For Finite-horizon Improvement
by: Xu, Jiamin, et al.
Published: (2025)
by: Xu, Jiamin, et al.
Published: (2025)
Language-Based User Profiles for Recommendation
by: Zhou, Joyce, et al.
Published: (2024)
by: Zhou, Joyce, et al.
Published: (2024)
Off-Policy Evaluation of Slate Bandit Policies via Optimizing Abstraction
by: Kiyohara, Haruka, et al.
Published: (2024)
by: Kiyohara, Haruka, et al.
Published: (2024)
End-to-end Training for Recommendation with Language-based User Profiles
by: Gao, Zhaolin, et al.
Published: (2024)
by: Gao, Zhaolin, et al.
Published: (2024)
Combinatorial Allocation Bandits with Nonlinear Arm Utility
by: Shibukawa, Yuki, et al.
Published: (2026)
by: Shibukawa, Yuki, et al.
Published: (2026)
Scaling Federated Linear Contextual Bandits via Sketching
by: Yang, Hantao, et al.
Published: (2026)
by: Yang, Hantao, et al.
Published: (2026)
Best Group Identification in Multi-Objective Bandits
by: Shahverdikondori, Mohammad, et al.
Published: (2025)
by: Shahverdikondori, Mohammad, et al.
Published: (2025)
Maximal Objectives in the Multi-armed Bandit with Applications
by: Ozbay, Eren, et al.
Published: (2020)
by: Ozbay, Eren, et al.
Published: (2020)
Transfer Learning for Contextual Multi-armed Bandits
by: Cai, Changxiao, et al.
Published: (2022)
by: Cai, Changxiao, et al.
Published: (2022)
Robust Multi-Objective Preference Alignment with Online DPO
by: Gupta, Raghav, et al.
Published: (2025)
by: Gupta, Raghav, et al.
Published: (2025)
Leveraging the Power of Conversations: Optimal Key Term Selection in Conversational Contextual Bandits
by: Liu, Maoli, et al.
Published: (2025)
by: Liu, Maoli, et al.
Published: (2025)
Locally Private Nonparametric Contextual Multi-armed Bandits
by: Ma, Yuheng, et al.
Published: (2025)
by: Ma, Yuheng, et al.
Published: (2025)
Ranking with Long-Term Constraints
by: Brantley, Kianté, et al.
Published: (2023)
by: Brantley, Kianté, et al.
Published: (2023)
Sparse Nonparametric Contextual Bandits
by: Flynn, Hamish, et al.
Published: (2025)
by: Flynn, Hamish, et al.
Published: (2025)
Statistical Inverse Problems in Hilbert Scales
by: Rastogi, Abhishake
Published: (2022)
by: Rastogi, Abhishake
Published: (2022)
$p1$: Better Prompt Optimization with Fewer Prompts
by: Gao, Zhaolin, et al.
Published: (2026)
by: Gao, Zhaolin, et al.
Published: (2026)
Communication-Corruption Coupling and Verification in Cooperative Multi-Objective Bandits
by: Shi, Ming
Published: (2026)
by: Shi, Ming
Published: (2026)
Blessings of Multiple Good Arms in Multi-Objective Linear Bandits
by: Ann, Heesang, et al.
Published: (2026)
by: Ann, Heesang, et al.
Published: (2026)
Multi-User Contextual Cascading Bandits for Personalized Recommendation
by: Park, Jiho, et al.
Published: (2025)
by: Park, Jiho, et al.
Published: (2025)
Impatient Bandits: Optimizing for the Long-Term Without Delay
by: Zhang, Kelly W., et al.
Published: (2025)
by: Zhang, Kelly W., et al.
Published: (2025)
Efficient Algorithms for Logistic Contextual Slate Bandits with Bandit Feedback
by: Goyal, Tanmay, et al.
Published: (2025)
by: Goyal, Tanmay, et al.
Published: (2025)
Linear Contextual Bandits with Interference
by: Xu, Yang, et al.
Published: (2024)
by: Xu, Yang, et al.
Published: (2024)
Multi-Objective Multi-Agent Bandits: From Learning Efficiency to Fairness Optimization
by: Wang, John, et al.
Published: (2026)
by: Wang, John, et al.
Published: (2026)
Adaptive Endpointing with Deep Contextual Multi-armed Bandits
by: Min, Do June, et al.
Published: (2023)
by: Min, Do June, et al.
Published: (2023)
Contextual Linear Bandits with Delay as Payoff
by: Zhang, Mengxiao, et al.
Published: (2025)
by: Zhang, Mengxiao, et al.
Published: (2025)
Group-Sensitive Offline Contextual Bandits
by: Guo, Yihong, et al.
Published: (2025)
by: Guo, Yihong, et al.
Published: (2025)
Multiplayer Information Asymmetric Contextual Bandits
by: Chang, William, et al.
Published: (2025)
by: Chang, William, et al.
Published: (2025)
Differentially Private Kernelized Contextual Bandits
by: Pavlovic, Nikola, et al.
Published: (2025)
by: Pavlovic, Nikola, et al.
Published: (2025)
Contextual Bandits for Unbounded Context Distributions
by: Zhao, Puning, et al.
Published: (2024)
by: Zhao, Puning, et al.
Published: (2024)
Neural Exploitation and Exploration of Contextual Bandits
by: Ban, Yikun, et al.
Published: (2023)
by: Ban, Yikun, et al.
Published: (2023)
Uncertainty of Joint Neural Contextual Bandit
by: Guo, Hongbo, et al.
Published: (2024)
by: Guo, Hongbo, et al.
Published: (2024)
Contextual Bandits with Stage-wise Constraints
by: Pacchiano, Aldo, et al.
Published: (2024)
by: Pacchiano, Aldo, et al.
Published: (2024)
Constrained Contextual Bandits with Adversarial Contexts
by: Sarkar, Dhruv, et al.
Published: (2026)
by: Sarkar, Dhruv, et al.
Published: (2026)
Designing an Interpretable Interface for Contextual Bandits
by: Maher, Andrew, et al.
Published: (2024)
by: Maher, Andrew, et al.
Published: (2024)
Similar Items
-
Fairness in Ranking under Disparate Uncertainty
by: Rastogi, Richa, et al.
Published: (2023) -
Prompt Optimization with Logged Bandit Data
by: Kiyohara, Haruka, et al.
Published: (2025) -
POTEC: Off-Policy Learning for Large Action Spaces via Two-Stage Policy Decomposition
by: Saito, Yuta, et al.
Published: (2024) -
Thompson Sampling for Multi-Objective Linear Contextual Bandit
by: Park, Somangchan, et al.
Published: (2025) -
Offline Contextual Bandits in the Presence of New Actions
by: Kishimoto, Ren, et al.
Published: (2026)