Saved in:
| Main Authors: | Shen, Owen, Jaillet, Patrick |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.02283 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
A Single-Sample Polylogarithmic Regret Bound for Nonstationary Online Linear Programming
by: Xu, Haoran, et al.
Published: (2026)
by: Xu, Haoran, et al.
Published: (2026)
Distribution-Dependent Rates for Multi-Distribution Learning
by: Hanashiro, Rafael, et al.
Published: (2023)
by: Hanashiro, Rafael, et al.
Published: (2023)
Multi-Timescale Primal Dual Hybrid Gradient with Application to Distributed Optimization
by: Zhang, Junhui, et al.
Published: (2025)
by: Zhang, Junhui, et al.
Published: (2025)
Grace Period is All You Need: Individual Fairness without Revenue Loss in Revenue Management
by: Jaillet, Patrick, et al.
Published: (2024)
by: Jaillet, Patrick, et al.
Published: (2024)
Is Multi-Distribution Learning as Easy as PAC Learning: Sharp Rates with Bounded Label Noise
by: Hanashiro, Rafael, et al.
Published: (2026)
by: Hanashiro, Rafael, et al.
Published: (2026)
Neural Dueling Bandits: Preference-Based Optimization with Human Feedback
by: Verma, Arun, et al.
Published: (2024)
by: Verma, Arun, et al.
Published: (2024)
Reinforcement Learning for Intensity Control: An Application to Choice-Based Network Revenue Management
by: Meng, Huiling, et al.
Published: (2024)
by: Meng, Huiling, et al.
Published: (2024)
Prompt Optimization with Human Feedback
by: Lin, Xiaoqiang, et al.
Published: (2024)
by: Lin, Xiaoqiang, et al.
Published: (2024)
Incentive-Aware Dynamic Resource Allocation under Long-Term Cost Constraints
by: Dai, Yan, et al.
Published: (2025)
by: Dai, Yan, et al.
Published: (2025)
Online Resource Allocation with Convex-set Machine-Learned Advice
by: Golrezaei, Negin, et al.
Published: (2023)
by: Golrezaei, Negin, et al.
Published: (2023)
Dynamic Retail Pricing via Q-Learning -- A Reinforcement Learning Framework for Enhanced Revenue Management
by: Apte, Mohit, et al.
Published: (2024)
by: Apte, Mohit, et al.
Published: (2024)
Learning with Exact Invariances in Polynomial Time
by: Soleymani, Ashkan, et al.
Published: (2025)
by: Soleymani, Ashkan, et al.
Published: (2025)
A Universal Class of Sharpness-Aware Minimization Algorithms
by: Tahmasebi, Behrooz, et al.
Published: (2024)
by: Tahmasebi, Behrooz, et al.
Published: (2024)
Double Machine Learning Based Structure Identification from Temporal Data
by: Angelis, Emmanouil, et al.
Published: (2023)
by: Angelis, Emmanouil, et al.
Published: (2023)
Data-Driven Revenue Management for Air Cargo
by: Eren, Ezgi, et al.
Published: (2024)
by: Eren, Ezgi, et al.
Published: (2024)
Budgeted Recommendation with Delayed Feedback
by: Liu, Kweiguu, et al.
Published: (2024)
by: Liu, Kweiguu, et al.
Published: (2024)
Demand Balancing in Primal-Dual Optimization for Blind Network Revenue Management
by: Miao, Sentao, et al.
Published: (2024)
by: Miao, Sentao, et al.
Published: (2024)
Learning with Posterior Sampling for Revenue Management under Time-varying Demand
by: Shimizu, Kazuma, et al.
Published: (2024)
by: Shimizu, Kazuma, et al.
Published: (2024)
DelayPTC-LLM: Metro Passenger Travel Choice Prediction under Train Delays with Large Language Models
by: Chen, Chen, et al.
Published: (2024)
by: Chen, Chen, et al.
Published: (2024)
Delayed Feedback Modeling with Influence Functions
by: Ding, Chenlu, et al.
Published: (2025)
by: Ding, Chenlu, et al.
Published: (2025)
Degeneracy is OK: Logarithmic Regret for Network Revenue Management with Indiscrete Distributions
by: Jiang, Jiashuo, et al.
Published: (2022)
by: Jiang, Jiashuo, et al.
Published: (2022)
Lipschitz Bandits with Stochastic Delayed Feedback
by: Liu, Zhongxuan, et al.
Published: (2025)
by: Liu, Zhongxuan, et al.
Published: (2025)
Modeling Attention during Dimensional Shifts with Counterfactual and Delayed Feedback
by: Malloy, Tailia, et al.
Published: (2025)
by: Malloy, Tailia, et al.
Published: (2025)
Movie Revenue Prediction using Machine Learning Models
by: Udandarao, Vikranth, et al.
Published: (2024)
by: Udandarao, Vikranth, et al.
Published: (2024)
Online Scheduling for LLM Inference with KV Cache Constraints
by: Jaillet, Patrick, et al.
Published: (2025)
by: Jaillet, Patrick, et al.
Published: (2025)
Bandit and Delayed Feedback in Online Structured Prediction
by: Shibukawa, Yuki, et al.
Published: (2025)
by: Shibukawa, Yuki, et al.
Published: (2025)
Biased Dueling Bandits with Stochastic Delayed Feedback
by: Yi, Bongsoo, et al.
Published: (2024)
by: Yi, Bongsoo, et al.
Published: (2024)
A Best-of-both-worlds Algorithm for Bandits with Delayed Feedback with Robustness to Excessive Delays
by: Masoudian, Saeed, et al.
Published: (2023)
by: Masoudian, Saeed, et al.
Published: (2023)
Rankability-enhanced Revenue Uplift Modeling Framework for Online Marketing
by: He, Bowei, et al.
Published: (2024)
by: He, Bowei, et al.
Published: (2024)
Revenue Maximization and Learning in Products Ranking
by: Chen, Ningyuan, et al.
Published: (2020)
by: Chen, Ningyuan, et al.
Published: (2020)
Differentiable Attenuation Filters for Feedback Delay Networks
by: Ibnyahya, Ilias, et al.
Published: (2025)
by: Ibnyahya, Ilias, et al.
Published: (2025)
Exploiting Curvature in Online Convex Optimization with Delayed Feedback
by: Qiu, Hao, et al.
Published: (2025)
by: Qiu, Hao, et al.
Published: (2025)
Improved Regret for Bandit Convex Optimization with Delayed Feedback
by: Wan, Yuanyu, et al.
Published: (2024)
by: Wan, Yuanyu, et al.
Published: (2024)
Online Nonsubmodular Optimization with Delayed Feedback in the Bandit Setting
by: Yang, Sifan, et al.
Published: (2025)
by: Yang, Sifan, et al.
Published: (2025)
Neural Contextual Bandits Under Delayed Feedback Constraints
by: Moghimi, Mohammadali, et al.
Published: (2025)
by: Moghimi, Mohammadali, et al.
Published: (2025)
Incentives in Private Collaborative Machine Learning
by: Sim, Rachael Hwee Ling, et al.
Published: (2024)
by: Sim, Rachael Hwee Ling, et al.
Published: (2024)
Improved Q-learning based Multi-hop Routing for UAV-Assisted Communication
by: Sharvari, N P, et al.
Published: (2024)
by: Sharvari, N P, et al.
Published: (2024)
Regularized Q-learning
by: Lim, Han-Dong, et al.
Published: (2022)
by: Lim, Han-Dong, et al.
Published: (2022)
Linear and Neural Dueling Bandits with Delayed Feedback
by: Wang, Xiangyi, et al.
Published: (2026)
by: Wang, Xiangyi, et al.
Published: (2026)
Improved Best-of-Both-Worlds Regret for Bandits with Delayed Feedback
by: Schlisselberg, Ofir, et al.
Published: (2025)
by: Schlisselberg, Ofir, et al.
Published: (2025)
Similar Items
-
A Single-Sample Polylogarithmic Regret Bound for Nonstationary Online Linear Programming
by: Xu, Haoran, et al.
Published: (2026) -
Distribution-Dependent Rates for Multi-Distribution Learning
by: Hanashiro, Rafael, et al.
Published: (2023) -
Multi-Timescale Primal Dual Hybrid Gradient with Application to Distributed Optimization
by: Zhang, Junhui, et al.
Published: (2025) -
Grace Period is All You Need: Individual Fairness without Revenue Loss in Revenue Management
by: Jaillet, Patrick, et al.
Published: (2024) -
Is Multi-Distribution Learning as Easy as PAC Learning: Sharp Rates with Bounded Label Noise
by: Hanashiro, Rafael, et al.
Published: (2026)