:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Shen, Owen, Jaillet, Patrick
Format:	Preprint
Published:	2026
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2602.02283
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

A Single-Sample Polylogarithmic Regret Bound for Nonstationary Online Linear Programming
by: Xu, Haoran, et al.
Published: (2026)

Distribution-Dependent Rates for Multi-Distribution Learning
by: Hanashiro, Rafael, et al.
Published: (2023)

Multi-Timescale Primal Dual Hybrid Gradient with Application to Distributed Optimization
by: Zhang, Junhui, et al.
Published: (2025)

Grace Period is All You Need: Individual Fairness without Revenue Loss in Revenue Management
by: Jaillet, Patrick, et al.
Published: (2024)

Is Multi-Distribution Learning as Easy as PAC Learning: Sharp Rates with Bounded Label Noise
by: Hanashiro, Rafael, et al.
Published: (2026)

Neural Dueling Bandits: Preference-Based Optimization with Human Feedback
by: Verma, Arun, et al.
Published: (2024)

Reinforcement Learning for Intensity Control: An Application to Choice-Based Network Revenue Management
by: Meng, Huiling, et al.
Published: (2024)

Prompt Optimization with Human Feedback
by: Lin, Xiaoqiang, et al.
Published: (2024)

Incentive-Aware Dynamic Resource Allocation under Long-Term Cost Constraints
by: Dai, Yan, et al.
Published: (2025)

Online Resource Allocation with Convex-set Machine-Learned Advice
by: Golrezaei, Negin, et al.
Published: (2023)

Dynamic Retail Pricing via Q-Learning -- A Reinforcement Learning Framework for Enhanced Revenue Management
by: Apte, Mohit, et al.
Published: (2024)

Learning with Exact Invariances in Polynomial Time
by: Soleymani, Ashkan, et al.
Published: (2025)

A Universal Class of Sharpness-Aware Minimization Algorithms
by: Tahmasebi, Behrooz, et al.
Published: (2024)

Double Machine Learning Based Structure Identification from Temporal Data
by: Angelis, Emmanouil, et al.
Published: (2023)

Data-Driven Revenue Management for Air Cargo
by: Eren, Ezgi, et al.
Published: (2024)

Budgeted Recommendation with Delayed Feedback
by: Liu, Kweiguu, et al.
Published: (2024)

Demand Balancing in Primal-Dual Optimization for Blind Network Revenue Management
by: Miao, Sentao, et al.
Published: (2024)

Learning with Posterior Sampling for Revenue Management under Time-varying Demand
by: Shimizu, Kazuma, et al.
Published: (2024)

DelayPTC-LLM: Metro Passenger Travel Choice Prediction under Train Delays with Large Language Models
by: Chen, Chen, et al.
Published: (2024)

Delayed Feedback Modeling with Influence Functions
by: Ding, Chenlu, et al.
Published: (2025)

Degeneracy is OK: Logarithmic Regret for Network Revenue Management with Indiscrete Distributions
by: Jiang, Jiashuo, et al.
Published: (2022)

Lipschitz Bandits with Stochastic Delayed Feedback
by: Liu, Zhongxuan, et al.
Published: (2025)

Modeling Attention during Dimensional Shifts with Counterfactual and Delayed Feedback
by: Malloy, Tailia, et al.
Published: (2025)

Movie Revenue Prediction using Machine Learning Models
by: Udandarao, Vikranth, et al.
Published: (2024)

Online Scheduling for LLM Inference with KV Cache Constraints
by: Jaillet, Patrick, et al.
Published: (2025)

Bandit and Delayed Feedback in Online Structured Prediction
by: Shibukawa, Yuki, et al.
Published: (2025)

Biased Dueling Bandits with Stochastic Delayed Feedback
by: Yi, Bongsoo, et al.
Published: (2024)

A Best-of-both-worlds Algorithm for Bandits with Delayed Feedback with Robustness to Excessive Delays
by: Masoudian, Saeed, et al.
Published: (2023)

Rankability-enhanced Revenue Uplift Modeling Framework for Online Marketing
by: He, Bowei, et al.
Published: (2024)

Revenue Maximization and Learning in Products Ranking
by: Chen, Ningyuan, et al.
Published: (2020)

Differentiable Attenuation Filters for Feedback Delay Networks
by: Ibnyahya, Ilias, et al.
Published: (2025)

Exploiting Curvature in Online Convex Optimization with Delayed Feedback
by: Qiu, Hao, et al.
Published: (2025)

Improved Regret for Bandit Convex Optimization with Delayed Feedback
by: Wan, Yuanyu, et al.
Published: (2024)

Online Nonsubmodular Optimization with Delayed Feedback in the Bandit Setting
by: Yang, Sifan, et al.
Published: (2025)

Neural Contextual Bandits Under Delayed Feedback Constraints
by: Moghimi, Mohammadali, et al.
Published: (2025)

Incentives in Private Collaborative Machine Learning
by: Sim, Rachael Hwee Ling, et al.
Published: (2024)

Improved Q-learning based Multi-hop Routing for UAV-Assisted Communication
by: Sharvari, N P, et al.
Published: (2024)

Regularized Q-learning
by: Lim, Han-Dong, et al.
Published: (2022)

Linear and Neural Dueling Bandits with Delayed Feedback
by: Wang, Xiangyi, et al.
Published: (2026)

Improved Best-of-Both-Worlds Regret for Bandits with Delayed Feedback
by: Schlisselberg, Ofir, et al.
Published: (2025)