| Main Authors: | Shen, Chenglei, Zhan, Yi, Yu, Weijie, Zhang, Xiao, Xu, Jun |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2602.08067 |
Similar Items
IBCB: Efficient Inverse Batched Contextual Bandit for Behavioral Evolution History
by: Xu, Yi, et al.
Published: (2024)
Optimal Streaming Algorithms for Multi-Armed Bandits
by: Jin, Tianyuan, et al.
Published: (2024)
Enhancing Preference-based Linear Bandits via Human Response Time
by: Li, Shen, et al.
Published: (2024)
A Survey of Controllable Learning: Methods and Applications in Information Retrieval
by: Shen, Chenglei, et al.
Published: (2024)
COURIER: Contrastive User Intention Reconstruction for Large-Scale Visual Recommendation
by: Yang, Jia-Qi, et al.
Published: (2023)
When Personalization Misleads: Understanding and Mitigating Hallucinations in Personalized LLMs
by: Sun, Zhongxiang, et al.
Published: (2026)
GenRecEdit: Adapting Model Editing for Generative Recommendation with Cold-Start Items
by: Shen, Chenglei, et al.
Published: (2026)
Tweedie Regression for Video Recommendation System
by: Zheng, Yan, et al.
Published: (2025)
Contextual Bandit with Herding Effects: Algorithms and Recommendation Applications
by: Xu, Luyue, et al.
Published: (2024)
Multi-User Contextual Cascading Bandits for Personalized Recommendation
by: Park, Jiho, et al.
Published: (2025)
Preference-centric Bandits: Optimality of Mixtures and Regret-efficient Algorithms
by: Tatlı, Meltem, et al.
Published: (2025)
Latent Preference Bandits
by: Mwai, Newton, et al.
Published: (2025)
Unlocking Reasoning Capabilities in LLMs via Reinforcement Learning Exploration
by: Deng, Wenhao, et al.
Published: (2025)
Dynamic User Interest Augmentation via Stream Clustering and Memory Networks in Large-Scale Recommender Systems
by: Liu, Peng, et al.
Published: (2024)
Adapting Job Recommendations to User Preference Drift with Behavioral-Semantic Fusion Learning
by: Han, Xiao, et al.
Published: (2024)
Provably Efficient Multi-Objective Bandit Algorithms under Preference-Centric Customization
by: Cao, Linfeng, et al.
Published: (2025)
Wasserstein Distributionally Robust Policy Evaluation and Learning for Contextual Bandits
by: Shen, Yi, et al.
Published: (2023)
The Bandit's Blind Spot: The Critical Role of User State Representation in Recommender Systems
by: Pires, Pedro R., et al.
Published: (2026)
Bayesian Bandit Algorithms with Approximate Inference in Stochastic Linear Bandits
by: Huang, Ziyi, et al.
Published: (2024)
The Nah Bandit: Modeling User Non-compliance in Recommendation Systems
by: Zhou, Tianyue, et al.
Published: (2024)
On the Algorithmic Bias of Aligning Large Language Models with RLHF: Preference Collapse and Matching Regularization
by: Xiao, Jiancong, et al.
Published: (2024)
Queueing Matching Bandits with Preference Feedback
by: Kim, Jung-hun, et al.
Published: (2024)
Modeling User Preferences as Distributions for Optimal Transport-Based Cross-Domain Recommendation under Non-Overlapping Settings
by: Xiao, Ziyin, et al.
Published: (2025)
Learning to Route and Schedule LLMs from User Retrials via Contextual Queueing Bandits
by: Bae, Seoungbin, et al.
Published: (2026)
Save, Revisit, Retain: A Scalable Framework for Enhancing User Retention in Large-Scale Recommender Systems
by: Jiang, Weijie, et al.
Published: (2025)
Optimal and Practical Batched Linear Bandit Algorithm
by: Yu, Sanghoon, et al.
Published: (2025)
FLDmamba: Integrating Fourier and Laplace Transform Decomposition with Mamba for Enhanced Time Series Prediction
by: Zhang, Qianru, et al.
Published: (2025)
Algorithmic Assistance with Recommendation-Dependent Preferences
by: McLaughlin, Bryce, et al.
Published: (2022)
MoRE: A Mixture of Reflectors Framework for Large Language Model-Based Sequential Recommendation
by: Qin, Weicong, et al.
Published: (2024)
Separating and Learning Latent Confounders to Enhancing User Preferences Modeling
by: Xu, Hangtong, et al.
Published: (2023)
BanditSpec: Adaptive Speculative Decoding via Bandit Algorithms
by: Hou, Yunlong, et al.
Published: (2025)
Calibrated Recommendations with Contextual Bandits
by: Feijer, Diego, et al.
Published: (2025)
Quantum-Enhanced Neural Contextual Bandit Algorithms
by: Huang, Yuqi, et al.
Published: (2026)
Artificial Replay: A Meta-Algorithm for Harnessing Historical Data in Bandits
by: Banerjee, Siddhartha, et al.
Published: (2022)
Harm Mitigation in Recommender Systems under User Preference Dynamics
by: Chee, Jerry, et al.
Published: (2024)
Fine-Tuning Attention Modules Only: Enhancing Weight Disentanglement in Task Arithmetic
by: Jin, Ruochen, et al.
Published: (2024)
Efficient Algorithms for Logistic Contextual Slate Bandits with Bandit Feedback
by: Goyal, Tanmay, et al.
Published: (2025)
Linear Bandits on Ellipsoids: Minimax Optimal Algorithms
by: Zhang, Raymond, et al.
Published: (2025)
Aligning LLMs by Predicting Preferences from User Writing Samples
by: Aroca-Ouellette, Stéphane, et al.
Published: (2025)
Efficient and Interpretable Bandit Algorithms
by: Mukherjee, Subhojyoti, et al.
Published: (2023)