:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Shen, Chenglei, Zhan, Yi, Yu, Weijie, Zhang, Xiao, Xu, Jun
Format:	Preprint
Published:	2026
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2602.08067
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

IBCB: Efficient Inverse Batched Contextual Bandit for Behavioral Evolution History
by: Xu, Yi, et al.
Published: (2024)

Optimal Streaming Algorithms for Multi-Armed Bandits
by: Jin, Tianyuan, et al.
Published: (2024)

Enhancing Preference-based Linear Bandits via Human Response Time
by: Li, Shen, et al.
Published: (2024)

A Survey of Controllable Learning: Methods and Applications in Information Retrieval
by: Shen, Chenglei, et al.
Published: (2024)

COURIER: Contrastive User Intention Reconstruction for Large-Scale Visual Recommendation
by: Yang, Jia-Qi, et al.
Published: (2023)

When Personalization Misleads: Understanding and Mitigating Hallucinations in Personalized LLMs
by: Sun, Zhongxiang, et al.
Published: (2026)

GenRecEdit: Adapting Model Editing for Generative Recommendation with Cold-Start Items
by: Shen, Chenglei, et al.
Published: (2026)

Tweedie Regression for Video Recommendation System
by: Zheng, Yan, et al.
Published: (2025)

Contextual Bandit with Herding Effects: Algorithms and Recommendation Applications
by: Xu, Luyue, et al.
Published: (2024)

Multi-User Contextual Cascading Bandits for Personalized Recommendation
by: Park, Jiho, et al.
Published: (2025)

Preference-centric Bandits: Optimality of Mixtures and Regret-efficient Algorithms
by: Tatlı, Meltem, et al.
Published: (2025)

Latent Preference Bandits
by: Mwai, Newton, et al.
Published: (2025)

Unlocking Reasoning Capabilities in LLMs via Reinforcement Learning Exploration
by: Deng, Wenhao, et al.
Published: (2025)

Dynamic User Interest Augmentation via Stream Clustering and Memory Networks in Large-Scale Recommender Systems
by: Liu, Peng, et al.
Published: (2024)

Adapting Job Recommendations to User Preference Drift with Behavioral-Semantic Fusion Learning
by: Han, Xiao, et al.
Published: (2024)

Provably Efficient Multi-Objective Bandit Algorithms under Preference-Centric Customization
by: Cao, Linfeng, et al.
Published: (2025)

Wasserstein Distributionally Robust Policy Evaluation and Learning for Contextual Bandits
by: Shen, Yi, et al.
Published: (2023)

The Bandit's Blind Spot: The Critical Role of User State Representation in Recommender Systems
by: Pires, Pedro R., et al.
Published: (2026)

Bayesian Bandit Algorithms with Approximate Inference in Stochastic Linear Bandits
by: Huang, Ziyi, et al.
Published: (2024)

The Nah Bandit: Modeling User Non-compliance in Recommendation Systems
by: Zhou, Tianyue, et al.
Published: (2024)

On the Algorithmic Bias of Aligning Large Language Models with RLHF: Preference Collapse and Matching Regularization
by: Xiao, Jiancong, et al.
Published: (2024)

Queueing Matching Bandits with Preference Feedback
by: Kim, Jung-hun, et al.
Published: (2024)

Modeling User Preferences as Distributions for Optimal Transport-Based Cross-Domain Recommendation under Non-Overlapping Settings
by: Xiao, Ziyin, et al.
Published: (2025)

Learning to Route and Schedule LLMs from User Retrials via Contextual Queueing Bandits
by: Bae, Seoungbin, et al.
Published: (2026)

Save, Revisit, Retain: A Scalable Framework for Enhancing User Retention in Large-Scale Recommender Systems
by: Jiang, Weijie, et al.
Published: (2025)

Optimal and Practical Batched Linear Bandit Algorithm
by: Yu, Sanghoon, et al.
Published: (2025)

FLDmamba: Integrating Fourier and Laplace Transform Decomposition with Mamba for Enhanced Time Series Prediction
by: Zhang, Qianru, et al.
Published: (2025)

Algorithmic Assistance with Recommendation-Dependent Preferences
by: McLaughlin, Bryce, et al.
Published: (2022)

MoRE: A Mixture of Reflectors Framework for Large Language Model-Based Sequential Recommendation
by: Qin, Weicong, et al.
Published: (2024)

Separating and Learning Latent Confounders to Enhancing User Preferences Modeling
by: Xu, Hangtong, et al.
Published: (2023)

BanditSpec: Adaptive Speculative Decoding via Bandit Algorithms
by: Hou, Yunlong, et al.
Published: (2025)

Calibrated Recommendations with Contextual Bandits
by: Feijer, Diego, et al.
Published: (2025)

Quantum-Enhanced Neural Contextual Bandit Algorithms
by: Huang, Yuqi, et al.
Published: (2026)

Artificial Replay: A Meta-Algorithm for Harnessing Historical Data in Bandits
by: Banerjee, Siddhartha, et al.
Published: (2022)

Harm Mitigation in Recommender Systems under User Preference Dynamics
by: Chee, Jerry, et al.
Published: (2024)

Fine-Tuning Attention Modules Only: Enhancing Weight Disentanglement in Task Arithmetic
by: Jin, Ruochen, et al.
Published: (2024)

Efficient Algorithms for Logistic Contextual Slate Bandits with Bandit Feedback
by: Goyal, Tanmay, et al.
Published: (2025)

Linear Bandits on Ellipsoids: Minimax Optimal Algorithms
by: Zhang, Raymond, et al.
Published: (2025)

Aligning LLMs by Predicting Preferences from User Writing Samples
by: Aroca-Ouellette, Stéphane, et al.
Published: (2025)

Efficient and Interpretable Bandit Algorithms
by: Mukherjee, Subhojyoti, et al.
Published: (2023)