:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Zhao, Zihao, Jing, Yi, Feng, Fuli, Wu, Jiancan, Gao, Chongming, He, Xiangnan
Format:	Preprint
Published:	2024
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2403.17745
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Fine-grained List-wise Alignment for Generative Medication Recommendation
by: Fan, Chenxiao, et al.
Published: (2025)

Large Language Models are Learnable Planners for Long-Term Recommendation
by: Shi, Wentao, et al.
Published: (2024)

Fine-grained Alignment of Large Language Models for General Medication Recommendation without Overprescription
by: Zhao, Zihao, et al.
Published: (2025)

Reinforced Prompt Personalization for Recommendation with Large Language Models
by: Mao, Wenyu, et al.
Published: (2024)

Fair Recommendations with Limited Sensitive Attributes: A Distributionally Robust Optimization Approach
by: Shi, Tianhao, et al.
Published: (2024)

Lower-Left Partial AUC: An Effective and Efficient Optimization Metric for Recommendation
by: Shi, Wentao, et al.
Published: (2024)

Uncertainty-aware Generative Recommendation
by: Fan, Chenxiao, et al.
Published: (2026)

Quantile Advantage Estimation: Stabilizing RLVR for LLM Reasoning
by: Wu, Junkang, et al.
Published: (2025)

Leave No One Behind: Online Self-Supervised Self-Distillation for Sequential Recommendation
by: Wei, Shaowei, et al.
Published: (2024)

$β$-DPO: Direct Preference Optimization with Dynamic $β$
by: Wu, Junkang, et al.
Published: (2024)

RePO: Understanding Preference Learning Through ReLU-Based Optimization
by: Wu, Junkang, et al.
Published: (2025)

Agentic Feedback Loop Modeling Improves Recommendation and User Simulation
by: Cai, Shihao, et al.
Published: (2024)

AlphaDPO: Adaptive Reward Margin for Direct Preference Optimization
by: Wu, Junkang, et al.
Published: (2024)

MLLMEraser: Achieving Test-Time Unlearning in Multimodal Large Language Models through Activation Steering
by: Ding, Chenlu, et al.
Published: (2025)

Be Aware of the Neighborhood Effect: Modeling Selection Bias under Interference
by: Li, Haoxuan, et al.
Published: (2024)

Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization
by: Wu, Junkang, et al.
Published: (2024)

Less is More: Improving LLM Alignment via Preference Data Selection
by: Deng, Xun, et al.
Published: (2025)

Larger or Smaller Reward Margins to Select Preferences for Alignment?
by: Huang, Kexin, et al.
Published: (2025)

Unified Parameter-Efficient Unlearning for LLMs
by: Ding, Chenlu, et al.
Published: (2024)

Dynamic Sparse Learning: A Novel Paradigm for Efficient Recommendation
by: Wang, Shuyao, et al.
Published: (2024)

A3S: A General Active Clustering Method with Pairwise Constraints
by: Deng, Xun, et al.
Published: (2024)

Teaching LLM to Reason: Reinforcement Learning from Algorithmic Problems without Code
by: Bao, Keqin, et al.
Published: (2025)

Breaking User-Centric Agency: A Tri-Party Framework for Agent-Based Recommendation
by: Gong, Yaxin, et al.
Published: (2026)

ARMR: Adaptively Responsive Network for Medication Recommendation
by: Wu, Feiyue, et al.
Published: (2025)

Position-aware Graph Transformer for Recommendation
by: Chen, Jiajia, et al.
Published: (2024)

Debiased Recommendation with Noisy Feedback
by: Li, Haoxuan, et al.
Published: (2024)

Generative Multi-Target Cross-Domain Recommendation
by: Jin, Jinqiu, et al.
Published: (2025)

Beyond Static Best-of-N: Bayesian List-wise Alignment for LLM-based Recommendation
by: Chen, Ruijun, et al.
Published: (2026)

EasyRL4Rec: An Easy-to-use Library for Reinforcement Learning Based Recommender Systems
by: Yu, Yuanqing, et al.
Published: (2024)

Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention
by: Munkhdalai, Tsendsuren, et al.
Published: (2024)

CausalMed: Causality-Based Personalized Medication Recommendation Centered on Patient health state
by: Li, Xiang, et al.
Published: (2024)

Process-Supervised LLM Recommenders via Flow-guided Tuning
by: Gao, Chongming, et al.
Published: (2025)

On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation
by: Huang, Kexin, et al.
Published: (2026)

Alleviating Structural Distribution Shift in Graph Anomaly Detection
by: Gao, Yuan, et al.
Published: (2024)

Medical Reasoning with Large Language Models: A Survey and MR-Bench
by: Ren, Xiaohan, et al.
Published: (2026)

Medical Reasoning With Large Language Models: A Systematic Review and Evaluation
by: Xiaohan Ren, et al.
Published: (2026)

Large Language Model Distilling Medication Recommendation Model
by: Liu, Qidong, et al.
Published: (2024)

SPRec: Self-Play to Debias LLM-based Recommendation
by: Gao, Chongming, et al.
Published: (2024)

On the Maximal Local Disparity of Fairness-Aware Classifiers
by: Jin, Jinqiu, et al.
Published: (2024)

Leveraging LLMs for Influence Path Planning in Proactive Recommendation
by: Wang, Mingze, et al.
Published: (2024)