Saved in:
| Main Authors: | Zhao, Zihao, Jing, Yi, Feng, Fuli, Wu, Jiancan, Gao, Chongming, He, Xiangnan |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2403.17745 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Fine-grained List-wise Alignment for Generative Medication Recommendation
by: Fan, Chenxiao, et al.
Published: (2025)
by: Fan, Chenxiao, et al.
Published: (2025)
Large Language Models are Learnable Planners for Long-Term Recommendation
by: Shi, Wentao, et al.
Published: (2024)
by: Shi, Wentao, et al.
Published: (2024)
Fine-grained Alignment of Large Language Models for General Medication Recommendation without Overprescription
by: Zhao, Zihao, et al.
Published: (2025)
by: Zhao, Zihao, et al.
Published: (2025)
Reinforced Prompt Personalization for Recommendation with Large Language Models
by: Mao, Wenyu, et al.
Published: (2024)
by: Mao, Wenyu, et al.
Published: (2024)
Fair Recommendations with Limited Sensitive Attributes: A Distributionally Robust Optimization Approach
by: Shi, Tianhao, et al.
Published: (2024)
by: Shi, Tianhao, et al.
Published: (2024)
Lower-Left Partial AUC: An Effective and Efficient Optimization Metric for Recommendation
by: Shi, Wentao, et al.
Published: (2024)
by: Shi, Wentao, et al.
Published: (2024)
Uncertainty-aware Generative Recommendation
by: Fan, Chenxiao, et al.
Published: (2026)
by: Fan, Chenxiao, et al.
Published: (2026)
Quantile Advantage Estimation: Stabilizing RLVR for LLM Reasoning
by: Wu, Junkang, et al.
Published: (2025)
by: Wu, Junkang, et al.
Published: (2025)
Leave No One Behind: Online Self-Supervised Self-Distillation for Sequential Recommendation
by: Wei, Shaowei, et al.
Published: (2024)
by: Wei, Shaowei, et al.
Published: (2024)
$β$-DPO: Direct Preference Optimization with Dynamic $β$
by: Wu, Junkang, et al.
Published: (2024)
by: Wu, Junkang, et al.
Published: (2024)
RePO: Understanding Preference Learning Through ReLU-Based Optimization
by: Wu, Junkang, et al.
Published: (2025)
by: Wu, Junkang, et al.
Published: (2025)
Agentic Feedback Loop Modeling Improves Recommendation and User Simulation
by: Cai, Shihao, et al.
Published: (2024)
by: Cai, Shihao, et al.
Published: (2024)
AlphaDPO: Adaptive Reward Margin for Direct Preference Optimization
by: Wu, Junkang, et al.
Published: (2024)
by: Wu, Junkang, et al.
Published: (2024)
MLLMEraser: Achieving Test-Time Unlearning in Multimodal Large Language Models through Activation Steering
by: Ding, Chenlu, et al.
Published: (2025)
by: Ding, Chenlu, et al.
Published: (2025)
Be Aware of the Neighborhood Effect: Modeling Selection Bias under Interference
by: Li, Haoxuan, et al.
Published: (2024)
by: Li, Haoxuan, et al.
Published: (2024)
Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization
by: Wu, Junkang, et al.
Published: (2024)
by: Wu, Junkang, et al.
Published: (2024)
Less is More: Improving LLM Alignment via Preference Data Selection
by: Deng, Xun, et al.
Published: (2025)
by: Deng, Xun, et al.
Published: (2025)
Larger or Smaller Reward Margins to Select Preferences for Alignment?
by: Huang, Kexin, et al.
Published: (2025)
by: Huang, Kexin, et al.
Published: (2025)
Unified Parameter-Efficient Unlearning for LLMs
by: Ding, Chenlu, et al.
Published: (2024)
by: Ding, Chenlu, et al.
Published: (2024)
Dynamic Sparse Learning: A Novel Paradigm for Efficient Recommendation
by: Wang, Shuyao, et al.
Published: (2024)
by: Wang, Shuyao, et al.
Published: (2024)
A3S: A General Active Clustering Method with Pairwise Constraints
by: Deng, Xun, et al.
Published: (2024)
by: Deng, Xun, et al.
Published: (2024)
Teaching LLM to Reason: Reinforcement Learning from Algorithmic Problems without Code
by: Bao, Keqin, et al.
Published: (2025)
by: Bao, Keqin, et al.
Published: (2025)
Breaking User-Centric Agency: A Tri-Party Framework for Agent-Based Recommendation
by: Gong, Yaxin, et al.
Published: (2026)
by: Gong, Yaxin, et al.
Published: (2026)
ARMR: Adaptively Responsive Network for Medication Recommendation
by: Wu, Feiyue, et al.
Published: (2025)
by: Wu, Feiyue, et al.
Published: (2025)
Position-aware Graph Transformer for Recommendation
by: Chen, Jiajia, et al.
Published: (2024)
by: Chen, Jiajia, et al.
Published: (2024)
Debiased Recommendation with Noisy Feedback
by: Li, Haoxuan, et al.
Published: (2024)
by: Li, Haoxuan, et al.
Published: (2024)
Generative Multi-Target Cross-Domain Recommendation
by: Jin, Jinqiu, et al.
Published: (2025)
by: Jin, Jinqiu, et al.
Published: (2025)
Beyond Static Best-of-N: Bayesian List-wise Alignment for LLM-based Recommendation
by: Chen, Ruijun, et al.
Published: (2026)
by: Chen, Ruijun, et al.
Published: (2026)
EasyRL4Rec: An Easy-to-use Library for Reinforcement Learning Based Recommender Systems
by: Yu, Yuanqing, et al.
Published: (2024)
by: Yu, Yuanqing, et al.
Published: (2024)
Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention
by: Munkhdalai, Tsendsuren, et al.
Published: (2024)
by: Munkhdalai, Tsendsuren, et al.
Published: (2024)
CausalMed: Causality-Based Personalized Medication Recommendation Centered on Patient health state
by: Li, Xiang, et al.
Published: (2024)
by: Li, Xiang, et al.
Published: (2024)
Process-Supervised LLM Recommenders via Flow-guided Tuning
by: Gao, Chongming, et al.
Published: (2025)
by: Gao, Chongming, et al.
Published: (2025)
On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation
by: Huang, Kexin, et al.
Published: (2026)
by: Huang, Kexin, et al.
Published: (2026)
Alleviating Structural Distribution Shift in Graph Anomaly Detection
by: Gao, Yuan, et al.
Published: (2024)
by: Gao, Yuan, et al.
Published: (2024)
Medical Reasoning with Large Language Models: A Survey and MR-Bench
by: Ren, Xiaohan, et al.
Published: (2026)
by: Ren, Xiaohan, et al.
Published: (2026)
Medical Reasoning With Large Language Models: A Systematic Review and Evaluation
by: Xiaohan Ren, et al.
Published: (2026)
by: Xiaohan Ren, et al.
Published: (2026)
Large Language Model Distilling Medication Recommendation Model
by: Liu, Qidong, et al.
Published: (2024)
by: Liu, Qidong, et al.
Published: (2024)
SPRec: Self-Play to Debias LLM-based Recommendation
by: Gao, Chongming, et al.
Published: (2024)
by: Gao, Chongming, et al.
Published: (2024)
On the Maximal Local Disparity of Fairness-Aware Classifiers
by: Jin, Jinqiu, et al.
Published: (2024)
by: Jin, Jinqiu, et al.
Published: (2024)
Leveraging LLMs for Influence Path Planning in Proactive Recommendation
by: Wang, Mingze, et al.
Published: (2024)
by: Wang, Mingze, et al.
Published: (2024)
Similar Items
-
Fine-grained List-wise Alignment for Generative Medication Recommendation
by: Fan, Chenxiao, et al.
Published: (2025) -
Large Language Models are Learnable Planners for Long-Term Recommendation
by: Shi, Wentao, et al.
Published: (2024) -
Fine-grained Alignment of Large Language Models for General Medication Recommendation without Overprescription
by: Zhao, Zihao, et al.
Published: (2025) -
Reinforced Prompt Personalization for Recommendation with Large Language Models
by: Mao, Wenyu, et al.
Published: (2024) -
Fair Recommendations with Limited Sensitive Attributes: A Distributionally Robust Optimization Approach
by: Shi, Tianhao, et al.
Published: (2024)