Staff View: Fine-grained List-wise Alignment for Generative Medication Recommendation :: Library Catalog

Bibliographic Details
Main Authors:	Fan, Chenxiao; Gao, Chongming; Shi, Wentao; Gong, Yaxin; Zhao, Zihao; Feng, Fuli
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2505.20218

_version_	1866918508369018880
author	Fan, Chenxiao; Gao, Chongming; Shi, Wentao; Gong, Yaxin; Zhao, Zihao; Feng, Fuli
author_facet	Fan, Chenxiao; Gao, Chongming; Shi, Wentao; Gong, Yaxin; Zhao, Zihao; Feng, Fuli
contents	Accurate and safe medication recommendations are critical for effective clinical decision-making, especially in multimorbidity cases. However, existing systems rely on point-wise prediction paradigms that overlook synergistic drug effects and potential adverse drug-drug interactions (DDIs). We propose FLAME, a fine-grained list-wise alignment framework for large language models (LLMs), enabling drug-by-drug generation of drug lists. FLAME formulates recommendation as a sequential decision process, where each step adds or removes a single drug. To provide fine-grained learning signals, we devise step-wise Group Relative Policy Optimization (GRPO) with potential-based reward shaping, which explicitly models DDIs and optimizes the contribution of each drug to the overall prescription. Furthermore, FLAME enhances patient modeling by integrating structured clinical knowledge and collaborative information into the representation space of LLMs. Experiments on benchmark datasets demonstrate that FLAME achieves state-of-the-art performance, delivering superior accuracy, controllable safety-accuracy trade-offs, and strong generalization across diverse clinical scenarios. Our code is available at https://github.com/cxfann/Flame.
format	Preprint
id	arxiv_https___arxiv_org_abs_2505_20218
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Fine-grained List-wise Alignment for Generative Medication Recommendation Fan, Chenxiao Gao, Chongming Shi, Wentao Gong, Yaxin Zhao, Zihao Feng, Fuli Machine Learning Accurate and safe medication recommendations are critical for effective clinical decision-making, especially in multimorbidity cases. However, existing systems rely on point-wise prediction paradigms that overlook synergistic drug effects and potential adverse drug-drug interactions (DDIs). We propose FLAME, a fine-grained list-wise alignment framework for large language models (LLMs), enabling drug-by-drug generation of drug lists. FLAME formulates recommendation as a sequential decision process, where each step adds or removes a single drug. To provide fine-grained learning signals, we devise step-wise Group Relative Policy Optimization (GRPO) with potential-based reward shaping, which explicitly models DDIs and optimizes the contribution of each drug to the overall prescription. Furthermore, FLAME enhances patient modeling by integrating structured clinical knowledge and collaborative information into the representation space of LLMs. Experiments on benchmark datasets demonstrate that FLAME achieves state-of-the-art performance, delivering superior accuracy, controllable safety-accuracy trade-offs, and strong generalization across diverse clinical scenarios. Our code is available at https://github.com/cxfann/Flame.
title	Fine-grained List-wise Alignment for Generative Medication Recommendation
topic	Machine Learning
url	https://arxiv.org/abs/2505.20218

Similar Items