Bibliographic Details
Main Authors: Feng, Yuming; Jiang, Xinrui
Format: Preprint
Published: 2025
Subjects: Computation and Language
Online Access: https://arxiv.org/abs/2512.11755
author Feng, Yuming
Jiang, Xinrui
contents Online product reviews contain rich but noisy signals that overwhelm users and hinder effective decision-making. Existing LLM-based summarizers remain generic and fail to account for individual preferences, limiting their practical utility. We propose SUMFORU, a steerable review summarization framework that aligns outputs with explicit user personas to support personalized purchase decisions. Our approach integrates a high-quality data pipeline built from the Amazon 2023 Review Dataset with a two-stage alignment procedure: (1) persona-aware Supervised Fine-Tuning (SFT) via asymmetric knowledge distillation, and (2) Reinforcement Learning with AI Feedback (RLAIF) using a preference estimator to capture fine-grained, persona-relevant signals. We evaluate the model across rule-based, LLM-based, and human-centered metrics, demonstrating consistent improvements in consistency, grounding, and preference alignment. Our framework achieves the highest performance across all evaluation settings and generalizes effectively to unseen product categories. Our results highlight the promise of steerable pluralistic alignment for building next-generation personalized decision-support systems.
format Preprint
id arxiv_https___arxiv_org_abs_2512_11755
institution arXiv
publishDate 2025
record_format arxiv
title SUMFORU: An LLM-Based Review Summarization Framework for Personalized Purchase Decision Support
topic Computation and Language
url https://arxiv.org/abs/2512.11755